BestBuy RDF data — SPARQL access
Recently Best Buy made product and other information available from download, read about it here. I downloaded some of the data and made it available for SPARQL queries. Some of the tasks I have performed with the data.
- Data is made available as a series of follows with little or no description of what he breakdown is.
- I downloaded the first 20,000 rdf descriptions and 14610 of them had errors
- I proceeded to load the remaining 5390 of them into the OpenRDF(formerly Sesame) engine
- Deciding if I should write a general process to take the data directly from the sitemap files and download it directly into the triple store.
- Why is some of this data so old?
- Will Best Buy make updates available?
- What will Best Buy’s policy be on downloads?
This is an interesting trend to follow especially if the following happens.
- Ebay makes data available in some RDF like format
- Amazon makes data available in some RDF like format
- NetFlix makes data available in some RDF like format
- More applications that are not basically FOAF or social networking based
- Valuable technical data like this something simple like you go to the support market and buy some groceries, as part of your credit card service you get an rdf presentation of what you bought so you can look at your purchases from dimensions that interest you food composition, quality, calories, cost, environmental impact and so on.
The current trend of creating mashups and APIs is a yet another tower of Babel in the comptuer industry. Each individual application is usually very nice but when you want to have a more global scope it doesn’t work very well. General adoptions of RDF would provide a common language an shft the problem to solving some more tractable problems like.
- Vocabulary differences
- SPARQL performance and usage
Which I believe to be more tractable problems that trying to get all the Different apis to work together. Having a different mashup for each site that doesn’t work with other sites is not an internet I look forward to. Besides most of the data already exists in DB2, MySQL and Oracle databases making it available as RDF for corporate customers is a useful and not trivial service that would allow websites to instantly participate int eh Semantic Web experiment.
Some of the more relevant links, also repeated at GoodRelations, a promising application of Semantic web to commerce, products and services
I have quite a few experiments planned the simpler ones are using Tabulator, Operator and OpenRDF.
