FLOPPIES: A Framework for Large-Scale Ontology Population of Product Information from Tabular Data in E-commerce Stores

Autor: Flavius Frasincar, Damir Vandic, Lennart J. Nederstigt, Steven S. Aanen
Přispěvatelé: Econometrics
Jazyk: angličtina
Rok vydání: 2014
Předmět:
Zdroj: Decision Support Systems, 59, 296-311. Elsevier
ISSN: 0167-9236
Popis: With the vast amount of information available on the Web, there is an urgent need to structure Web data in order to make it available to both users and machines. E-commerce is one of the areas in which growing data congestion on the Web impedes data accessibility. This paper proposes FLOPPIES, a framework capable of semi-automatic ontology population of tabular product information from Web stores. By formalizing product information in an ontology, better product comparison or parametric search applications can be built, using the semantics of product attributes and their corresponding values. The framework employs both lexical and pattern matching for classifying products, mapping properties, and instantiating values. It is shown that the performance on instantiating TVs and MP3 players from Best Buy and Newegg.com looks promising, achieving an F^1-measure of approximately 77%.
Databáze: OpenAIRE