跳到主要导航 跳到搜索 跳到主要内容

A hybrid framework for product normalization n online shopping

  • East China Normal University
  • Fudan University

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

The explosive growth of products in both variety and quantity is an obvious evidence for the booming of C2C (Customer-to-Customer) E-commerce. Product normalization, which determines whether products are referring to the same underlying entity, is a fundamental task of data management in C2C market. However, product normalization in C2C market is challenging because the data is noisy and lacks a uniform schema. In this paper, we propose a hybrid framework, which achieves product normalization by the schema integration and data cleaning. In the framework, a graph-based method was proposed to integrate the schema. The missing data was filled and the incorrect data was repaired by using the evidence extracted from surrounding information, such as the title and textual description. We distinguish products by clustering on the product similarity matrix which is learned through logistic regression. We conduct experiments on the real-world data and the experimental results confirm the effectiveness of our design by comparing with the existing methods.

源语言英语
主期刊名Database Systems for Advanced Applications - 18th International Conference, DASFAA 2013, Proceedings
370-384
页数15
版本PART 2
DOI
出版状态已出版 - 2013
活动18th International Conference on Database Systems for Advanced Applications, DASFAA 2013 - Wuhan, 中国
期限: 22 4月 201325 4月 2013

出版系列

姓名Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
编号PART 2
7826 LNCS
ISSN(印刷版)0302-9743
ISSN(电子版)1611-3349

会议

会议18th International Conference on Database Systems for Advanced Applications, DASFAA 2013
国家/地区中国
Wuhan
时期22/04/1325/04/13

指纹

探究 'A hybrid framework for product normalization n online shopping' 的科研主题。它们共同构成独一无二的指纹。

引用此