跳到主要导航 跳到搜索 跳到主要内容

An NER-based product identification and lucene-based product linking approach to CPROD1 challenge: Description of submission system to CPROD1 Challenge

  • Zhiqiang Toh*
  • , Wenting Wang
  • , Man Lan
  • , Xiaoli Li
  • *此作品的通讯作者

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

This paper presents our methodology for CPROD1 Challenge, which is to identify the product mentions from text and then link the product to the entries in the catalog file. Our solution follows 2 steps. First, we use processing pipelines to extract product mentions by incorporating multiple techniques including traditional named entities recognition (NER), regular expression rules and gazetteer-based string matching. Second, we view product linking task into an information retrieval (IR) problem, where the description catalog file is populated into a database. Thus, each product mention acts as a search query and the returned results from catalog entry database serve as the links. The F1 scores of our submission on public and private test data are 24.82% and 16.04%, respectively.

源语言英语
主期刊名Proceedings - 12th IEEE International Conference on Data Mining Workshops, ICDMW 2012
869-871
页数3
DOI
出版状态已出版 - 2012
已对外发布
活动12th IEEE International Conference on Data Mining Workshops, ICDMW 2012 - Brussels, 比利时
期限: 10 12月 201210 12月 2012

出版系列

姓名Proceedings - 12th IEEE International Conference on Data Mining Workshops, ICDMW 2012

会议

会议12th IEEE International Conference on Data Mining Workshops, ICDMW 2012
国家/地区比利时
Brussels
时期10/12/1210/12/12

指纹

探究 'An NER-based product identification and lucene-based product linking approach to CPROD1 challenge: Description of submission system to CPROD1 Challenge' 的科研主题。它们共同构成独一无二的指纹。

引用此