Challenges in Chinese knowledge graph construction

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

34 Scopus citations

Abstract

The automatic construction of large-scale knowledge graphs has received much attention from both academia and industry in the past few years. Notable knowledge graph systems include Google Knowledge Graph, DBPedia, YAGO, NELL, Probase and many others. Knowledge graph organizes the information in a structured way by explicitly describing the relations among entities. Since entity identification and relation extraction are highly depending on language itself, data sources largely determine the way the data are processed, relations are extracted, and ultimately how knowledge graphs are formed, which deeply involves the analysis of lexicon, syntax and semantics of the content. Currently, much progress has been made for knowledge graphs in English language. In this paper, we discuss the challenges facing Chinese knowledge graph construction because Chinese is significantly different from English in various linguistic perspectives. Specifically, we analyze the challenges from three aspects: data sources, taxonomy derivation and knowledge extraction. We also present our insights in addressing these challenges.

Original languageEnglish
Title of host publicationICDEW 2015 - 2015 IEEE 31st International Conference on Data Engineering Workshops
PublisherIEEE Computer Society
Pages59-61
Number of pages3
ISBN (Electronic)9781479984411
DOIs
StatePublished - 19 Jun 2015
Event2015 31st IEEE International Conference on Data Engineering Workshops, ICDEW 2015 - Seoul, Korea, Republic of
Duration: 13 Apr 201517 Apr 2015

Publication series

NameProceedings - International Conference on Data Engineering
Volume2015-June
ISSN (Print)1084-4627

Conference

Conference2015 31st IEEE International Conference on Data Engineering Workshops, ICDEW 2015
Country/TerritoryKorea, Republic of
CitySeoul
Period13/04/1517/04/15

Fingerprint

Dive into the research topics of 'Challenges in Chinese knowledge graph construction'. Together they form a unique fingerprint.

Cite this