On statistical characteristics of real-life knowledge graphs

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

4 Scopus citations

Abstract

The success of open-access knowledge graphs, such as YAGO, and commercial products, such as Google Knowledge Graph, has attracted much attention from both academic and industrial communities in building common-sense and domain-specific knowledge graphs. A natural question arises that how to effectively and efficiently manage a large-scale knowledge graph. Though systems and technologies that use relational storage engines or native graph database management systems are proposed, there exists no widely accepted solution. Therefore, a benchmark for management of knowledge graphs is required. In this paper, we analyze the requirements of benchmarking knowledge graph management from a specific yet important point-of-view, i.e. characteristics of knowledge graph data. Seventeen statistical features of four knowledge graphs as well as two social networks are studied. We show that through these graphs depict similar structures, their tiny differences may result in totally different storage and indexing strategies, that should not be omitted. Finally, we put forward the requirements to seeding datasets and synthetic data generators for benchmarking knowledge graph management based on the study.

Original languageEnglish
Title of host publicationBig Data Benchmarks, Performance Optimization, and Emerging Hardware - 6th Workshop, BPOE 2015, Revised Selected Papers
EditorsRoberto V. Zicari, Jianfeng Zhan, Rui Han
PublisherSpringer Verlag
Pages37-49
Number of pages13
ISBN (Print)9783319290058
DOIs
StatePublished - 2016
Event6th Workshop on Big Data Benchmarks, Performance Optimization, and Emerging Hardware, BPOE 2015 - Kohala, United States
Duration: 31 Aug 20154 Sep 2015

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume9495
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference6th Workshop on Big Data Benchmarks, Performance Optimization, and Emerging Hardware, BPOE 2015
Country/TerritoryUnited States
CityKohala
Period31/08/154/09/15

Keywords

  • Benchmark
  • Knowledge graph
  • Social network

Fingerprint

Dive into the research topics of 'On statistical characteristics of real-life knowledge graphs'. Together they form a unique fingerprint.

Cite this