TrajSpark: A scalable and efficient in-memory management system for big trajectory data

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

43 Scopus citations

Abstract

The widespread application of mobile positioning devices has generated big trajectory data. Existing disk-based trajectory management systems cannot provide scalable and low latency query services any more. In view of that, we present TrajSpark, a distributed in-memory system to consistently offer efficient management of trajectory data. TrajSpark introduces a new abstraction called IndexTRDD to manage trajectory segments, and exploits a global and local indexing mechanism to accelerate trajectory queries. Furthermore, to alleviate the essential partitioning overhead, it adopts the time-decay model to monitor the change of data distribution and updates the data-partition structure adaptively. This model avoids repartitioning existing data when new batch of data arrives. Extensive experiments of three types of trajectory queries on both real and synthetic dataset demonstrate that the performance of TrajSpark outperforms state-of-the-art systems.

Original languageEnglish
Title of host publicationWeb and Big Data - 1st International Joint Conference, APWeb-WAIM 2017, Proceedings
EditorsCyrus Shahabi, Xiang Lian, Christian S. Jensen, Xiaochun Yang, Lei Chen
PublisherSpringer Verlag
Pages11-26
Number of pages16
ISBN (Print)9783319635781
DOIs
StatePublished - 2017
Event1st Asia-Pacific Web and Web-Age Information Management Joint Conference on Web and Big Data, APWeb-WAIM 2017 - Beijing, China
Duration: 7 Jul 20179 Jul 2017

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume10366 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference1st Asia-Pacific Web and Web-Age Information Management Joint Conference on Web and Big Data, APWeb-WAIM 2017
Country/TerritoryChina
CityBeijing
Period7/07/179/07/17

Keywords

  • Big trajectory data
  • In-memory
  • Low latency query

Fingerprint

Dive into the research topics of 'TrajSpark: A scalable and efficient in-memory management system for big trajectory data'. Together they form a unique fingerprint.

Cite this