Skip to main navigation Skip to search Skip to main content

Less is More: Efficient Time Series Dataset Condensation via Two-fold Modal Matching

  • Hao Miao
  • , Ziqiao Liu
  • , Yan Zhao
  • , Chenjuan Guo
  • , Kai Zheng
  • , Christian S. Jensen
  • Aalborg University
  • University of Electronic Science and Technology of China
  • East China Normal

Research output: Contribution to journalConference articlepeer-review

Abstract

The expanding instrumentation of processes throughout society with sensors yields a proliferation of time series data that may in turn enable important applications, e.g., related to transportation infrastructures or power grids. Machine-learning based methods are increasingly being used to extract value from such data. We provide means of reducing the resulting considerable computational and data storage costs. We achieve this by providing means of condensing large time series datasets such that models trained on the condensed data achieve performance comparable to those trained on the original, large data. Specifically, we propose a time series dataset condensation framework, Time DC, that employs two-fold modal matching, encompassing frequency matching and training trajectory matching. Thus, Time DC performs time series feature extraction and decomposition-driven frequency matching to preserve complex temporal dependencies in the reduced time series. Further, Time DC employs curriculum training trajectory matching to ensure effective and generalized time series dataset condensation. To avoid memory overflow and to reduce the cost of dataset condensation, the framework includes an expert buffer storing pre-computed expert trajectories. Extensive experiments on real data offer insight into the effectiveness and efficiency of the proposed solutions.

Original languageEnglish
Pages (from-to)226-238
Number of pages13
JournalProceedings of the VLDB Endowment
Volume18
Issue number2
DOIs
StatePublished - 2025
Externally publishedYes
Event51st International Conference on Very Large Data Bases, VLDB 2025 - London, United Kingdom
Duration: 1 Sep 20255 Sep 2025

Fingerprint

Dive into the research topics of 'Less is More: Efficient Time Series Dataset Condensation via Two-fold Modal Matching'. Together they form a unique fingerprint.

Cite this