SAM: A Spatial-Aware Learned Index for Disk-Based Multi-dimensional Search

Huan Zhou, Lei Yang, Yu Xiao, Yuanxiong He, Jian Hu, Weining Qian

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Although existing learned multi-dimensional indexes achieve fast similarity query processing, they still incur high I/O cost and large computational consumption. To address these issues, we propose a spatial-aware learned index for disk-based multi-dimensional search (SAM for short). Its core idea is to use a data transformation technique based on dual-distance metric to map more similar data in space into compact regions and the mapped values are totally ordinal. SAM partitions data into clusters, redistributes data by utilizing a pivot for each cluster and Euclidean distance and Manhattan distance and uses a learned index to approximate the position of each data record on disk. Our experimental evaluation on real-world and synthetic datasets shows that SAM outperforms the SOTA learned indexes by ∼2× for range queries and ∼9× for nearest neighbor queries.

Original languageEnglish
Title of host publicationWeb and Big Data - 8th International Joint Conference, APWeb-WAIM 2024, Proceedings
EditorsWenjie Zhang, Zhengyi Yang, Xiaoyang Wang, Anthony Tung, Zhonglong Zheng, Hongjie Guo
PublisherSpringer Science and Business Media Deutschland GmbH
Pages3-18
Number of pages16
ISBN (Print)9789819772407
DOIs
StatePublished - 2024
Event8th Asia-Pacific Web and Web-Age Information Management Joint International Conference on Web and Big Data, APWeb-WAIM 2024 - Jinhua, China
Duration: 30 Aug 20241 Sep 2024

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume14964 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference8th Asia-Pacific Web and Web-Age Information Management Joint International Conference on Web and Big Data, APWeb-WAIM 2024
Country/TerritoryChina
CityJinhua
Period30/08/241/09/24

Keywords

  • Learned Index
  • Multi-dimension
  • Spatial-aware

Fingerprint

Dive into the research topics of 'SAM: A Spatial-Aware Learned Index for Disk-Based Multi-dimensional Search'. Together they form a unique fingerprint.

Cite this