Abstract
For the past decades, the subgraph similarity search over a largescale data graph has become increasingly important and crucial in many real-world applications, such as social network analysis, bioinformatics network analytics, knowledge graph discovery, and many others. While previous works on subgraph similarity search used various graph similarity metrics such as the graph isomorphism, graph edit distance, and so on, in this paper, we propose a novel problem, namely subgraph similarity search under aggregated neighbor difference semantics (3AND), which identifies subgraphs g in a data graph G that are similar to a given query graph q by considering both keywords and graph structures (under new keyword/structural matching semantics). To efficiently tackle the 3AND problem, we design two effective pruning methods, keyword set and aggregated neighbor difference lower bound pruning, which rule out false alarms of candidate vertices/subgraphs to reduce the 3AND search space. Furthermore, we construct an effective indexing mechanism to facilitate our proposed efficient 3AND query answering algorithm. Through extensive experiments, we demonstrate the effectiveness and efficiency of our S3AND approach over both real and synthetic graphs under various parameter settings.
| Original language | English |
|---|---|
| Pages (from-to) | 3708-3720 |
| Number of pages | 13 |
| Journal | Proceedings of the VLDB Endowment |
| Volume | 18 |
| Issue number | 11 |
| DOIs | |
| State | Published - 2025 |
| Event | 51st International Conference on Very Large Data Bases, VLDB 2025 - London, United Kingdom Duration: 1 Sep 2025 → 5 Sep 2025 |