FuseFormer: A Manifold Metric Fusing Attention for Pedestrian Trajectory Prediction

Yi Zou, Kohsin Ko, Jian Yang, Yingjie Liu, Ke Li, Xiong You, Jinpeng Mi, Xuan Tang, Mingsong Chen, Xian Wei

Research output: Contribution to journalArticlepeer-review

Abstract

Accurate pedestrian trajectory prediction is critical for ensuring the safety of autonomous vehicles and advancing higher levels of driving automation. However, the complex interpersonal interactions and highly dynamic trajectory patterns in real-world scenarios pose significant challenges to achieving precise predictions. Recently, Transformers have shown remarkable success in pedestrian trajectory prediction, primarily due to their effective modeling of temporal and spatial dependencies via Multi-Head Self-Attention (MHA) mechanisms. Despite these advancements, existing self-attention methods often rely on Euclidean distance-based metrics and dot-product operations, which are inadequate for capturing interaction-induced trajectory curvatures. To address this limitation, we propose a novel hybrid Transformer architecture, FuseFormer, that incorporates Geodesic Self-Attention (GSA) mechanisms. GSA utilizes geodesic distances to characterize interaction features effectively, complementing MHA, which excels in capturing local features and maintaining temporal correlations. FuseFormer employs a gating network to adaptively combine GSA and MHA embeddings, leveraging their complementary strengths. Additionally, FuseFormer integrates a Transformer-based Neural Ordinary Differential Equation (ODE) decoder to model trajectory temporal dynamics. This design enables the generation of future trajectories that align closely with motion trends while adapting the network depth to input sequence lengths. Experimental results demonstrate that FuseFormer achieves state-of-the-art performance across widely used pedestrian trajectory prediction datasets, including ETH/UCY, SDD, and NBA. These results underscore the model's effectiveness and generalization capability in capturing complex interaction patterns and handling diverse scenarios.

Original languageEnglish
Pages (from-to)12372-12386
Number of pages15
JournalIEEE Transactions on Intelligent Transportation Systems
Volume26
Issue number8
DOIs
StatePublished - 2025

Keywords

  • Pedestrian trajectory prediction
  • geodesic distance
  • manifold
  • neural ODE
  • non-Euclidean

Fingerprint

Dive into the research topics of 'FuseFormer: A Manifold Metric Fusing Attention for Pedestrian Trajectory Prediction'. Together they form a unique fingerprint.

Cite this