跳到主要导航 跳到搜索 跳到主要内容

Human Motion Synthesis in 3D Scenes via Unified Scene Semantic Occupancy

  • Jingyu Gong
  • , Kunkun Tong
  • , Zhuoran Chen
  • , Chuanhan Yuan
  • , Mingang Chen
  • , Zhizhong Zhang
  • , Xin Tan*
  • , Yuan Xie
  • *此作品的通讯作者
  • East China Normal University
  • Shanghai Development Center of Computer Software Technology
  • Chongqing University

科研成果: 期刊稿件会议文章同行评审

摘要

Human motion synthesis in 3D scenes relies heavily on scene comprehension, while current methods focus mainly on scene structure but ignore the semantic understanding. In this paper, we propose a human motion synthesis framework that take an unified Scene Semantic Occupancy (SSO) for scene representation, termed SSOMotion. We design a bi-directional triplane decomposition to derive a compact version of the SSO, and scene semantics are mapped to an unified feature space via CLIP encoding and shared linear dimensionality reduction. Such strategy can derive the fine-grained scene semantic structures while significantly reduce redundant computations. We further take these scene hints and movement direction derived from instructions for motion control via frame-wise scene query. Extensive experiments and ablation studies conducted on cluttered scenes using ShapeNet furniture, as well as scanned scenes from PROX and Replica datasets, demonstrate its cutting-edge performance while validating its effectiveness and generalization ability.

源语言英语
页(从-至)4248-4256
页数9
期刊Proceedings of the AAAI Conference on Artificial Intelligence
40
6
DOI
出版状态已出版 - 2026
活动40th AAAI Conference on Artificial Intelligence, AAAI 2026 - Singapore, 新加坡
期限: 20 1月 202627 1月 2026

指纹

探究 'Human Motion Synthesis in 3D Scenes via Unified Scene Semantic Occupancy' 的科研主题。它们共同构成独一无二的指纹。

引用此