Skip to main navigation Skip to search Skip to main content

Symbolic Music Generation with Adaptive Representation Alignment in Texture

  • Qing Chen
  • , Hengyu Zhang
  • , Kaiyuan Liu
  • , Tianyi Lu
  • , Qidang Zhou
  • , Daoguo Dong*
  • , Liang He
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Symbolic music is gaining more and more attention for its high semantic precision and editorial flexibility. However, using traditional condition processing methods to generate symbolic music is problematic since these approaches often result in semantic loss by overlooking the intrinsic connections between musical elements. We introduce the SMART, Symbolic Music generation with Adaptive Representation alignment in Texture which incorporates a condition disentanglement method and an adaptive representation alignment framework, capable of fully utilizing multi-scale information and generating music with consistent texture. In SMART, the condition disentanglement method is used to decouple complex condition into more refined musical elements and the representation alignment framework is for semantic constraint during the generation process. Our method demonstrates remarkable advancements in texture consistency and chord controllability compared to standard strong baselines in symbolic music generation.

Original languageEnglish
Title of host publicationMcGE 2025 - Proceedings of the 3rd International Workshop on Multimedia Content Generation and Evaluation
Subtitle of host publicationNew Methods and Practice, Co-Located with MM 2025
PublisherAssociation for Computing Machinery, Inc
Pages12-18
Number of pages7
ISBN (Electronic)9798400720604
DOIs
StatePublished - 26 Oct 2025
Event3rd International Workshop on Multimedia Content Generation and Evaluation: New Methods and Practice, McGE 2025 - Dublin, Ireland
Duration: 31 Oct 202531 Oct 2025

Publication series

NameMcGE 2025 - Proceedings of the 3rd International Workshop on Multimedia Content Generation and Evaluation: New Methods and Practice, Co-Located with MM 2025

Conference

Conference3rd International Workshop on Multimedia Content Generation and Evaluation: New Methods and Practice, McGE 2025
Country/TerritoryIreland
CityDublin
Period31/10/2531/10/25

Keywords

  • diffusion model
  • representation alignment
  • symbolic music generation

Fingerprint

Dive into the research topics of 'Symbolic Music Generation with Adaptive Representation Alignment in Texture'. Together they form a unique fingerprint.

Cite this