FIGMA: Towards FIne-Grained Music retrievAl 事件

PRODUCT_LAUNCH2026-06-08影响: MEDIUM

FIGMA: Towards FIne-Grained Music retrievAl arXiv:2606.06615v1 Announce Type: cross Abstract: Retrieving music using natural language descriptions has improved with contrastive audio-text models such as CLAP, but current systems remain limited to coarse semantic queries. When descriptions specify fine-grained musical attributes such as tempo, key, chord progression, or rhythmic structure, existing models often fail to retrieve the correct audio. We show that this limitation stems from the contr