VocSim: A Training-free Benchmark for Zero-shot Content Identity in Single-source Audio 事件

PRODUCT_LAUNCH2026-06-02影响: MEDIUM

VocSim: A Training-free Benchmark for Zero-shot Content Identity in Single-source Audio arXiv:2512.10120v2 Announce Type: replace-cross Abstract: General-purpose audio representations aim to map acoustically variable instances of the same event to nearby points, resolving content identity in a zero-shot setting. Unlike supervised classification benchmarks that measure adaptability via parameter updates, we introduce VocSim, a training-free benchmark probing the intrinsic geometric alignment of

VocSim: A Training-free Benchmark for Zero-shot Content Identity in Single-source Audio · 相关人物