Capturing LLM Capabilities via Evidence-Calibrated Query Clustering 文章

ArXiv CS.AI2026-06-02NEWSen作者: Fangzhou Wu, Sandeep Silwal, Qiuyi Zhang

摘要

arXiv:2605.17110v2 Announce Type: replace Abstract: Query clustering organizes queries into groups that reflect shared latent capability demands, enabling capability-aware LLM evaluation. Existing clustering methods, which primarily rely on semantic taxonomies or embeddings, often fail to capture such latent capability requirements due to a misalignment between surface-level semantics and actual model performance. We propose ECC, an algorithm that calibrates prior semantic embeddings using limited posterior model comparisons to bridge the gap between surface-level semantics and latent capability requirements. ECC characterizes each cluster through a capability profile parameterized by a Bradley-Terry model and uses trainable mixture weights to accommodate queries with mixed capability demands, jointly learning a flexible, capability-aware clustering structure that supports query-specific inference of LLM capabilities.

相关公司

暂无数据

相关人物

暂无数据

相关产品

暂无数据