Beyond Query Memorization: Large Language Model Routing with Query Decomposition and Historical Matching 事件
PRODUCT_LAUNCH2026-05-26影响: MEDIUM
Beyond Query Memorization: Large Language Model Routing with Query Decomposition and Historical Matching arXiv:2605.25558v1 Announce Type: new Abstract: Optimizing the trade-off among predictive performance and computational cost is a central focus in the deployment of Large Language Models (LLMs). Current routing methods primarily rely on direct mapping from queries to models based on surface-level features, making them susceptible to the memorization trap and leading to poor generalizability