Performance and Complexity Trade-off Optimization of Speech Models During Training 文章

ArXiv CS.AI2026-06-01NEWSen作者: Esteban G\'omez, Tom Backstr\"om

详细信息

来源站点: ArXiv CS.AI
作者: Esteban G\'omez, Tom Backstr\"om
文章类型: NEWS
语言: en
发布日期: 2026-06-01

摘要

arXiv:2601.13704v3 Announce Type: replace-cross Abstract: In speech machine learning, neural network models are typically designed by choosing an architecture with fixed layer sizes and structure. These models are then trained to maximize performance on metrics aligned with the task's objective. While the overall architecture is usually guided by prior knowledge of the task, the sizes of individual layers are often chosen heuristically. However, this approach does not guarantee an optimal trade-off between performance and computational complexity; consequently, post hoc methods such as weight quantization or model pruning are typically employed to reduce computational cost. This occurs because stochastic gradient descent (SGD) methods can only optimize differentiable functions, while factors influencing computational complexity, such as layer sizes and floating-point operations per second (FLOP/s), are non-differentiable and require modifying the model structure during training.

Performance and Complexity Trade-off Optimization of Speech Models During Training 文章

详细信息

摘要

相关事件

相关公司

相关人物

相关产品

相关技术查看全部 (1)