Think Fast, Talk Smart: Partitioning Deterministic and Neural Computation for Structured Health Text Generation 文章

ArXiv CS.AI2026-05-29NEWSen作者: Kai-Chen Cheng, Haejun Han, David Q. Sun

摘要

arXiv:2605.29652v1 Announce Type: new Abstract: Large language models (LLMs) are increasingly being used to generate health text from structured records such as wearable time series, biomarkers, vitals, and care-management logs. For recurring health outputs, fluency is not enough: systems must remain faithful to source data, ground explanatory claims in available evidence, follow stated policies, emit machine-readable outputs, and run cheaply enough for repeated use. We ask which responsibilities in structured health generation should be deterministic computation rather than runtime LLM prompting. We introduce Think Fast, Talk Smart, a sleep-health insight pipeline in which deterministic code performs recurring analysis before one bounded LLM writer call. Across 280 user-nights and six models, achieves lower numeric error, lower instruction-compliance error, and lower end-to-end cost than structured zero-shot and few-shot one-call baselines.