E-valuator: Reliable Agent Verifiers with Sequential Hypothesis Testing

ArXiv CS.AI · 2026-05-29 · Authors: Shuvom Sadhuka, Drew Prinster, Clara Fannjiang, Gabriele Scalia, Bonnie Berger, Aviv Regev, Hanchen Wang
