AICompanionBench: Benchmarking LLMs-as-Judges for AI Companion Safety · Event

Name: AICompanionBench: Benchmarking LLMs-as-Judges for AI Companion Safety
Start: 2026-06-04

BREAKTHROUGH · 2026-06-04 · Impact: HIGH

arXiv:2606.04867v1 (Announce Type: new)

Abstract: As AI companion platforms such as Replika and Character.AI rapidly grow, concerns about unsafe human-AI interactions have intensified. This study introduces AICompanionBench, to our knowledge the first publicly available benchmark dataset of human-AI companion conversations annotated with fine-grained safety risk categories. The dataset contains 2,123 real-world Replika conversa

Artificial Intelligence

Relationship Graph

AICompanionBench: Benchmarking LLMs-as-Judges for AI Companion Safety · Related Companies

arXiv (NONPROFIT)

HuMA (NONPROFIT)

ACTION (NONPROFIT)

InterAction (NONPROFIT)

Framework (COMPANY)

ACT (NONPROFIT)

Character (NONPROFIT)

Ratio (RESEARCH_INSTITUTE)

UBS