Training Stratigraphy: Persistent Behavioral Artifacts in Large Language Models Observed Through Longitudinal AI-Human Interaction · Event

PRODUCT_LAUNCH · 2026-05-28 · Impact: MEDIUM

arXiv:2605.28102v1 · Announce Type: new

Abstract: Large language models trained with Reinforcement Learning from Human Feedback (RLHF) and Constitutional AI exhibit persistent behavioral patterns that survive system prompt replacement; we term these patterns training strata. This paper identifies five such strata through longitudinal auto-ethnographic observation within a

Related companies

ISO · NONPROFIT
ACTION · NONPROFIT
PHI · NONPROFIT
InterAction · NONPROFIT
EARN · NONPROFIT
Anis · NONPROFIT
ACT · NONPROFIT
ITU · NONPROFIT