REED: Post-Training Representation Editing for Cross-Domain Linguistic Steganalysis 文章

ArXiv CS.AI2026-05-28NEWSen作者: Ruohan Lei, Jianxin Gao, Wanli Peng, Huimin Pei

摘要

arXiv:2605.28298v1 Announce Type: new Abstract: In real-world scenarios of linguistic steganalysis, tested texts usually come from unseen domains with different vocabularies, topics, writing styles, and steganographic generation patterns, which can significantly degrade the detection performance. Although existing cross-domain steganalysis methods can effectively alleviate this problem through distribution alignment, domain-invariant feature learning, etc., the detection performance is not satisfactory. In this paper, we propose a post-training representation editing method for cross-domain linguistic steganalysis. Specifically, the detector is first trained on source-domain data, and then the feature extractor and classifier are kept frozen, and the intermediate representations are deterministically edited before classification. For domain adaptation, we construct a domain-offset vector from marginal source and target representations.