Efficient and Scalable Provenance Tracking for LLM-Generated Code Snippets 事件

PRODUCT_LAUNCH2026-05-28影响: MEDIUM

Efficient and Scalable Provenance Tracking for LLM-Generated Code Snippets arXiv:2605.28510v1 Announce Type: cross Abstract: Large language models (LLMs) for code completion and generation are increasingly used in software development, yet they may reproduce training examples verbatim and without authorship attribution, raising legal and ethical concerns around plagiarism and license compliance. Classical fingerprint-based plagiarism detectors based on fingerprinting, such as Winnowing, remain