Images as Tables: In-Context Learning with TabPFN for Low-Data Detection of AI-Generated Images 文章

ArXiv CS.CV2026-06-02NEWSen作者: Jan Philip Walter, Shashank Agnihotri, Margret Keuper

摘要

arXiv:2606.00872v1 Announce Type: new Abstract: AI-generated image detection is a moving-target problem: detectors trained on one generator often fail when a new generator appears, and only a few labeled examples are available. We study a simple image-to-table formulation for this regime, where each image is encoded by a frozen DINOv3 backbone, its CLS feature is reduced to a 500-dimensional structured row with PCA, and TabPFN performs real/fake classification by in-context tabular inference rather than task-specific classifier training. This turns fake-image detection into low-data structured prediction over learned visual features, making detector adaptation depend on the labeled context set instead of gradient-based fine-tuning. On GenImage, LATTE, a recent state-of-the-art detector, remains stronger when many labeled samples from all generators are available, by 7.