Deep Psychovisual Image Representations 文章

ArXiv CS.CV2026-05-29NEWSen作者: Wendi Ma, Aryaman Sharma, Wei Dai, Shekhar S. Chandra

详细信息

来源站点: ArXiv CS.CV
作者: Wendi Ma, Aryaman Sharma, Wei Dai, Shekhar S. Chandra
文章类型: NEWS
语言: en
发布日期: 2026-05-29

摘要

arXiv:2605.29260v1 Announce Type: new Abstract: Psychovisual models suggest human vision decouples low-level feature extraction from higher cognition by first forming intermediate abstractions. In contrast, deep learning-based vision models routinely extract and aggregate features using homogeneous stacks of spatial layers, rendering their decision-making processes opaque. In this paper, we propose Deep Visual Coding, a learned frequency-domain representation inspired by 1990s image codes that quantised perceptually salient frequencies, which together with complex-valued image representations produces psychovisual-style abstractions. This approach enables the first psychovisual-based deep learning framework, utilizing data-driven spectral filters that learn to encode task-relevant semantic structures within distinct frequency sub-bands.

Deep Psychovisual Image Representations 文章

详细信息

摘要

相关事件

相关公司

相关人物

相关产品

相关技术查看全部 (1)