GaMi: Geometry-Agnostic Material Identification via Cross-Modal Subtractive Disentanglement 文章

ArXiv CS.AI2026-06-01NEWSen作者: Zhiwei Chen (UESTC, Chengdu, China), Yijie Li (National University of Singapore, Singapore), Yimo Zhang (UESTC, Chengdu, China), Shiyun Shao (UESTC, Chengdu, China), Yichao Chen (Shanghai Jiao Tong University, Shanghai, China), Dian Ding (Shanghai Jiao Tong University, Shanghai, China), Liang Wang (Northwestern Polytechnical University, Xi'an, China), Haiwei Wu (UESTC, Chengdu, China), Liwei Guo (UESTC, Chengdu, China), Jie Yang (UESTC, Chengdu, China), Xiaosong Zhang (UESTC, Chengdu, China), Yongzhao Zhang (UESTC, Chengdu, China)

摘要

arXiv:2605.30818v1 Announce Type: cross Abstract: Non-contact material identification enables adaptive interaction for embodied intelligence yet faces challenges from geometry-induced variations (e.g., orientation, shape, distance) and single-modality ambiguities. In this paper, we present GaMi, a multimodal material identification system integrating mmWave and acoustic sensing to robustly operate under unconstrained geometric conditions. By leveraging the insight of shared geometric consistency between co-located bimodal sensors, GaMi employs an intra-sample cross-modal subtractive disentanglement framework. By semantically aligning modalities and subtracting the shared geometric context, it isolates intrinsic material features. Furthermore, GaMi incorporates inter-sample contrastive learning to correct the residual interference caused by cross-modal misalignment. Additionally, a pairing-based adaptation strategy between two modalities enables few-shot generalization across devices.