Spatial-Temporal Decoupled Adapter for Micro-gesture Online Recognition 文章

ArXiv CS.CV2026-06-08NEWSen作者: Xucheng Shen, Kun Li, Fei Wang, Wei Qian, Jin Jiang, Dan Guo

详细信息

来源站点: ArXiv CS.CV
作者: Xucheng Shen, Kun Li, Fei Wang, Wei Qian, Jin Jiang, Dan Guo
文章类型: NEWS
语言: en
发布日期: 2026-06-08

摘要

arXiv:2606.07355v1 Announce Type: new Abstract: Micro-gesture online recognition aims to temporally localize and classify subtle gestures in untrimmed videos. Owing to their extremely short duration, low motion amplitude, and ambiguous visual cues, capturing discriminative spatiotemporal representations remains highly challenging. Existing parameter-efficient adapters typically employ a single branch to model spatial and temporal cues jointly, which may fail to capture the fine-grained patterns of micro-gestures. To address this limitation, we propose a Spatial-Temporal Decoupled Adapter that decomposes video adaptation into independent temporal and spatial branches via lightweight depthwise convolutions. In addition, to address the long-tail distribution problem in the benchmark dataset, we introduce Adaptive Soft Balanced Augmentation, which dynamically allocates augmentation intensity based on class rarity and learning difficulty, without manual thresholds.

Spatial-Temporal Decoupled Adapter for Micro-gesture Online Recognition 文章

详细信息

摘要

相关事件

相关公司

相关人物

相关产品

相关技术查看全部 (2)