Spatial-Temporal Decoupled Adapter for Micro-gesture Online Recognition 文章

ArXiv CS.CV2026-06-08NEWSen作者: Xucheng Shen, Kun Li, Fei Wang, Wei Qian, Jin Jiang, Dan Guo

详细信息

来源站点
ArXiv CS.CV
作者
Xucheng Shen, Kun Li, Fei Wang, Wei Qian, Jin Jiang, Dan Guo
文章类型
NEWS
语言
en
发布日期
2026-06-08

摘要

arXiv:2606.07355v1 Announce Type: new Abstract: Micro-gesture online recognition aims to temporally localize and classify subtle gestures in untrimmed videos. Owing to their extremely short duration, low motion amplitude, and ambiguous visual cues, capturing discriminative spatiotemporal representations remains highly challenging. Existing parameter-efficient adapters typically employ a single branch to model spatial and temporal cues jointly, which may fail to capture the fine-grained patterns of micro-gestures. To address this limitation, we propose a Spatial-Temporal Decoupled Adapter that decomposes video adaptation into independent temporal and spatial branches via lightweight depthwise convolutions. In addition, to address the long-tail distribution problem in the benchmark dataset, we introduce Adaptive Soft Balanced Augmentation, which dynamically allocates augmentation intensity based on class rarity and learning difficulty, without manual thresholds.

相关事件

暂无数据

相关公司

暂无数据

相关人物

暂无数据

相关产品

暂无数据