Hyper-YOLO: When Visual Object Detection Meets Hypergraph Computation (Paper)

2024 | IEEE Transactions on Pattern Analysis and Machine Intelligence | Citations: 217

Field: Artificial Intelligence

Topics: Machine Learning and Data Classification; Advanced Neural Network Applications; Advanced Image and Video Retrieval Techniques

Abstract

We introduce Hyper-YOLO, a new object detection method that integrates hypergraph computations to capture the complex high-order correlations among visual features. Traditional YOLO models, while powerful, have limitations in their neck designs that restrict the integration of cross-level features and the exploitation of high-order feature interrelationships. To address these challenges, we propose the Hypergraph Computation Empowered Semantic Collecting and Scattering (HGC-SCS) framework, which transposes visual feature maps into a semantic space and constructs a hypergraph for high-order message propagation. This enables the model to acquire both semantic and structural information, advancing beyond conventional feature-focused learning. Hyper-YOLO incorporates the proposed Mixed Aggregation Network (MANet) in its backbone for enhanced feature extraction and introduces the Hypergraph-Based Cross-Level and Cross-Position Representation Network (HyperC2Net) in its neck. HyperC2Net operates across five scales and breaks free from traditional grid structures, allowing for sophisticated high-order interactions across levels and positions. This synergy of components positions Hyper-YOLO as a state-of-the-art architecture at various model scales, as evidenced by its superior performance on the COCO dataset. Specifically, Hyper-YOLO-N significantly outperforms the advanced YOLOv8-N and YOLOv9-T, with 12% and 9% AP^val improvements, respectively.
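The collect, propagate, and scatter idea described in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function names (`build_incidence`, `hypergraph_propagate`, `hgc_scs_sketch`), the distance-threshold rule for forming hyperedges, the mean aggregation, the residual connection, and the parameters `eps` and `theta` are all assumptions chosen for illustration. It also handles a single feature map rather than the five scales that HyperC2Net operates on.

```python
import numpy as np

def build_incidence(x, eps):
    """Build an incidence matrix H of shape (n_vertices, n_hyperedges).

    Assumption for illustration: every vertex spawns one hyperedge that
    contains all vertices whose features lie within distance eps of it.
    """
    diff = x[:, None, :] - x[None, :, :]          # pairwise feature differences
    dist = np.linalg.norm(diff, axis=-1)          # (N, N) Euclidean distances
    return (dist <= eps).astype(x.dtype)          # H[v, e] = 1 if v is in hyperedge e

def hypergraph_propagate(x, h, theta):
    """Two-stage message passing: vertices -> hyperedges -> vertices."""
    d_e = h.sum(axis=0)                           # hyperedge degrees (>= 1 for eps >= 0)
    d_v = h.sum(axis=1)                           # vertex degrees (>= 1 for eps >= 0)
    edge_feat = (h.T @ x) / d_e[:, None]          # mean of member vertices per hyperedge
    vert_feat = (h @ edge_feat) / d_v[:, None]    # mean of incident hyperedges per vertex
    return x + vert_feat @ theta                  # residual update with a linear map

def hgc_scs_sketch(feature_map, eps, theta):
    """Collect a (C, H, W) map into vertices, propagate, scatter back."""
    c, height, width = feature_map.shape
    x = feature_map.reshape(c, height * width).T  # collect: one vertex per position
    h = build_incidence(x, eps)                   # hypergraph over the vertices
    x_new = hypergraph_propagate(x, h, theta)     # high-order message propagation
    return x_new.T.reshape(c, height, width)      # scatter back to the grid layout

# Usage with random data (hypothetical sizes).
rng = np.random.default_rng(0)
fm = rng.normal(size=(4, 3, 3))
out = hgc_scs_sketch(fm, eps=1.0, theta=0.1 * np.eye(4))
print(out.shape)
```

Because hyperedges here are formed from feature distance rather than grid adjacency, positions that are far apart in the image can still exchange information, which is the sense in which such a design is not tied to the grid structure.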

Authors (4 of 9 shown)

Yifan Feng

Yue Gao

Rongrong Ji

Guiguang Ding
