clips 公司
1
产品数
0
专利数
clips · 相关事件
相关事件
NoRA: Evaluating Grounded Reasonableness in Visual First-person Normative Action Reasoning
2026-06-04PRODUCT_LAUNCH影响: MEDIUM
VidMsg: A Benchmark for Implicit Message Inference in Short Videos
2026-06-03PRODUCT_LAUNCH影响: MEDIUM
Benchmarking Visual State Tracking in Multimodal Video Understanding
2026-06-03PRODUCT_LAUNCH影响: MEDIUM
Benchmarking Visual State Tracking in Multimodal Video Understanding
2026-06-03BREAKTHROUGH影响: HIGH
SagaQA: A Multi-hop Reasoning Benchmark for Long-form Narrative Understanding in TV Series
2026-06-03PRODUCT_LAUNCH影响: MEDIUM
JenBridge: Adaptive Long-Form Video Soundtracking across Scene Transitions
2026-06-02PRODUCT_LAUNCH影响: MEDIUM
Jailbreaking Multimodal Large Language Models using Multi-Clip Video
2026-06-02PRODUCT_LAUNCH影响: MEDIUM
VocSim: A Training-free Benchmark for Zero-shot Content Identity in Single-source Audio
2026-06-02PRODUCT_LAUNCH影响: MEDIUM
SuperMemory-VQA: An Egocentric Visual Question-Answering Benchmark for Long-Horizon Memory
2026-06-02PRODUCT_LAUNCH影响: MEDIUM
Temporal Evidence Routing with Structured Visual Evidence for TimeLogicQA
2026-06-02PRODUCT_LAUNCH影响: MEDIUM
Explainable Forensics of Manipulated Segments in Untrimmed Long Videos
2026-06-02PRODUCT_LAUNCH影响: MEDIUM
nuReasoning: A Reasoning-Centric Dataset and Benchmark for Long-Tail Autonomous Driving
2026-06-01PRODUCT_LAUNCH影响: MEDIUM
Recognizing Co-Speech Gestures in-the-Wild
2026-06-01PRODUCT_LAUNCH影响: MEDIUM
LoCoT2V-Bench: Benchmarking Long-Form and Complex Text-to-Video Generation
2026-05-29PRODUCT_LAUNCH影响: MEDIUM
VideoFDB: Evaluating Full-Duplex Vision-Speech Capabilities in Conversational Agents
2026-05-29PRODUCT_LAUNCH影响: MEDIUM
CapTalk: Text-Guided Stylization and Speech-Driven 3D Head Animation
2026-05-29PRODUCT_LAUNCH影响: MEDIUM
Fewer Steps, Better Performance: Efficient Cross-Modal Clip Trimming for Video Moment Retrieval Using Language
2026-05-29PRODUCT_LAUNCH影响: MEDIUM
Clark Hash: Stateless Sparse Johnson-Lindenstrauss Quantization for Neural Embeddings
2026-05-28PRODUCT_LAUNCH影响: MEDIUM
CLANE: Continual Learning of Actions on Neuromorphic Hardware from Event Cameras
2026-05-28PRODUCT_LAUNCH影响: MEDIUM
SONIC-O1: A Real-World Benchmark for Evaluating Multimodal Large Language Models on Audio-Video Understanding
2026-05-28PRODUCT_LAUNCH影响: MEDIUM
SONIC-O1: A Real-World Benchmark for Evaluating Multimodal Large Language Models on Audio-Video Understanding
2026-05-28OPEN_SOURCE影响: MEDIUM
Pop-Up Distractions Reveal Bag-of-Events Behavior in Video Large Language Models
2026-05-27PRODUCT_LAUNCH影响: MEDIUM
LongAV-Compass: Towards Unified Evaluation of Minute-Scale Audio-Visual Generation Across T2AV, I2AV, and V2AV
2026-05-27PRODUCT_LAUNCH影响: MEDIUM
FoodMonitor: Benchmarking MLLMs for Explainable Compliance Analysis
2026-05-26PRODUCT_LAUNCH影响: MEDIUM
FoodMonitor: Benchmarking MLLMs for Explainable Compliance Analysis
2026-05-26REGULATION影响: MEDIUM
Do Image-Text Metrics Respect Semantic Invariances?
2026-05-26PRODUCT_LAUNCH影响: MEDIUM
LRDDv3: High-Resolution Long-Range Drone Detection Dataset with Range Information and Thermal Data
2026-05-26PRODUCT_LAUNCH影响: MEDIUM
TIE: Time Interval Encoding for Video Generation over Events
2026-05-26PRODUCT_LAUNCH影响: MEDIUM
Do Understanding and Generation Fight? A Diagnostic Study of DPO for Unified Multimodal Models
2026-05-26PRODUCT_LAUNCH影响: MEDIUM