modAL · Related events
Evaluating Stochastic Collapse and Implicit Bias in Multimodal Large Language Models
2026-06-05 · PRODUCT_LAUNCH · Impact: MEDIUM
MCBench: A Multicontext Safety Assessment Benchmark for Omni Large Language Models
2026-06-05 · PRODUCT_LAUNCH · Impact: MEDIUM
MCBench: A Multicontext Safety Assessment Benchmark for Omni Large Language Models
2026-06-05 · BREAKTHROUGH · Impact: HIGH
InfoShield: Privacy-Preserving Speech Representations for Mental Health Screening via Information-Theoretic Optimization
2026-06-05 · PRODUCT_LAUNCH · Impact: MEDIUM
PlanBench-V: A Spatial Planning Map Benchmark for Vision-Language Models
2026-06-05 · PRODUCT_LAUNCH · Impact: MEDIUM
An ERP Study on Recursive Locative Processing in Mandarin-Speaking Children with Autism
2026-06-05 · PRODUCT_LAUNCH · Impact: MEDIUM
PlanBench-V: A Spatial Planning Map Benchmark for Vision-Language Models
2026-06-05 · REGULATION · Impact: MEDIUM
MARDoc: A Memory-Aware Refinement Agent Framework for Multimodal Long Document QA
2026-06-05 · PRODUCT_LAUNCH · Impact: MEDIUM
Mechanistic Insights into Functional Sparsity in Multimodal LLMs via CoRe Heads
2026-06-05 · PRODUCT_LAUNCH · Impact: MEDIUM
To Be Multimodal or Not to Be: Query-Adaptive Audio-Visual Person Retrieval via Active Modality Detection
2026-06-05 · PRODUCT_LAUNCH · Impact: MEDIUM
Automatic Labelling of Speech Translation Errors
2026-06-05 · PRODUCT_LAUNCH · Impact: MEDIUM
Revising Context, Shifting Simulated Stance: Auditing LLM-Based Stance Simulation in Online Discussions
2026-06-05 · PRODUCT_LAUNCH · Impact: MEDIUM
Harnessing Generalist Agents for Contextualized Time Series
2026-06-05 · PRODUCT_LAUNCH · Impact: MEDIUM
Almieyar-Oryx-BloomBench: A Bilingual Multimodal Benchmark for Cognitively Informed Evaluation of Vision-Language Models
2026-06-05 · PRODUCT_LAUNCH · Impact: MEDIUM
LongSpace: Exploring Long-Horizon Spatial Memory from Perception to Recall in Video
2026-06-05 · PRODUCT_LAUNCH · Impact: MEDIUM
UNIVID: Unified Vision-Language Model for Video Moderation
2026-06-05 · PRODUCT_LAUNCH · Impact: MEDIUM
UNIVID: Unified Vision-Language Model for Video Moderation
2026-06-05 · OPEN_SOURCE · Impact: MEDIUM
MemoryCard: Topic-Aware Multi-Modal Clue Compression for Long-Video Question Answering
2026-06-05 · PRODUCT_LAUNCH · Impact: MEDIUM
The Mirage of Performance Gains: Why Contrastive Decoding Fails to Mitigate Object Hallucinations in MLLMs?
2026-06-05 · PRODUCT_LAUNCH · Impact: MEDIUM
Do MLLMs Capture How Interfaces Guide User Behavior? A Benchmark for Multimodal UI/UX Design Understanding
2026-06-05 · PRODUCT_LAUNCH · Impact: MEDIUM
DocHop-QA: Towards Multi-Hop Reasoning over Multimodal Document Collections
2026-06-05 · PRODUCT_LAUNCH · Impact: MEDIUM
Seeing is Believing? Evaluating Vision-Language Model Susceptibility in Agent-to-Agent Multimodal Persuasion
2026-06-05 · PRODUCT_LAUNCH · Impact: MEDIUM
ChartAttack: Testing the Vulnerability of LLMs to Malicious Prompting in Chart Generation
2026-06-05 · PRODUCT_LAUNCH · Impact: MEDIUM
Biomazon: A Multimodal Dataset for 3D Forest Structure and Biomass Modeling in the Amazon Basin
2026-06-05 · PRODUCT_LAUNCH · Impact: MEDIUM
Deep Learning-assisted AMD Staging based on OCT and OCT Angiography
2026-06-05 · ACQUISITION · Impact: HIGH
Deep Learning-assisted AMD Staging based on OCT and OCT Angiography
2026-06-05 · PRODUCT_LAUNCH · Impact: MEDIUM
Disentangled Fine-Grained Prototype Learning for Incomplete Image-Tabular Classification
2026-06-05 · PRODUCT_LAUNCH · Impact: MEDIUM
LLM-Guided ANN Index Optimization for Human-Object Interaction Retrieval
2026-06-05 · PRODUCT_LAUNCH · Impact: MEDIUM
Unpaired RGB-Thermal Gaussian-Splatting Using Visual Geometric Transformers
2026-06-05 · PRODUCT_LAUNCH · Impact: MEDIUM
BRepCLIP: Contrastive Multimodal Pretraining on BRep Primitives for CAD Understanding
2026-06-05 · PRODUCT_LAUNCH · Impact: MEDIUM
UltraVR: A Diagnostic Ultra-Resolution Image-VQA Benchmark for Evidence-Grounded Reasoning
2026-06-05 · ACQUISITION · Impact: HIGH
UltraVR: A Diagnostic Ultra-Resolution Image-VQA Benchmark for Evidence-Grounded Reasoning
2026-06-05 · PRODUCT_LAUNCH · Impact: MEDIUM
ViCuR: Visual Cues as Recoverable Privilege for Multimodal On-Policy Distillation
2026-06-05 · PRODUCT_LAUNCH · Impact: MEDIUM
VTI-CoT: Visual-Textual Interleaved Chain of Thought for Video Reasoning
2026-06-05 · PRODUCT_LAUNCH · Impact: MEDIUM
ExpSpeech-Net: Multimodal Fusion of Expression and Speech for Deepfake Detection
2026-06-05 · PRODUCT_LAUNCH · Impact: MEDIUM
Learning Geometric Representations from Videos for Spatial Intelligent Multimodal Large Language Models
2026-06-05 · PRODUCT_LAUNCH · Impact: MEDIUM
Faithful, Enriched, and Precise: Benchmarking Natural-Science Illustration Generation by T2I models
2026-06-05 · PRODUCT_LAUNCH · Impact: MEDIUM
Multimodal Sexism Identification and Characterization using Large Language Models and Gradient Boosting
2026-06-05 · PRODUCT_LAUNCH · Impact: MEDIUM
LLM-Conditioned Synthesis of Pathological Gaits via Structured Gait-Language Representations
2026-06-05 · PRODUCT_LAUNCH · Impact: MEDIUM
HyperVis: Continuous Latent Visual Relational Graphs on the Lorentz Hyperboloid for Compositional Reasoning
2026-06-05 · PRODUCT_LAUNCH · Impact: MEDIUM
DisasterBench: A Multimodal Benchmark for UAV-Based Disaster Response in Complex Environments
2026-06-05 · PRODUCT_LAUNCH · Impact: MEDIUM
GRAMformer: Any-Order Modality Interactions via Volumetric Multimodal Cross-Attention
2026-06-05 · PRODUCT_LAUNCH · Impact: MEDIUM
A Vision-language Framework for Comparative Reasoning in Radiology
2026-06-05 · PRODUCT_LAUNCH · Impact: MEDIUM
PAR3D: A Unified 3D-MLLM with Part-Aware Representation for Scene Understanding
2026-06-05 · PRODUCT_LAUNCH · Impact: MEDIUM
Drishti AI-Event Guardian: An Intelligent Real-Time Crowd Monitoring and Emergency Response System for Mass Gathering Events
2026-06-05 · PRODUCT_LAUNCH · Impact: MEDIUM
Flash-WAM: Modality-Aware Distillation for World Action Models
2026-06-05 · PRODUCT_LAUNCH · Impact: MEDIUM
Seeing Time: Benchmarking Chronological Reasoning and Shortcut Biases in Vision-Language Models
2026-06-05 · PRODUCT_LAUNCH · Impact: MEDIUM
Learning Visual Spatial Planning from Symbolic State via Modality-Gap-Aware Self-Distillation
2026-06-05 · PRODUCT_LAUNCH · Impact: MEDIUM
Learning Predictive Visuomotor Coordination
2026-06-05 · PRODUCT_LAUNCH · Impact: MEDIUM