CLIP · 相关事件
相关事件
Soft Sequence Policy Optimization
2026-06-06PRODUCT_LAUNCH影响: MEDIUM
GIPO: Gaussian Importance Sampling Policy Optimization
2026-06-06PRODUCT_LAUNCH影响: MEDIUM
Brain-CLIPLM: Semantic Compression for EEG-to-Text Decoding
2026-06-05PRODUCT_LAUNCH影响: MEDIUM
BRepCLIP: Contrastive Multimodal Pretraining on BRep Primitives for CAD Understanding
2026-06-05PRODUCT_LAUNCH影响: MEDIUM
Emotion-Aware Image Generation from Korean Diary Text via LLM-based Prompt Translation and LoRA Fine-Tuning
2026-06-05PRODUCT_LAUNCH影响: MEDIUM
Adversarial Attacks Already Tell the Answer: Directional Bias-Guided Test-time Defense for Vision-Language Models
2026-06-05PRODUCT_LAUNCH影响: MEDIUM
Towards Accurate Heart Rate Measurement from Ultra-Short Video Clips via Periodicity-Guided rPPG Estimation and Signal Reconstruction
2026-06-05PRODUCT_LAUNCH影响: MEDIUM
UnHype: CLIP-Guided Hypernetworks for Dynamic LoRA Unlearning
2026-06-05PRODUCT_LAUNCH影响: MEDIUM
Unified Pix Token And Word Token Generative Language Model
2026-06-05ACQUISITION影响: HIGH
Unified Pix Token And Word Token Generative Language Model
2026-06-05PRODUCT_LAUNCH影响: MEDIUM
Unified Pix Token And Word Token Generative Language Model
2026-06-05OPEN_SOURCE影响: MEDIUM
NoRA: Evaluating Grounded Reasonableness in Visual First-person Normative Action Reasoning
2026-06-04PRODUCT_LAUNCH影响: MEDIUM
Revisiting Model Stitching In the Foundation Model Era
2026-06-04PRODUCT_LAUNCH影响: MEDIUM
Segment, Embed, and Align: A Universal Recipe for Aligning Subtitles to Signing
2026-06-04PRODUCT_LAUNCH影响: MEDIUM
UniFine: A Unified and Fine-grained Approach for Zero-shot Vision-Language Understanding
2026-06-04PRODUCT_LAUNCH影响: MEDIUM
VT-3DAD: Cross-Category 3D Anomaly Detection via Visual-Text Normal Space Alignment
2026-06-04PRODUCT_LAUNCH影响: MEDIUM
Drift-Augmented Scoring: Text-Derived Noise Robustness for Zero-Shot Audio-Language Classification
2026-06-04PRODUCT_LAUNCH影响: MEDIUM
Robust-LLaVA: On the Effectiveness of Large-Scale Robust Image Encoders for Multi-modal Large Language Models
2026-06-04PRODUCT_LAUNCH影响: MEDIUM
SkelHCC: A Hyperbolic CLIP-Driven Cache Adaptation Framework for Skeleton-based One-Shot Action Recognition
2026-06-03PRODUCT_LAUNCH影响: MEDIUM
VidMsg: A Benchmark for Implicit Message Inference in Short Videos
2026-06-03PRODUCT_LAUNCH影响: MEDIUM
Investigating Adversarial Robustness of Multi-modal Large Language Models
2026-06-03PRODUCT_LAUNCH影响: MEDIUM
Beyond False Stability: High-Noise Drift Gating for Test-Time Adversarial Defenses in Vision-Language Models
2026-06-03PRODUCT_LAUNCH影响: MEDIUM
Beyond Compression: Quantifying Spectral Accessibility in Vision Representations
2026-06-03PRODUCT_LAUNCH影响: MEDIUM
Benchmarking Visual State Tracking in Multimodal Video Understanding
2026-06-03PRODUCT_LAUNCH影响: MEDIUM
Benchmarking Visual State Tracking in Multimodal Video Understanding
2026-06-03BREAKTHROUGH影响: HIGH
SagaQA: A Multi-hop Reasoning Benchmark for Long-form Narrative Understanding in TV Series
2026-06-03PRODUCT_LAUNCH影响: MEDIUM
ResCLIP: Residual Attention for Training-free Dense Vision-language Inference
2026-06-03PRODUCT_LAUNCH影响: MEDIUM
WISE: A World Knowledge-Informed Semantic Evaluation for Text-to-Image Generation
2026-06-03PRODUCT_LAUNCH影响: MEDIUM
Concept-wise Attention for Fine-grained Concept Bottleneck Models
2026-06-03PRODUCT_LAUNCH影响: MEDIUM
ASymPO: Asymmetric-Scale Policy Optimization for Asynchronous LLM Post-Training Without Behavior Information
2026-06-03PRODUCT_LAUNCH影响: MEDIUM
Local Guidance, Global Impact: Gaussian-Reshaped Trust Region Unlocks Behavior Transitions
2026-06-03PRODUCT_LAUNCH影响: MEDIUM
CourseTimeQA: A Lecture-Video Benchmark and a Latency-Constrained Cross-Modal Fusion Method for Timestamped QA
2026-06-03PRODUCT_LAUNCH影响: MEDIUM
CAST: Non-Privileged Clipped Asymmetric Self-Teaching with Advantage Flipping for GRPO
2026-06-02PRODUCT_LAUNCH影响: MEDIUM
Multi-Contrast MRI Motion Correction via Parameter-Informed Disentanglement and Adaptive Experts
2026-06-02ACQUISITION影响: HIGH
Multi-Contrast MRI Motion Correction via Parameter-Informed Disentanglement and Adaptive Experts
2026-06-02PRODUCT_LAUNCH影响: MEDIUM
Detect Before You Leap: Mirage Detection in Vision-Language Models
2026-06-02PRODUCT_LAUNCH影响: MEDIUM
JenBridge: Adaptive Long-Form Video Soundtracking across Scene Transitions
2026-06-02PRODUCT_LAUNCH影响: MEDIUM
Jailbreaking Multimodal Large Language Models using Multi-Clip Video
2026-06-02PRODUCT_LAUNCH影响: MEDIUM
Repurposing Adversarial Perturbations for Continual Learning: From Defense to Active Alignment
2026-06-02PRODUCT_LAUNCH影响: MEDIUM
v-HUB: A Benchmark for Video Humor Understanding from Vision and Sound
2026-06-02PRODUCT_LAUNCH影响: MEDIUM
v-HUB: A Benchmark for Video Humor Understanding from Vision and Sound
2026-06-02OPEN_SOURCE影响: MEDIUM
VocSim: A Training-free Benchmark for Zero-shot Content Identity in Single-source Audio
2026-06-02PRODUCT_LAUNCH影响: MEDIUM
Calibrating Uncertainty for Zero-Shot Adversarial CLIP
2026-06-02PRODUCT_LAUNCH影响: MEDIUM
Multi-Modal Learning meets Genetic Programming: Analyzing Alignment in Latent Space Optimization
2026-06-02PRODUCT_LAUNCH影响: MEDIUM
Revisiting Reinforcement Learning with Verifiable Rewards from a Contrastive Perspective
2026-06-02PRODUCT_LAUNCH影响: MEDIUM
CLIP Tricks You: Training-free Token Pruning for Efficient Pixel Grounding in Large VIsion-Language Models
2026-06-02PRODUCT_LAUNCH影响: MEDIUM
Stabilizing Policy Optimization via Logits Convexity
2026-06-02PRODUCT_LAUNCH影响: MEDIUM
Physics from Video: Identifiability of Time-Invariant Second-Order ODEs under Minimal Trajectory Conditions
2026-06-02PRODUCT_LAUNCH影响: MEDIUM