CLIP 产品

来源: githubOPEN_SOURCE开源Jupyter NotebookMIT发布于 2020-12-16

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

33477

Stars

4003

Forks

技术栈

替代方案

CLIP · 相关事件

相关事件

Soft Sequence Policy Optimization

2026-06-06PRODUCT_LAUNCH影响: MEDIUM

GIPO: Gaussian Importance Sampling Policy Optimization

2026-06-06PRODUCT_LAUNCH影响: MEDIUM

Brain-CLIPLM: Semantic Compression for EEG-to-Text Decoding

2026-06-05PRODUCT_LAUNCH影响: MEDIUM

BRepCLIP: Contrastive Multimodal Pretraining on BRep Primitives for CAD Understanding

2026-06-05PRODUCT_LAUNCH影响: MEDIUM

Emotion-Aware Image Generation from Korean Diary Text via LLM-based Prompt Translation and LoRA Fine-Tuning

2026-06-05PRODUCT_LAUNCH影响: MEDIUM

Adversarial Attacks Already Tell the Answer: Directional Bias-Guided Test-time Defense for Vision-Language Models

2026-06-05PRODUCT_LAUNCH影响: MEDIUM

Towards Accurate Heart Rate Measurement from Ultra-Short Video Clips via Periodicity-Guided rPPG Estimation and Signal Reconstruction

2026-06-05PRODUCT_LAUNCH影响: MEDIUM

UnHype: CLIP-Guided Hypernetworks for Dynamic LoRA Unlearning

2026-06-05PRODUCT_LAUNCH影响: MEDIUM

Unified Pix Token And Word Token Generative Language Model

2026-06-05ACQUISITION影响: HIGH

Unified Pix Token And Word Token Generative Language Model

2026-06-05PRODUCT_LAUNCH影响: MEDIUM

Unified Pix Token And Word Token Generative Language Model

2026-06-05OPEN_SOURCE影响: MEDIUM

NoRA: Evaluating Grounded Reasonableness in Visual First-person Normative Action Reasoning

2026-06-04PRODUCT_LAUNCH影响: MEDIUM

BioBlue: Systematic runaway-optimiser-like LLM failure modes on biologically and economically aligned AI safety benchmarks for LLMs with simplified observation format

2026-06-04PRODUCT_LAUNCH影响: MEDIUM

BioBlue: Systematic runaway-optimiser-like LLM failure modes on biologically and economically aligned AI safety benchmarks for LLMs with simplified observation format

2026-06-04REGULATION影响: MEDIUM

Revisiting Model Stitching In the Foundation Model Era

2026-06-04PRODUCT_LAUNCH影响: MEDIUM

Segment, Embed, and Align: A Universal Recipe for Aligning Subtitles to Signing

2026-06-04PRODUCT_LAUNCH影响: MEDIUM

UniFine: A Unified and Fine-grained Approach for Zero-shot Vision-Language Understanding

2026-06-04PRODUCT_LAUNCH影响: MEDIUM

VT-3DAD: Cross-Category 3D Anomaly Detection via Visual-Text Normal Space Alignment

2026-06-04PRODUCT_LAUNCH影响: MEDIUM

Drift-Augmented Scoring: Text-Derived Noise Robustness for Zero-Shot Audio-Language Classification

2026-06-04PRODUCT_LAUNCH影响: MEDIUM

Robust-LLaVA: On the Effectiveness of Large-Scale Robust Image Encoders for Multi-modal Large Language Models

2026-06-04PRODUCT_LAUNCH影响: MEDIUM

SkelHCC: A Hyperbolic CLIP-Driven Cache Adaptation Framework for Skeleton-based One-Shot Action Recognition

2026-06-03PRODUCT_LAUNCH影响: MEDIUM

VidMsg: A Benchmark for Implicit Message Inference in Short Videos

2026-06-03PRODUCT_LAUNCH影响: MEDIUM

Investigating Adversarial Robustness of Multi-modal Large Language Models

2026-06-03PRODUCT_LAUNCH影响: MEDIUM

Beyond False Stability: High-Noise Drift Gating for Test-Time Adversarial Defenses in Vision-Language Models

2026-06-03PRODUCT_LAUNCH影响: MEDIUM

Beyond Compression: Quantifying Spectral Accessibility in Vision Representations

2026-06-03PRODUCT_LAUNCH影响: MEDIUM

Benchmarking Visual State Tracking in Multimodal Video Understanding

2026-06-03PRODUCT_LAUNCH影响: MEDIUM

Benchmarking Visual State Tracking in Multimodal Video Understanding

2026-06-03BREAKTHROUGH影响: HIGH

SagaQA: A Multi-hop Reasoning Benchmark for Long-form Narrative Understanding in TV Series

2026-06-03PRODUCT_LAUNCH影响: MEDIUM

ResCLIP: Residual Attention for Training-free Dense Vision-language Inference

2026-06-03PRODUCT_LAUNCH影响: MEDIUM

WISE: A World Knowledge-Informed Semantic Evaluation for Text-to-Image Generation

2026-06-03PRODUCT_LAUNCH影响: MEDIUM

Concept-wise Attention for Fine-grained Concept Bottleneck Models

2026-06-03PRODUCT_LAUNCH影响: MEDIUM

ASymPO: Asymmetric-Scale Policy Optimization for Asynchronous LLM Post-Training Without Behavior Information

2026-06-03PRODUCT_LAUNCH影响: MEDIUM

Local Guidance, Global Impact: Gaussian-Reshaped Trust Region Unlocks Behavior Transitions

2026-06-03PRODUCT_LAUNCH影响: MEDIUM

CourseTimeQA: A Lecture-Video Benchmark and a Latency-Constrained Cross-Modal Fusion Method for Timestamped QA

2026-06-03PRODUCT_LAUNCH影响: MEDIUM

CAST: Non-Privileged Clipped Asymmetric Self-Teaching with Advantage Flipping for GRPO