rio · 相关事件
相关事件
A case study of evaluating AI agents on a neuroscience data-to-discovery pipeline
2026-06-09PRODUCT_LAUNCH影响: MEDIUM
Beyond Goodhart's Law: A Dynamic Benchmark for Evaluating Compliance in Multi-Agent Systems
2026-06-09PRODUCT_LAUNCH影响: MEDIUM
Beyond Goodhart's Law: A Dynamic Benchmark for Evaluating Compliance in Multi-Agent Systems
2026-06-09REGULATION影响: MEDIUM
Scaling Participation in Modular AI Systems
2026-06-09PRODUCT_LAUNCH影响: MEDIUM
Safety is Contextual, LLM-Judges Are Not: Navigating the Rigid Priors of Evaluators
2026-06-09PRODUCT_LAUNCH影响: MEDIUM
EditSR: Enhancing Neural Symbolic Regression via Edit-based Rectification
2026-06-09PRODUCT_LAUNCH影响: MEDIUM
Unification of Closed-Open Industrial Detection Scenarios: New Large-Scale Benchmarks,Challenges and Baselines
2026-06-09PRODUCT_LAUNCH影响: MEDIUM
Zero-Shot Learning in Industrial Scenarios: New Large-Scale Benchmark, Challenges and Baseline
2026-06-09PRODUCT_LAUNCH影响: MEDIUM
Online Agent-as-a-Judge: Situation-Generating Evaluation for Interactive Agents
2026-06-09PRODUCT_LAUNCH影响: MEDIUM
To Nuke or Not to Nuke: LLMs' (Missing) Ethical Reasoning and Actions in a High-Stakes Decision-Making Simulation
2026-06-09PRODUCT_LAUNCH影响: MEDIUM
VESTA: A Fully Automated Scenario Generation and Safety Evaluation Framework for LLM Agents
2026-06-09PRODUCT_LAUNCH影响: MEDIUM
DN-Hypo-Pipeline: An AI-Driven Workflow for Hypothesis Generation via Large Language Models and Scientific Explanations
2026-06-09PRODUCT_LAUNCH影响: MEDIUM
Distilling LLM Reasoning into an Interpretable Policy Tree for Human-AI Collaboration
2026-06-09PRODUCT_LAUNCH影响: MEDIUM
InA-Probe: Instruction-Aware Active Probing for Time Series Forecasting with LLMs
2026-06-09PRODUCT_LAUNCH影响: MEDIUM
AlloSpatial: Agentic Harness Framework for Spatial Reasoning in Foundation Models
2026-06-09PRODUCT_LAUNCH影响: MEDIUM
A Multi-Agent System for IPMSM Design Optimization via an FEA-AI Hybrid Approach
2026-06-09PRODUCT_LAUNCH影响: MEDIUM
ComplexConstraints and Beyond: Expert Rubrics for RLVR
2026-06-09PRODUCT_LAUNCH影响: MEDIUM
Vision Language Model Helps Private Information De-Identification in Vision Data
2026-06-09PRODUCT_LAUNCH影响: MEDIUM
IMUG-Bench: Benchmarking Unified Multimodal Models on Interleaved Understanding and Generation
2026-06-09PRODUCT_LAUNCH影响: MEDIUM
Anything2Skill: Compiling External Knowledge into Reusable Skills for Agents
2026-06-09PRODUCT_LAUNCH影响: MEDIUM
Anything2Skill: Compiling External Knowledge into Reusable Skills for Agents
2026-06-09SHUTDOWN影响: LOW
Experience Makes Skillful: Enabling Generalizable Medical Agent Reasoning via Self-Evolving Skill Memory
2026-06-09PRODUCT_LAUNCH影响: MEDIUM
From Coarse to Fine: Managing Temporal Granularity in Spatio-Temporal Data for Fine-Grained Traffic Prediction
2026-06-09ACQUISITION影响: HIGH
Capability-Aligned Hierarchical Learning for Tool-Augmented LLMs
2026-06-09PRODUCT_LAUNCH影响: MEDIUM
From Coarse to Fine: Managing Temporal Granularity in Spatio-Temporal Data for Fine-Grained Traffic Prediction
2026-06-09PRODUCT_LAUNCH影响: MEDIUM
Capacity, Not Format: Rethinking Structured Reasoning Failures
2026-06-09PRODUCT_LAUNCH影响: MEDIUM
Bayesian Selective Latent Inference for Wastewater-First Influenza Monitoring
2026-06-09ACQUISITION影响: HIGH
Bayesian Selective Latent Inference for Wastewater-First Influenza Monitoring
2026-06-09PRODUCT_LAUNCH影响: MEDIUM
SIFT: Selective-Index For Fast Compute of RAG Prefill by Exploiting Attention Invariance
2026-06-09PRODUCT_LAUNCH影响: MEDIUM
Concerns and Strategic Responses of Older Workers Navigating Generative AI in Bridge Employment
2026-06-09PRODUCT_LAUNCH影响: MEDIUM
The Montparnasse Algorithm for RNA Design
2026-06-09PRODUCT_LAUNCH影响: MEDIUM
The Montparnasse Algorithm for RNA Design
2026-06-09BREAKTHROUGH影响: HIGH
SurfDesign: Effective Protein Design on Molecular Surfaces
2026-06-09PRODUCT_LAUNCH影响: MEDIUM
Customer Churn Prediction on Structured Data Using FT-Transformer and Stacking Ensembles
2026-06-09PRODUCT_LAUNCH影响: MEDIUM
SRT: Super-Resolution for Time Series via Disentangled Rectified Flow
2026-06-09ACQUISITION影响: HIGH
SRT: Super-Resolution for Time Series via Disentangled Rectified Flow
2026-06-09PRODUCT_LAUNCH影响: MEDIUM
Can You Trust What You See? Human and AI Detection of Synthetic Legal Evidence
2026-06-09PRODUCT_LAUNCH影响: MEDIUM
Eyes All Around: Design and Analysis of 360-Degree LiDAR Perception Using Equivariant Feature Learning in Unstructured Traffic
2026-06-09PRODUCT_LAUNCH影响: MEDIUM
Active Learning with Foundation Model Priors: Efficient Learning under Class Imbalance
2026-06-09PRODUCT_LAUNCH影响: MEDIUM
AgentCompile: An LLM-Guided Compiler for Direct CUDA Inference
2026-06-09PRODUCT_LAUNCH影响: MEDIUM
MemoVAD: Resource-Efficient Video Anomaly Detection via Dynamic Semantic Memory in Edge Computing Scenarios
2026-06-09PRODUCT_LAUNCH影响: MEDIUM
HARP: Efficient Data Selection for Finetuning Large Language Models
2026-06-09PRODUCT_LAUNCH影响: MEDIUM
Pharmacogenomic Knowledge Graph Augmentation for Graph Neural Network-Based Drug-Drug Interaction Prediction
2026-06-09PRODUCT_LAUNCH影响: MEDIUM
MLingualFC: Evaluating Jailbreak Vulnerabilities in Multilingual Vision-Language Models
2026-06-09PRODUCT_LAUNCH影响: MEDIUM
MLingualFC: Evaluating Jailbreak Vulnerabilities in Multilingual Vision-Language Models
2026-06-09BREAKTHROUGH影响: HIGH
Cross-View Urban Traffic Dataset: Drone-Supervised Ground Truth for Monocular Bird's-Eye View Localization
2026-06-09PRODUCT_LAUNCH影响: MEDIUM
SHIELD-IDS: Structurally Heterogeneous Ensemble with Integrated Layered Defense for Intrusion Detection Systems
2026-06-09PRODUCT_LAUNCH影响: MEDIUM