axon · Related Events
Related Events
Model Context Protocols in Adaptive Transport Systems: A Survey
2026-06-08 · PRODUCT_LAUNCH · Impact: MEDIUM
SWE-IF: Aligning Code Evaluation with Human Preference
2026-06-08 · PRODUCT_LAUNCH · Impact: MEDIUM
Automated Root-Cause Subclassification and No-Code Fix Generation for Invalid Bug Reports
2026-06-08 · PRODUCT_LAUNCH · Impact: MEDIUM
Korean Culture into LLM Alignment: Toward Cultural Coherence
2026-06-08 · PRODUCT_LAUNCH · Impact: MEDIUM
MADE: Beyond Scoring via a Multilingual Agentic Diagnosing Engine for Fine-Grained Evaluation Insights
2026-06-08 · PRODUCT_LAUNCH · Impact: MEDIUM
Reference-Free Evaluation of Taxonomies
2026-06-08 · PRODUCT_LAUNCH · Impact: MEDIUM
A Dynamic Self-Evolving Extraction System
2026-06-08 · PRODUCT_LAUNCH · Impact: MEDIUM
WorldBench: A Challenging and Visually Diverse Multimodal Reasoning Benchmark
2026-06-08 · PRODUCT_LAUNCH · Impact: MEDIUM
Individual Gain, Collective Loss: Metacognitive Adaptation in AI-Assisted Creativity
2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM
When Tools Fail: Benchmarking Dynamic Replanning and Anomaly Recovery in LLM Agents
2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM
Agent Memory: Characterization and System Implications of Stateful Long-Horizon Workloads
2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM
A Taxonomy of Runtime Faults in Model Context Protocol Servers
2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM
Agents' Last Exam
2026-06-05 · PRODUCT_LAUNCH · Impact: MEDIUM
Leveraging Large Language Models for Generating Research Topic Ontologies: A Multi-Disciplinary Study
2026-06-05 · PRODUCT_LAUNCH · Impact: MEDIUM
Cascading Hallucination in Agentic RAG: The CHARM Framework for Detection and Mitigation
2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM
Supportive Token Revealing for Fast Diffusion Language Model Decoding
2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM
From Untrusted Input to Trusted Memory: A Systematic Study of Memory Poisoning Attacks in LLM Agents
2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM
Description-Code Inconsistency in Real-world MCP Servers: Measurement, Detection, and Security Implications
2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM
Do LLMs Hold Their Values? MANTA: A Multi-Turn Adversarial Benchmark for Animal Welfare Reasoning
2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM
BEATS: Bootstrapping E-commerce Attribute Taxonomies for Search through Iterative Human-AI Collaboration
2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM
Talk is (Not) Cheap: A Taxonomy and Benchmark Coverage Audit for LLM Attacks
2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM
Diagnosis of Human Object Interaction Detectors for Real World Educational Applications
2026-06-03 · PRODUCT_LAUNCH · Impact: MEDIUM
Diagnosis of Human Object Interaction Detectors for Real World Educational Applications
2026-06-03 · BREAKTHROUGH · Impact: HIGH
Learning to See via Epiretinal Implant Stimulation in silico with Model-Based Deep Reinforcement Learning
2026-06-03 · PRODUCT_LAUNCH · Impact: MEDIUM
T2AV-Compass: Towards Unified Evaluation for Text-to-Audio-Video Generation
2026-06-03 · PRODUCT_LAUNCH · Impact: MEDIUM
AUDITFLOW: Executable Symbolic Environments for Structured Financial Reporting Verification
2026-06-03 · PRODUCT_LAUNCH · Impact: MEDIUM
DeskCraft: Benchmarking Desktop Agents on Professional Workflows and Human-in-the-Loop Collaboration
2026-06-03 · PRODUCT_LAUNCH · Impact: MEDIUM
When RLHF Fails: A Mechanistic Taxonomy of Reward Hacking, Collapse, and Evaluator Gaming
2026-06-03 · PRODUCT_LAUNCH · Impact: MEDIUM
FORGE: Multi-Agent Graduated Exploitation and Detection Engineering
2026-06-03 · PRODUCT_LAUNCH · Impact: MEDIUM
A cross-domain tropical species dataset with Chinese vernacular names and CITES source links
2026-06-03 · PRODUCT_LAUNCH · Impact: MEDIUM
Cut Your Losses! Learning to Prune Paths Early for Efficient Parallel Reasoning
2026-06-03 · PRODUCT_LAUNCH · Impact: MEDIUM
CogRAG: Tackling Heterogeneous Cognitive Demands in RAG via Stratified Retrieval and Reasoning
2026-06-03 · PRODUCT_LAUNCH · Impact: MEDIUM
ForeSci: Evaluating LLM Agents for Forward-Looking AI Research Judgment
2026-06-02 · PRODUCT_LAUNCH · Impact: MEDIUM
Comprehensive AI governance requires addressing non-model gains
2026-06-02 · PRODUCT_LAUNCH