elicit (Company)
Products: 1
Patents: 0
elicit · Related Events
MUSE: A Unified Agentic Harness for MLLMs
2026-06-03 · BREAKTHROUGH · Impact: HIGH
MUSE: A Unified Agentic Harness for MLLMs
2026-06-03 · PRODUCT_LAUNCH · Impact: MEDIUM
Eliciting Complex Spatial Reasoning in MLLMs through Wide-Baseline Matching
2026-06-03 · PRODUCT_LAUNCH · Impact: MEDIUM
Causal Preference Elicitation
2026-06-03 · PRODUCT_LAUNCH · Impact: MEDIUM
Whom to Query for What: Adaptive Group Elicitation via Multi-Turn LLM Interactions
2026-06-03 · PRODUCT_LAUNCH · Impact: MEDIUM
See, Infer, Intervene: Proactive World Modeling for Goal-Oriented Social Intelligence
2026-06-03 · PRODUCT_LAUNCH · Impact: MEDIUM
Large Language Models Are Overconfident in Their Own Responses
2026-06-03 · PRODUCT_LAUNCH · Impact: MEDIUM
Hidden Thoughts Are Not Secret: Reasoning Trace Exposure in LLMs
2026-06-02 · PRODUCT_LAUNCH · Impact: MEDIUM
AI From the Margins (AIM): Rethinking Participatory AI Design Through the Lived Experience of Minoritized Communities
2026-06-02 · PRODUCT_LAUNCH · Impact: MEDIUM
On Information Self-Locking in Reinforcement Learning for Active Reasoning of LLM agents
2026-06-02 · PRODUCT_LAUNCH · Impact: MEDIUM
REALISTA: Realistic Latent Adversarial Attacks that Elicit LLM Hallucinations
2026-06-02 · PRODUCT_LAUNCH · Impact: MEDIUM
Deep Research as Rubric for Reinforcement Learning
2026-06-02 · PRODUCT_LAUNCH · Impact: MEDIUM
OncoReason: Structuring Clinical Reasoning in LLMs for Robust and Interpretable Survival Prediction
2026-06-02 · PRODUCT_LAUNCH · Impact: MEDIUM
Post-Training LLMs as Better Decision-Making Agents: A Regret-Minimization Approach
2026-06-01 · PRODUCT_LAUNCH · Impact: MEDIUM
Dual Mechanisms of Value Expression: Intrinsic vs. Prompted Values in Large Language Models
2026-06-01 · PRODUCT_LAUNCH · Impact: MEDIUM
Empirical Characterization of Inference-Time Elicited Probability Transformations in Large Language Models
2026-06-01 · PRODUCT_LAUNCH · Impact: MEDIUM
PROWL: Prioritized Regret-Driven Optimization for World Model Learning
2026-06-01 · PRODUCT_LAUNCH · Impact: MEDIUM
Robust Dreamer: Deviation-Aware Latent Gaussian Memory for Action-Controlled AR Video Generation
2026-06-01 · PRODUCT_LAUNCH · Impact: MEDIUM
Training Deliberative Monitors for Black-Box Scheming Detection
2026-05-29 · PRODUCT_LAUNCH · Impact: MEDIUM
Reasoning with Sampling: Cutting at Decision Points
2026-05-29 · PRODUCT_LAUNCH · Impact: MEDIUM
MechELK: A Mechanistic Interpretability Framework for Eliciting Latent Knowledge in Large Language Models
2026-05-29 · PRODUCT_LAUNCH · Impact: MEDIUM
User-Aware Active Knowledge Acquisition for Emotional Support Dialogue
2026-05-29 · ACQUISITION · Impact: HIGH
User-Aware Active Knowledge Acquisition for Emotional Support Dialogue
2026-05-29 · PRODUCT_LAUNCH · Impact: MEDIUM
Metric-Dependent Annotation Saturation for Learning from Label Distributions
2026-05-29 · PRODUCT_LAUNCH · Impact: MEDIUM
HE-SNR: Uncovering Latent Logic via Entropy for Guiding Mid-Training on SWE-bench
2026-05-29 · ACQUISITION · Impact: HIGH
HE-SNR: Uncovering Latent Logic via Entropy for Guiding Mid-Training on SWE-bench
2026-05-29 · PRODUCT_LAUNCH · Impact: MEDIUM
How Far Ahead Do LLMs Plan? Uncovering the Latent Horizon in Chain-of-Thought Reasoning
2026-05-29 · PRODUCT_LAUNCH · Impact: MEDIUM
Multi-Turn Adaptive Prompting Attack on Large Vision-Language Models
2026-05-29 · PRODUCT_LAUNCH · Impact: MEDIUM
Asking Is Not Enough: Protocol Sensitivity in LLM Confidence Calibration
2026-05-28 · PRODUCT_LAUNCH · Impact: MEDIUM
On Compositional Learning Behaviours in Formal Mathematics
2026-05-28 · PRODUCT_LAUNCH · Impact: MEDIUM
GraphSteal: Structural Knowledge Stealing from Graph RAG via Traversal Reconstruction
2026-05-28 · PRODUCT_LAUNCH · Impact: MEDIUM
Self-Consistency via Marginal Sharpening
2026-05-28 · PRODUCT_LAUNCH · Impact: MEDIUM
SNARE: Adaptive Scenario Synthesis for Eliciting Overeager Behavior in Coding Agents
2026-05-28 · PRODUCT_LAUNCH · Impact: MEDIUM
Formula-One Prompting: A Composable Equation-First Prefix for Applied Mathematics
2026-05-28 · PRODUCT_LAUNCH · Impact: MEDIUM
Structure-Guided Visual Perturbation Neutralization for LVLMs
2026-05-28 · PRODUCT_LAUNCH · Impact: MEDIUM
PICACO: Pluralistic In-Context Value Alignment of LLMs via Total Correlation Optimization
2026-05-28 · PRODUCT_LAUNCH · Impact: MEDIUM
Compliance versus Sensibility: On the Reasoning Controllability in Large Language Models
2026-05-28 · ACQUISITION · Impact: HIGH
Compliance versus Sensibility: On the Reasoning Controllability in Large Language Models
2026-05-28 · PRODUCT_LAUNCH · Impact: MEDIUM
Compliance versus Sensibility: On the Reasoning Controllability in Large Language Models
2026-05-28 · REGULATION · Impact: MEDIUM
VisualNeedle: Benchmarking Active Visual Search in Information-Dense Scenes
2026-05-27 · PRODUCT_LAUNCH · Impact: MEDIUM
Beyond Questions: Evaluating What Large Language Models (Actually) Know
2026-05-27 · PRODUCT_LAUNCH · Impact: MEDIUM
LiPUP-MA: A Residential Experience-centric Multi-Agent Framework for Living-in-the-loop Participatory Urban Planning
2026-05-27 · PRODUCT_LAUNCH · Impact: MEDIUM
Post-training makes large language models less human-like
2026-05-27 · PRODUCT_LAUNCH · Impact: MEDIUM
Med-CoReasoner: Reducing Language Disparities in Medical Reasoning via Language-Informed Co-Reasoning
2026-05-27 · PRODUCT_LAUNCH · Impact: MEDIUM
Dimensional Distribution Emotion State: Leveraging Valence and Arousal as a Common Embedding Space for Visual Emotion Analysis
2026-05-27 · PRODUCT_LAUNCH · Impact: MEDIUM
Direct Preference Optimization for English-Mandarin Code-Switching Speech Recognition in Audio LLMs
2026-05-26 · PRODUCT_LAUNCH · Impact: MEDIUM
JudgmentBench: Comparing Rubric and Preference Evaluation for Quality Assessment
2026-05-26 · PRODUCT_LAUNCH · Impact: MEDIUM
GroupTravelBench: Benchmarking LLM Agents on Multi-Person Travel Planning
2026-05-26 · PRODUCT_LAUNCH · Impact: MEDIUM
Clarification Is Not Enough: Post-Clarification Answering Remains the Bottleneck in Multi-Turn QA
2026-05-26 · PRODUCT_LAUNCH · Impact: MEDIUM
Learning to Route Languages for Multilingual Policy Optimization
2026-05-26 · PRODUCT_LAUNCH · Impact: MEDIUM