Interview · 相关事件
相关事件
RAGPPI: RAG Benchmark for Protein-Protein Interactions in Drug Discovery
2026-06-12PRODUCT_LAUNCH影响: MEDIUM
C-QUERI: Congressional Questions, Exchanges, and Responses in Institutions Dataset
2026-06-12PRODUCT_LAUNCH影响: MEDIUM
LingxiDiagBench: A Multi-Agent Framework for Benchmarking LLMs in Chinese Psychiatric Consultation and Diagnosis
2026-06-12PRODUCT_LAUNCH影响: MEDIUM
Divination by Prompt: LLM-Mediated Xuanxue on Chinese Social Media
2026-06-12PRODUCT_LAUNCH影响: MEDIUM
From Awareness to Action: Understanding and Overcoming the Research-Practice Gap in Algorithmic Fairness for Public Health
2026-06-11PRODUCT_LAUNCH影响: MEDIUM
IntElicit: Eliciting and Assessing Contextualized Creativity via Dialogue Policy Optimization
2026-06-11PRODUCT_LAUNCH影响: MEDIUM
End-to-End Machine Learning for Depressive State Classification via EEG and fNIRS
2026-06-11PRODUCT_LAUNCH影响: MEDIUM
Frozen Multimodal Embeddings for Personality and Cognitive Ability Assessment in Asynchronous Video Interviews
2026-06-11PRODUCT_LAUNCH影响: MEDIUM
Dep-LLM: Training-Free Depression Diagnosis via Evidence-Guided Structured Multi-factor with Reliable LLM Reasoning
2026-06-10PRODUCT_LAUNCH影响: MEDIUM
Assessment of Personality Dimensions Across Situations in Dyadic Role-Play Scenarios
2026-06-10PRODUCT_LAUNCH影响: MEDIUM
Automated Alignment between Elicitation Interviews and Requirements
2026-06-10PRODUCT_LAUNCH影响: MEDIUM
Concerns and Strategic Responses of Older Workers Navigating Generative AI in Bridge Employment
2026-06-09PRODUCT_LAUNCH影响: MEDIUM
"So There's a Catch-22 Here": How Early Adopters Who Build Multi-Agent LLM Systems Conceptualize Transparency
2026-06-09PRODUCT_LAUNCH影响: MEDIUM
FieldWorkArena: Agentic AI Benchmark for Real Field Work Tasks
2026-06-09PRODUCT_LAUNCH影响: MEDIUM
Context Over Compute Human-in-the-Loop Outperforms Iterative Chain-of-Thought Prompting in Interview Answer Quality
2026-06-09PRODUCT_LAUNCH影响: MEDIUM
Measuring Agents in Production
2026-06-08PRODUCT_LAUNCH影响: MEDIUM
Synthetic Personalities: How Well Can LLMs Mimic Individual Respondents Using Socio-Economic Microdata?
2026-06-04PRODUCT_LAUNCH影响: MEDIUM
NBQ: Next-Best-Question for Dynamic Profiling
2026-06-02PRODUCT_LAUNCH影响: MEDIUM
Who Evaluates AI's Social Impacts? Mapping Coverage and Gaps in First and Third Party Evaluations
2026-06-02PRODUCT_LAUNCH影响: MEDIUM
Benchmarking and Enhancing Text-to-Image Models for Generating Visual Representations in Early Arithmetic Education
2026-06-01PRODUCT_LAUNCH影响: MEDIUM
Paper Agents, Paper Gains: An Empirical Analysis of DeFi Investment Agents
2026-05-29PRODUCT_LAUNCH影响: MEDIUM
Surfacing Isolated Learners with Outcome-Independent Mediation of Feedback between Teachers and Students Using AI
2026-05-29PRODUCT_LAUNCH影响: MEDIUM
Adaptive Interviewing for Persona Simulation in LLMs: Evidence-Grounded Reasoning Improves Decision Alignment
2026-05-29PRODUCT_LAUNCH影响: MEDIUM
The Trust Paradox: How CS Researchers Engage LLM Leaderboards
2026-05-29PRODUCT_LAUNCH影响: MEDIUM
When Seekers Are Hard to Help: Evaluating Emotional Support Dialogue Systems in Worst-Case Interactions
2026-05-28PRODUCT_LAUNCH影响: MEDIUM
StoryMI: Steerable Multi-Agent Therapeutic Dialogue Generation
2026-05-28PRODUCT_LAUNCH影响: MEDIUM
CALM-IT: Generating Realistic Long-Form Motivational Interviewing Dialogues with Dual-Actor Conversational Dynamics Tracking
2026-05-28PRODUCT_LAUNCH影响: MEDIUM
Analyzing Cancer Patients' Experiences with Embedding-based Topic Modeling and LLMs
2026-05-28PRODUCT_LAUNCH影响: MEDIUM
AI in the Workplace: The Impact of AI on Perceived Job Decency and Meaningfulness
2026-05-28PRODUCT_LAUNCH影响: MEDIUM
Heterogeneous Causal Discovery of Repeated Undesirable Health Outcomes
2026-05-28PRODUCT_LAUNCH影响: MEDIUM
An investigation of AI integration in sound designer workflows and experiences
2026-05-27PRODUCT_LAUNCH影响: MEDIUM
From Attribution to Action: A Human-Centered Application of Activation Steering
2026-05-27PRODUCT_LAUNCH影响: MEDIUM
A Multi-Probe Audit of Clinical-Interview Depression Detection Benchmarks
2026-05-26PRODUCT_LAUNCH影响: MEDIUM
When Symptoms Are Not Enough: Evidence-Weighting Patterns in Large Language Model Psychiatric Screening
2026-05-26PRODUCT_LAUNCH影响: MEDIUM
When Symptoms Are Not Enough: Evidence-Weighting Patterns in Large Language Model Psychiatric Screening
2026-05-26BREAKTHROUGH影响: HIGH
RCTs & Human Uplift Studies: Methodological Challenges and Practical Solutions for Frontier AI Evaluation
2026-05-26PRODUCT_LAUNCH影响: MEDIUM
RCTs & Human Uplift Studies: Methodological Challenges and Practical Solutions for Frontier AI Evaluation
2026-05-26REGULATION影响: MEDIUM
Two-Sided Time-Independent Regret for Matching Markets with Limited Interviews
2026-05-26PRODUCT_LAUNCH影响: MEDIUM