papers · 相关事件
相关事件
Ontology-constrained multi-LLM scoring of hypothesis support in the predictive processing literature
2026-06-06PRODUCT_LAUNCH影响: MEDIUM
Can AI Refute Economic Theory? Evidence from Beyond the Knowledge Cutoff
2026-06-06PRODUCT_LAUNCH影响: MEDIUM
Benchmarking Open-Source Layout Detection Models for Data Snapshot Extraction from Institutional Documents
2026-06-05PRODUCT_LAUNCH影响: MEDIUM
Benchmarking Open-Source Layout Detection Models for Data Snapshot Extraction from Institutional Documents
2026-06-05OPEN_SOURCE影响: MEDIUM
MIRAI: Prediction and Generation of High-Impact Academic Research
2026-06-05PRODUCT_LAUNCH影响: MEDIUM
Automatic Generation of Titles for Research Papers Using Language Models
2026-06-04PRODUCT_LAUNCH影响: MEDIUM
Automated Lexical Coverage for Language Learning: From General to Specialized Word Lists
2026-06-04PRODUCT_LAUNCH影响: MEDIUM
Merit or networks? What decides where research is published
2026-06-03PRODUCT_LAUNCH影响: MEDIUM
Crystal: Characterizing Relative Impact of Scholarly Publications
2026-06-03PRODUCT_LAUNCH影响: MEDIUM
Crystal: Characterizing Relative Impact of Scholarly Publications
2026-06-03BREAKTHROUGH影响: HIGH
ForeSci: Evaluating LLM Agents for Forward-Looking AI Research Judgment
2026-06-02PRODUCT_LAUNCH影响: MEDIUM
Ryze: Evidence-Enriched Data Synthesis from Biomedical Papers
2026-06-02PRODUCT_LAUNCH影响: MEDIUM
Can AI Review Improve Paper Drafting? An Empirical Study on 20 Computer Architecture Submissions
2026-06-02PRODUCT_LAUNCH影响: MEDIUM
Consistency evaluation of benchmarks used for causal discovery
2026-06-02PRODUCT_LAUNCH影响: MEDIUM
Make Mechanistic Interpretability Auditable: A Call to Develop Guidelines via Continuous Collaborative Reviewing
2026-06-02PRODUCT_LAUNCH影响: MEDIUM
TechGraphRAG: An Agentic Graph-Augmented RAG Framework for Technical Literature Reasoning
2026-06-02PRODUCT_LAUNCH影响: MEDIUM
A Primer in Post-Training Reasoning Data: What We Know About How It Works
2026-06-02PRODUCT_LAUNCH影响: MEDIUM
Who Annotates in NLP? A Large-scale Assessment of Human Annotation Reporting between 2018 and 2025
2026-06-02PRODUCT_LAUNCH影响: MEDIUM
AutoForest: Automatically Generating Forest Plots from Biomedical Studies with End-to-End Evidence Extraction and Synthesis
2026-06-02PRODUCT_LAUNCH影响: MEDIUM
Introduction to Graph Neural Networks for Machine Learning Engineers
2026-06-02PRODUCT_LAUNCH影响: MEDIUM
Language-Native Materials Processing Design by Lightly Structured Text Database and Reasoning Large Language Model
2026-06-02PRODUCT_LAUNCH影响: MEDIUM
Agreement Metrics for LLM-as-Judge Evaluation: What to Report and Why
2026-06-02PRODUCT_LAUNCH影响: MEDIUM
I-WebGenBench : Evaluating Interactivity in LLM-Generated Scientific Web Applications
2026-06-02PRODUCT_LAUNCH影响: MEDIUM
OARelatedWork: A Large-Scale Dataset of Related Work Sections with Full-texts from Open Access Sources
2026-06-02PRODUCT_LAUNCH影响: MEDIUM
ADRA-Bank: A Modular Benchmark for Academic Deep Research Agents
2026-06-02PRODUCT_LAUNCH影响: MEDIUM
PaperVoyager : Building Interactive Web with Visual Language Models
2026-06-02PRODUCT_LAUNCH影响: MEDIUM
A Data-Driven Approach to Idiomaticity Based on Experts' Criteria in Theoretical Linguistics
2026-06-02PRODUCT_LAUNCH影响: MEDIUM
HakushoBench: A Japanese Chart and Table VQA Benchmark from Governmental White Papers
2026-06-02PRODUCT_LAUNCH影响: MEDIUM
Industrializing Prediction-Powered Inference: The GLIDE Library for Reliable GenAI and Agentic Systems Evaluation
2026-06-01OPEN_SOURCE影响: MEDIUM
Industrializing Prediction-Powered Inference: The GLIDE Library for Reliable GenAI and Agentic Systems Evaluation
2026-06-01PRODUCT_LAUNCH影响: MEDIUM
Industrializing Prediction-Powered Inference: The GLIDE Library for Reliable GenAI and Agentic Systems Evaluation
2026-06-01BREAKTHROUGH影响: HIGH
Reading Between the Citations: A Typed Claim Network for Scientific Literature
2026-06-01PRODUCT_LAUNCH影响: MEDIUM
SPM-Bench: Benchmarking Large Language Models for Scanning Probe Microscopy
2026-06-01PRODUCT_LAUNCH影响: MEDIUM
SPM-Bench: Benchmarking Large Language Models for Scanning Probe Microscopy
2026-06-01BREAKTHROUGH影响: HIGH
Human-Alignment and Calibration of Inference-Time Uncertainty in Large Language Models
2026-06-01PRODUCT_LAUNCH影响: MEDIUM
AI for Monitoring and Classifying Data Used in Research Literature
2026-06-01PRODUCT_LAUNCH影响: MEDIUM
Inference-Free Multimodal Learned Sparse Retrieval for Production-Scale Visual Document Search
2026-06-01PRODUCT_LAUNCH影响: MEDIUM
PRAIB: Peer Review AI Benchmark of Behaviour of LLM-Assisted Reviewing
2026-05-29PRODUCT_LAUNCH影响: MEDIUM
Review Arcade: On the Human Alignment and Gameability of LLM Reviews
2026-05-29PRODUCT_LAUNCH影响: MEDIUM
DeepSurvey: Enhancing Analytical Depth and Citation Reliability in Automated Survey Generation
2026-05-29PRODUCT_LAUNCH影响: MEDIUM
PRAIB: Peer Review AI Benchmark of Behaviour of LLM-Assisted Reviewing
2026-05-29OPEN_SOURCE影响: MEDIUM
Compass: Navigating Global Marine Lead Data Integration through Expert-Guided LLM Agent
2026-05-29PRODUCT_LAUNCH影响: MEDIUM
AfriScience-MT: Towards Decolonizing Science in Africa through Text Translation
2026-05-29PRODUCT_LAUNCH影响: MEDIUM
AfriScience-MT: Towards Decolonizing Science in Africa through Text Translation
2026-05-29OPEN_SOURCE影响: MEDIUM
COMPOSE: Composing Future Theorems from Citations and Formal Structure
2026-05-29PRODUCT_LAUNCH影响: MEDIUM
DiagramRAG: A Lightweight Framework to Retrieve Scientific Diagram for Figure Generation
2026-05-28PRODUCT_LAUNCH影响: MEDIUM
From paper to benchmark: agentic, framework-based reproduction of under-specified methods in machine health intelligence
2026-05-28PRODUCT_LAUNCH影响: MEDIUM