sonnet Product

Source: GitHub · OPEN_SOURCE · Open source · Python · Apache-2.0 · Released 2017-04-03

TensorFlow-based neural network library

Stars: 9919

Forks: 1309

Tech stack

Alternatives

sonnet · Related events

2026-06-05 · PRODUCT_LAUNCH · Impact: MEDIUM

Domain-Conditioned Safety in Frontier Computer-Using Agents: A 793-Episode Browser Benchmark, a Coding-Domain Cross-Reference, and a Reproducibility Audit of Recent Red-Teaming

2026-06-05 · SHUTDOWN · Impact: LOW

Statistically Reliable LLM-Based Ranking Evaluation via Prediction-Powered Inference

2026-06-05 · PRODUCT_LAUNCH · Impact: MEDIUM

Ten Headache Specialists versus Artificial Intelligence for Clinical Literature Summarization: A Critical Evaluation and Comparison

2026-06-05 · BREAKTHROUGH · Impact: HIGH

Scaffold, Not Vocabulary? A Controlled, Two-Tier, Pre-Registered Study of a Popperian Code-Generation Skill

2026-06-05 · PRODUCT_LAUNCH · Impact: MEDIUM

Scaffold, Not Vocabulary? A Controlled, Two-Tier, Pre-Registered Study of a Popperian Code-Generation Skill

2026-06-05 · SHUTDOWN · Impact: LOW

CLASH: Evaluating Language Models on Judging High-Stakes Dilemmas from Multiple Perspectives

2026-06-05 · PRODUCT_LAUNCH · Impact: MEDIUM

Ontology-Constrained Neural Reasoning in Enterprise Agentic Systems: A Neurosymbolic Architecture for Domain-Grounded AI Agents

2026-06-05 · REGULATION · Impact: MEDIUM

AIP: A Graph Representation for Learning and Governing Agent Skills

2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM

AIP: A Graph Representation for Learning and Governing Agent Skills

2026-06-04 · SHUTDOWN · Impact: LOW

Long Live Fine-Tuning: Task-Specific Transformers Outperform Zero-Shot LLMs for Misinformation Response Classification on Reddit

2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM

GTBench: A Curriculum-Grounded Benchmark for Evaluating LLMs as Mathematical Research Assistants in Graph Theory

2026-06-03 · PRODUCT_LAUNCH · Impact: MEDIUM

Gender-Dependent Diagnostic Substitution in LLM Medical Triage: Same Symptoms, Unequal Urgency

2026-06-03 · PRODUCT_LAUNCH · Impact: MEDIUM

MultiTurnPSB: Evaluating Multi-Turn Jailbreak Attacks and Classifier-Based Defenses for Medical AI Safety

2026-06-03 · REGULATION · Impact: MEDIUM

Dynamic Coordination Strategy Selection for Enterprise Multi-Agent Systems

2026-06-02 · PRODUCT_LAUNCH · Impact: MEDIUM

LLM Consortium for Software Design Refinement: A Controlled Experiment on Multi-Agent Collaboration Topologies

2026-06-02 · PRODUCT_LAUNCH · Impact: MEDIUM

From Outliers to Errors: Auditing Pali-to-English LLM Translations with Multi-Reference Adjudication

2026-06-02 · PRODUCT_LAUNCH · Impact: MEDIUM

Quality-Diversity Evolution for Discovering Diverse Vulnerabilities in LLM Safety

2026-06-02 · PRODUCT_LAUNCH · Impact: MEDIUM

Beyond Scalar Rewards: Dense Feedback for LLM Policy Synthesis in Sequential Social Dilemmas

2026-06-02 · PRODUCT_LAUNCH · Impact: MEDIUM

ImmigrationQA: A Source-Grounded Dataset and Small-Model Adaptation for U.S. Immigration Law

2026-06-01 · PRODUCT_LAUNCH · Impact: MEDIUM

ImmigrationQA: A Source-Grounded Dataset and Small-Model Adaptation for U.S. Immigration Law

2026-06-01 · REGULATION · Impact: MEDIUM

Scaling Monosemanticity: Extracting Interpretable Features from Claude 3 Sonnet