training · Product

Source: GitHub · OPEN_SOURCE · Open source · Python · Apache-2.0 · Released 2018-03-29

Reference implementations of MLPerf® training benchmarks

Stars: 1755

Forks: 586

training · Related events

2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM

Spectral Scaling Laws of Muon

2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM

AIP: A Graph Representation for Learning and Governing Agent Skills

2026-06-04 · SHUTDOWN · Impact: LOW

Spectral Scaling Laws of Muon

2026-06-04 · OPEN_SOURCE · Impact: MEDIUM

BiasGRPO: Stabilizing Bias Mitigation in High-Variance Reward Landscapes via Group-Relative Policy Optimization

2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM

Beyond Objective Equivalence: Constraint Injection for LLM-Based Optimization Modeling on Vehicle Routing Problems

2026-06-04 · BREAKTHROUGH · Impact: HIGH

How do machines learn? Evaluating the AIcon2abs method

2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM

DiffAero: A GPU-Accelerated Differentiable Simulation Framework for Efficient Quadrotor Policy Learning

2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM

The Variance Brain Foundation Models Forgot: Third-Order Statistics Predict Cognition Where Billion-Parameter Models Fail

2026-06-04 · BREAKTHROUGH · Impact: HIGH

Gravity-Aware Hierarchical Routing for Lightweight SensorLLM on Human Activity Recognition

2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM

Position: Deployed Reinforcement Learning should be Continual

2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM

Beyond Static Priors: Dynamic Neural Guidance for Large-Scale Ant Colony Optimization

2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM

Unlocking Feature Learning in Gated Delta Networks at Scale

2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM

POLARIS: Guiding Small Models to Write Long Stories

2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM

LLM Compression with Jointly Optimizing Architectural and Quantization choices

2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM

Large Language Models Hack Rewards, and Society

2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM

TPA-AD: A Two-Stage Pseudo Anomaly-Guided Method for Bearing Time-Series Anomaly Detection

2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM

StepPRM-RTL: Stepwise Process-Reward Guided LLM Fine-Tuning for Enhanced RTL Synthesis

2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM

Can Generalist Agents Automate Data Curation?

2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM

AgentJet: A Flexible Swarm Training Framework for Agentic Reinforcement Learning

2026-06-04 · REGULATION · Impact: MEDIUM

dMX: Differentiable Mixed-Precision Assignment for Low-Precision Floating-Point Formats

2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM

Building The Ph(ysical)AI Layer Of Machine Intelligence

2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM

SymTRELLIS: Symmetry-Enforced Voxel Latents for 3D Generation

2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM

Exact Unlearning in Reinforcement Learning

2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM

EvalStop: Using World Feedback to Detect and Correct Reward Overoptimization in Multi-Tenant RLHF Platforms

2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM

ADAPTOOD: Uncertainty-Aware Fine-Tuning for Out-of-Distribution ECG Time Series Models

2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM

Smart Transportation Without Neurons -- Fair Metro Network Expansion with Tabular Reinforcement Learning

2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM

Supportive Token Revealing for Fast Diffusion Language Model Decoding

2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM

Instant-Fold: In-Context Imitation Learning for Deformable Object Manipulation

2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM

Sparse Mixture-of-Experts Reward Models Learn Interpretable and Specialized Experts for Personalized Preference Modeling

2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM

Scaling Novel Graph Generation via Lightweight Structure-Guided Autoregressive Models

2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM

OpenRFM: Dissecting Relational In-Context Learning

2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM

MorphoQuant: Modality-Aware Quantization for Omni-modal Large Language Models

2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM

Multi-Granularity 3D Kidney Lesion Characterization from CT Volumes

2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM

Low-Rank Decay for Grokking in Scale-Invariant Transformers: A Spectral-Geometric View

2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM

An Empirical Study of Data Scale, Model Complexity, and Input Modalities in Visual Generalization

2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM

SePO: Self-Evolving Prompt Agent for System Prompt Optimization

2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM

ParetoPilot: Zero-Surrogate Offline Multi-Objective Optimization via Infer-Perturb-Guide Diffusion

2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM

Smart Picks in the Dark: Towards Efficient RLVR for Reasoning via Tracing Metacognitive Pivots

2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM

Self-Evolving Deep Research via Joint Generation and Evaluation

2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM

GeoMin: Data-Efficient Semi-Supervised RLVR via Geometric Distribution Modeling

2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM

Dynamic Infilling Anchors for Format-Constrained Generation in Diffusion Large Language Models

2026-06-04 · REGULATION · Impact: MEDIUM

Rollout-Level Advantage-Prioritized Experience Replay for GRPO

2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM

QuBLAST: A Framework for Quantizing Large Language Models with Block-Level Compression Approach and Activation Scaling Strategy

2026-06-04 · BREAKTHROUGH · Impact: HIGH