training · Related events
DEER: Disentangled Mixture of Experts with Instance-Adaptive Routing for Generalizable Machine-Generated Text Detection
2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM
Spectral Scaling Laws of Muon
2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM
AIP: A Graph Representation for Learning and Governing Agent Skills
2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM
AIP: A Graph Representation for Learning and Governing Agent Skills
2026-06-04 · SHUTDOWN · Impact: LOW
Spectral Scaling Laws of Muon
2026-06-04 · OPEN_SOURCE · Impact: MEDIUM
BiasGRPO: Stabilizing Bias Mitigation in High-Variance Reward Landscapes via Group-Relative Policy Optimization
2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM
Beyond Objective Equivalence: Constraint Injection for LLM-Based Optimization Modeling on Vehicle Routing Problems
2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM
Spectral Scaling Laws of Muon
2026-06-04 · BREAKTHROUGH · Impact: HIGH
How do machines learn? Evaluating the AIcon2abs method
2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM
DiffAero: A GPU-Accelerated Differentiable Simulation Framework for Efficient Quadrotor Policy Learning
2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM
The Variance Brain Foundation Models Forgot: Third-Order Statistics Predict Cognition Where Billion-Parameter Models Fail
2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM
Gravity-Aware Hierarchical Routing for Lightweight SensorLLM on Human Activity Recognition
2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM
Position: Deployed Reinforcement Learning should be Continual
2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM
Beyond Static Priors: Dynamic Neural Guidance for Large-Scale Ant Colony Optimization
2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM
Unlocking Feature Learning in Gated Delta Networks at Scale
2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM
POLARIS: Guiding Small Models to Write Long Stories
2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM
LLM Compression with Jointly Optimizing Architectural and Quantization choices
2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM
Large Language Models Hack Rewards, and Society
2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM
TPA-AD: A Two-Stage Pseudo Anomaly-Guided Method for Bearing Time-Series Anomaly Detection
2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM
StepPRM-RTL: Stepwise Process-Reward Guided LLM Fine-Tuning for Enhanced RTL Synthesis
2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM
Can Generalist Agents Automate Data Curation?
2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM
AgentJet: A Flexible Swarm Training Framework for Agentic Reinforcement Learning
2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM
Large Language Models Hack Rewards, and Society
2026-06-04 · REGULATION · Impact: MEDIUM
dMX: Differentiable Mixed-Precision Assignment for Low-Precision Floating-Point Formats
2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM
Building The Ph(ysical)AI Layer Of Machine Intelligence
2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM
SymTRELLIS: Symmetry-Enforced Voxel Latents for 3D Generation
2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM
Exact Unlearning in Reinforcement Learning
2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM
EvalStop: Using World Feedback to Detect and Correct Reward Overoptimization in Multi-Tenant RLHF Platforms
2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM
ADAPTOOD: Uncertainty-Aware Fine-Tuning for Out-of-Distribution ECG Time Series Models
2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM
Smart Transportation Without Neurons -- Fair Metro Network Expansion with Tabular Reinforcement Learning
2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM
Supportive Token Revealing for Fast Diffusion Language Model Decoding
2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM
Instant-Fold: In-Context Imitation Learning for Deformable Object Manipulation
2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM
Sparse Mixture-of-Experts Reward Models Learn Interpretable and Specialized Experts for Personalized Preference Modeling
2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM
Scaling Novel Graph Generation via Lightweight Structure-Guided Autoregressive Models
2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM
OpenRFM: Dissecting Relational In-Context Learning
2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM
MorphoQuant: Modality-Aware Quantization for Omni-modal Large Language Models
2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM
Multi-Granularity 3D Kidney Lesion Characterization from CT Volumes
2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM
Low-Rank Decay for Grokking in Scale-Invariant Transformers: A Spectral-Geometric View
2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM
An Empirical Study of Data Scale, Model Complexity, and Input Modalities in Visual Generalization
2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM
SePO: Self-Evolving Prompt Agent for System Prompt Optimization
2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM
ParetoPilot: Zero-Surrogate Offline Multi-Objective Optimization via Infer-Perturb-Guide Diffusion
2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM
Smart Picks in the Dark: Towards Efficient RLVR for Reasoning via Tracing Metacognitive Pivots
2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM
Self-Evolving Deep Research via Joint Generation and Evaluation
2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM
GeoMin: Data-Efficient Semi-Supervised RLVR via Geometric Distribution Modeling
2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM
Dynamic Infilling Anchors for Format-Constrained Generation in Diffusion Large Language Models
2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM
Dynamic Infilling Anchors for Format-Constrained Generation in Diffusion Large Language Models
2026-06-04 · REGULATION · Impact: MEDIUM
Rollout-Level Advantage-Prioritized Experience Replay for GRPO
2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM
QuBLAST: A Framework for Quantizing Large Language Models with Block-Level Compression Approach and Activation Scaling Strategy
2026-06-04 · PRODUCT_LAUNCH · Impact: MEDIUM