learning · Related events
Edit-R2: Context-Aware Reinforcement Learning for Multi-Turn Image Editing
2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM
Integrating Mechanistic and Data-Driven Models for Neurological Disorders through Differentiable Programming
2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM
Severity-Aware Curriculum Learning with Multi-Model Response Selection for Medical Text Generation
2026-06-06 · ACQUISITION · Impact: HIGH
Severity-Aware Curriculum Learning with Multi-Model Response Selection for Medical Text Generation
2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM
Critic-Guided Heterogeneous Multi-Agent Reasoning for Reliable Mathematical Problem Solving
2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM
Class-Specific Branch Attention for Mitigating Gradient Interference under Class Imbalance
2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM
Retry Policy Gradients in Continuous Action Spaces
2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM
A Pre-Registered Causal Partition of Self-Consistency Elicitation and Reward Design in RLVR
2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM
Step-adaptive multimodal fusion network with multi-scale cloud feature learning for ultra-short-term solar irradiance forecasting
2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM
Towards Healthy Evolution: Exploring the Role and Mechanisms of Human-Agent Interaction in Self-Evolving Systems
2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM
Towards Healthy Evolution: Exploring the Role and Mechanisms of Human-Agent Interaction in Self-Evolving Systems
2026-06-06 · OPEN_SOURCE · Impact: MEDIUM
Amortizing Federated Adaptation: Hypernetwork Driven LoRA for Personalized Foundation Models
2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM
Learning to replenish: A hybrid deep reinforcement learning for dynamic inventory management in the pharmaceutical supply chains
2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM
AIS-Based Vessel Trajectory Prediction Using Memory-Augmented Neural Networks
2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM
Policy-Conditioned Counterfactual Credit for Verifiable Reinforcement Learning of Long-Horizon Language Agents
2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM
Agentic Monte Carlo: Simulating Reinforcement Learning for Black-Box Agents
2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM
Agentic Monte Carlo: Simulating Reinforcement Learning for Black-Box Agents
2026-06-06 · BREAKTHROUGH · Impact: HIGH
Gradient descent at the Edge of Stability: free energy model and kinetic description of the two-layer network
2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM
Willing but Unable: Separating Refusal from Capability in Code LLMs via Abliteration
2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM
CausalPOI: Spatio-Temporal Graph-Based Causal Modeling for Cold-Start POI Check-in Forecasting
2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM
Towards Unified and Data-Efficient Prognostics and Health Management with Tabular Foundation Models
2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM
The Role of Instructional Guidance in Generative AI-Assisted Learning: Empirical Evidence from Construction Engineering Education
2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM
Conformal Risk-Averse Decision Making with Action Conditional Guarantee
2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM
Balancing Image Compression and Generation with Bootstrapped Tokenization
2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM
Representation Learning Enables Scalable Multitask Deep Reinforcement Learning
2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM
Dimensionality Reduction for Cyberattack Classification: A Comparative Evaluation of PCA and Linear Predictive Coding
2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM
Cross-Epoch Adaptive Rollout Optimization for RL Post-Training
2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM
Beyond Waveform Robustness: Robust Feature-Vocoder Adversarial Attacks on Automatic Speech Recognition
2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM
Benchmarking Counterfactual Prediction in Epidemic Time Series with Time-Varying Interventions
2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM
Cognitive Threat Intelligence and Explainable Federated Security Analytics for distributed Infrastructure Systems
2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM
An Improved CNN-LSTM Based Intrusion Detection System for IoT Networks
2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM
Consistency Training Along the Transformer Stack
2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM
EEGDancer: Dynamic Emotion Latent Space Masked Modeling with Reinforcement Learning for EEG Continuous Emotion Prediction
2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM
Deciphering Two Training Clocks in Grokking via Deep Linear Network Theory with Conditional ReLU Reduction
2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM
Learning of Robot Safety Policies via Adversarial Synthetic Scenarios
2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM
AttackPathGNN: Cross-function vulnerability detection in smart contracts using state interference graphs and conjunction pooling
2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM
Sample-efficient Low-level Motion Planning for Robotic Manipulation Tasks via Zero-shot Transfer Learning
2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM
Metamorphic Testing with the Rashomon Set: Explanation Faithfulness in Machine Learning
2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM
ITP-STDP: An Intrinsic-Timing Power-of-Two Learning Engine for On-Chip SNN Training
2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM
Your GFlowNet Secretly Learns an Optimal Transport Plan
2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM
Quantum enhanced rare event discovery and sampling
2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM
PAMF: Prior-Aware Multimodal Fusion for Incomplete Time Series Data
2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM
Double Preconditioning (DoPr): Optimization for Test-Time Performance, not Validation Loss
2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM
RREDCoT: Segment-Level Reward Redistribution for Reasoning Models
2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM
Pretraining Recurrent Networks without Recurrence
2026-06-06 · ACQUISITION · Impact: HIGH
Pretraining Recurrent Networks without Recurrence
2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM