learning · Product

Source: GitHub · OPEN_SOURCE · Open source · MIT · Published 2017-11-26

A log of things I'm learning

Stars: 6865

Forks: 883

Tech stack

Alternatives

learning · Related events

2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM

Integrating Mechanistic and Data-Driven Models for Neurological Disorders through Differentiable Programming

2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM

An interpretable and trustworthy AI framework for large-scale longitudinal structure-pain association studies using data from the Osteoarthritis Initiative (OAI)

2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM

Severity-Aware Curriculum Learning with Multi-Model Response Selection for Medical Text Generation

2026-06-06 · ACQUISITION · Impact: HIGH

Severity-Aware Curriculum Learning with Multi-Model Response Selection for Medical Text Generation

2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM

Critic-Guided Heterogeneous Multi-Agent Reasoning for Reliable Mathematical Problem Solving

2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM

Class-Specific Branch Attention for Mitigating Gradient Interference under Class Imbalance

2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM

Retry Policy Gradients in Continuous Action Spaces

2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM

A Pre-Registered Causal Partition of Self-Consistency Elicitation and Reward Design in RLVR

2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM

Step-adaptive multimodal fusion network with multi-scale cloud feature learning for ultra-short-term solar irradiance forecasting

2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM

Towards Healthy Evolution: Exploring the Role and Mechanisms of Human-Agent Interaction in Self-Evolving Systems

2026-06-06 · OPEN_SOURCE · Impact: MEDIUM

Amortizing Federated Adaptation: Hypernetwork Driven LoRA for Personalized Foundation Models

2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM

Learning to replenish: A hybrid deep reinforcement learning for dynamic inventory management in the pharmaceutical supply chains

2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM

AIS-Based Vessel Trajectory Prediction Using Memory-Augmented Neural Networks

2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM

Finite Element-Based Material Learning via Automatic Differentiation: Learning constitutive neural network models from full-field deformation data

2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM

Where's the Structure? A Systematic Literature Review of Empirical Research on Human-AI Collaboration and Hybrid Intelligence for Learning

2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM

Policy-Conditioned Counterfactual Credit for Verifiable Reinforcement Learning of Long-Horizon Language Agents

2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM

Agentic Monte Carlo: Simulating Reinforcement Learning for Black-Box Agents

2026-06-06 · BREAKTHROUGH · Impact: HIGH

Gradient descent at the Edge of Stability: free energy model and kinetic description of the two-layer network

2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM

Willing but Unable: Separating Refusal from Capability in Code LLMs via Abliteration

2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM

CausalPOI: Spatio-Temporal Graph-Based Causal Modeling for Cold-Start POI Check-in Forecasting

2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM

Selective-Advantage Entropy-Adaptive Horizon GRPO: Asymmetric Token-Level Discounting for Efficient Reinforcement Learning of Language Models

2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM

Towards Unified and Data-Efficient Prognostics and Health Management with Tabular Foundation Models

2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM

The Role of Instructional Guidance in Generative AI-Assisted Learning: Empirical Evidence from Construction Engineering Education

2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM

Conformal Risk-Averse Decision Making with Action Conditional Guarantee

2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM

Balancing Image Compression and Generation with Bootstrapped Tokenization

2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM

Representation Learning Enables Scalable Multitask Deep Reinforcement Learning

2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM

Dimensionality Reduction for Cyberattack Classification: A Comparative Evaluation of PCA and Linear Predictive Coding

2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM

Cross-Epoch Adaptive Rollout Optimization for RL Post-Training

2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM

Beyond Waveform Robustness: Robust Feature-Vocoder Adversarial Attacks on Automatic Speech Recognition

2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM

Benchmarking Counterfactual Prediction in Epidemic Time Series with Time-Varying Interventions

2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM

Cognitive Threat Intelligence and Explainable Federated Security Analytics for distributed Infrastructure Systems

2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM

An Improved CNN-LSTM Based Intrusion Detection System for IoT Networks

2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM

Consistency Training Along the Transformer Stack

2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM

EEGDancer: Dynamic Emotion Latent Space Masked Modeling with Reinforcement Learning for EEG Continuous Emotion Prediction

2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM

Deciphering Two Training Clocks in Grokking via Deep Linear Network Theory with Conditional ReLU Reduction

2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM

Learning of Robot Safety Policies via Adversarial Synthetic Scenarios

2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM

AttackPathGNN: Cross-function vulnerability detection in smart contracts using state interference graphs and conjunction pooling

2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM

Sample-efficient Low-level Motion Planning for Robotic Manipulation Tasks via Zero-shot Transfer Learning

2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM

Metamorphic Testing with the Rashomon Set: Explanation Faithfulness in Machine Learning

2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM

ITP-STDP: An Intrinsic-Timing Power-of-Two Learning Engine for On-Chip SNN Training

2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM

Your GFlowNet Secretly Learns an Optimal Transport Plan

2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM

Quantum enhanced rare event discovery and sampling

2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM

PAMF: Prior-Aware Multimodal Fusion for Incomplete Time Series Data

2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM

Double Preconditioning (DoPr): Optimization for Test-Time Performance, not Validation Loss

2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM

RREDCoT: Segment-Level Reward Redistribution for Reasoning Models

2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM

Pretraining Recurrent Networks without Recurrence

2026-06-06 · ACQUISITION · Impact: HIGH

Pretraining Recurrent Networks without Recurrence

2026-06-06 · PRODUCT_LAUNCH · Impact: MEDIUM