Machine Learning

Dense Supervision, Sparse Updates: On the Sparsity and Geometry of On-Policy Distillation

Machine Learning

librarian

4 views

The Stable Recovery Manifold: Geometric Principles Governing Recoverability in Continual Learning

Machine Learning

librarian

8 views

Beyond the Commitment Boundary: Probing Epiphenomenal Chain-of-Thought in Large Reasoning Models

Machine Learning

Daniel Scalena

11 views

Existence Precedes Value: Joint Modeling of Observational Existence and Evolving States in Time Series Forecasting

Machine Learning

librarian

9 views

A2D2: Fine-Tuning Any-Length Discrete Diffusion for Adaptive Decoding

Machine Learning

Sophia Tang

10 views

MaxProof: Scaling Mathematical Proof with Generative-Verifier RL and Population-Level Test-Time Scaling

Machine Learning

librarian

17 views

Learning with Simulators: No Regret in a Computationally Bounded World

Machine Learning

Sasha Voitovych

8 views

Understanding Truncated Positional Encodings for Graph Neural Networks

Machine Learning

librarian

8 views

The Standard Interpretable Model: A general theory of interpretable machine learning to deductively design interpretable methods using Lagrangian mechanics

Machine Learning

Pietro Barbiero

13 views

Breaking Entropy Bounds: Accelerating RL Training via MTP with Rejection Sampling

Machine Learning

librarian

14 views

On Subquadratic Architectures: From Applications to Principles

Machine Learning

librarian

15 views

APPO: Agentic Procedural Policy Optimization

Machine Learning

librarian

13 views

Redesign Mixture-of-Experts Routers with Manifold Power Iteration

Machine Learning

Songhao Wu

17 views

Generalization Hacking: Models Can Game Reinforcement Learning by Preventing Behavioral Generalization

Machine Learning

librarian

11 views

EEVEE: Towards Test-time Prompt Learning in the Real World for Self-Improving Agents

Machine Learning

Weixian Xu

25 views

A Unifying Lens on Supervised Fine-Tuning Through Target Distribution Design

Machine Learning

Tong Xie

21 views

When to Align, When to Predict: A Phase Diagram for Multimodal Learning

Machine Learning

Ilay Kamai

18 views

K-Forcing: Joint Next-K-Token Decoding via Push-Forward Language Modeling

Machine Learning

librarian

22 views

CLP: Collocation-Length Prediction for Zero-Loss Adaptive Multi-Token Inference

Machine Learning

librarian

20 views

Express Language Modeling

Machine Learning

librarian

12 views

Tight Sample Complexity of Transformers

Machine Learning

librarian

31 views

Rethinking the Divergence Regularization in LLM RL

Machine Learning

librarian

33 views

Muon Learns More Robust and Transferable Features than Adam

Machine Learning

Fengzhuo Zhang

32 views

Graph Mamba Operator: A Latent Simulator for Interacting Particle Systems

Machine Learning

librarian

33 views

In-Context Learning for Latent Space Bayesian Optimization

Machine Learning

librarian

33 views

Algorithm for Contextual Queueing Bandits with Rate-Optimal Queue Length Regret

Machine Learning

Seoungbin Bae

26 views

Beyond Linear Activation Steering: Invertible Latent Transformations for Controlling LLM Behavior

Machine Learning

librarian

20 views

OrderDP: A Theoretically Guaranteed Lossless Dynamic Data Pruning Framework

Machine Learning

librarian

18 views

End-to-End Subgraph Detection with GraphDETR

Machine Learning

librarian

37 views

The Post-GCN Decade Revisited: Curvature-Stratified Evaluation of Relational Learning

Machine Learning

Shuo Wang

42 views

Tangram: Unlocking Non-Uniform KV Cache for Efficient Multi-turn LLM Serving

Machine Learning

librarian

47 views

Plug-and-Play Guidance for Discrete Diffusion Models via Gradient-Informed Logit Correction

Machine Learning

librarian

41 views
