Machine Learning

MesaNet: Sequence Modeling by Locally Optimal Test-Time Training
Avatar
Johannes von Oswald
17 views
Kinetics: Rethinking Test-Time Scaling Laws
Avatar
librarian
22 views
Horizon Reduction Makes RL Scalable
Avatar
librarian
21 views
OpenThoughts: Data Recipes for Reasoning Models
Avatar
librarian
20 views
Not All Tokens Are Meant to Be Forgotten
Avatar
librarian
21 views
Global optimization of graph acquisition functions for neural
  architecture search
Avatar
Calvin Tsay
52 views
Distortion of AI Alignment: Does Preference Optimization Optimize for
  Preferences?
Avatar
Paul Go¨lz
50 views
REOrdering Patches Improves Vision Models
Avatar
librarian
48 views
On Learning Verifiers for Chain-of-Thought Reasoning
Avatar
Maria-Florina Balcan
50 views