2025 | Jack (Hao) Bai

Dec 14, 2025	Autoregressive Embedding Models: Training, Attention, and Performance
Dec 13, 2025	Non-Diatonic Notes
Nov 25, 2025	Ilya Sutskever: From the Age of Scaling to the Age of Research
Nov 22, 2025	Adaptive Sampling and Curriculum Methods
Oct 01, 2025	Position: Why Web is a Good Environment to Study RL?
Sep 18, 2025	Foundations of Reductionism
Sep 01, 2025	Pretraining, Post-training, and Test-Time Reasoning
Aug 24, 2025	Jazz Chords and Their Variants
Aug 07, 2025	Challenges in Scaling Q-Learning
Jul 22, 2025	Are Multi-step Agents Overthinking?
Jul 04, 2025	Kolmogorov Complexity
Jun 13, 2025	The Komuro Progression
May 27, 2025	Policy Optimization without a Critic: The GRPO Family
Mar 15, 2025	Can Language Models Be Critic Functions?