2025
an archive of posts from this year
| Date | Title |
|---|---|
| Dec 14, 2025 | Autoregressive Embedding Models: Training, Attention, and Performance |
| Dec 13, 2025 | Non-Diatonic Notes |
| Nov 25, 2025 | Ilya Sutskever: From the Age of Scaling to the Age of Research |
| Nov 22, 2025 | Adaptive Sampling and Curriculum Methods |
| Oct 01, 2025 | Position: Why Web is a Good Environment to Study RL? |
| Sep 18, 2025 | Foundations of Reductionism |
| Sep 01, 2025 | Pretraining, Post-training, and Test-Time Reasoning |
| Aug 24, 2025 | Jazz Chords and Their Variants |
| Aug 07, 2025 | Challenges in Scaling Q-Learning |
| Jul 22, 2025 | Are Multi-step Agents Overthinking? |
| Jul 04, 2025 | Kolmogorov Complexity |
| Jun 13, 2025 | The Komuro Progression |
| May 27, 2025 | Policy Optimization without a Critic: The GRPO Family |
| Mar 15, 2025 | Can Language Models Be Critic Functions? |