-
Zero Intervention, Short Thinking, and More Actions - A New Paradigm for Multi-step RL for Language Models
This article introduces a new paradigm for multi-step reinforcement learning with language models, built on zero human intervention, shorter reasoning traces, and more actions per episode.
-
Are Auto-Regressive Language Models Simply Memorizing Answers or Learning to Reason?
This article briefly discusses whether and why auto-regressive language models can perform well on simple reasoning tasks.
-
A Complete Tutorial on Self-Attention & Transformer
This article explains the Transformer architecture thoroughly, tracing the path from RNNs to self-attention, and then to the full Transformer.