about

agentic-reasoning

an archive of posts in this category

2023

2024

2025

Nov 2022
ChatGPT released

Jan 2025
DeepSeek-R1 released

Sep 2025
Qwen3-VL released

Jul 2025
Are Multi-step Agents Overthinking? Oct 2025
Position: Why Web is a Good Environment to Study RL?

Oct 01, 2025	agent Position: Why Web is a Good Environment to Study RL?
Jul 22, 2025	agent Are Multi-step Agents Overthinking?

© Copyright 2026 Jack (Hao) Bai. Powered by Jekyll with al-folio theme.