Some posts are currently under review and may be updated.
51 min read · June 07, 2023
2023 · architecture transformer attention · lm-optimization
19 min read · May 20, 2023
2023 · algorithms dynamic-programming optimization math · algorithms
17 min read · April 27, 2023
2023 · deep-learning optimization language-model · lm-optimization
18 min read · February 01, 2018
2018 · reinforcement-learning language-models scalability · talks