Some posts are currently under review and may be updated.
7 min read · April 07, 2024
2024 · reinforcement-learning policy-gradient · reinforcement-learning
94 min read · March 13, 2024
2024 · reinforcement-learning · reinforcement-learning
14 min read · February 18, 2024
2024 · reinforcement-learning bellman-operator · reinforcement-learning
22 min read · December 16, 2023
2023 · language-model architecture · lm-optimization
22 min read · August 15, 2023
2023 · language-models generalization statistics · talks