lm-optimization
an archive of posts in this category
| Dec 14, 2025 | llm Autoregressive Embedding Models: Training, Attention, and Performance |
|---|---|
| Sep 01, 2025 | llm Pretraining, Post-training, and Test-Time Reasoning |
| Aug 01, 2024 | llm LLM Optimization Basics: Memory |
| Jun 15, 2024 | llm LLM Optimization Basics: Time |
| Dec 16, 2023 | llm Mixture of Experts Explained |
| Jun 07, 2023 | llm Self-Attention Layer and The Transformers Architecture |
| Apr 27, 2023 | llm Backpropagation |