2026
an archive of posts from this year
| Jun 09, 2026 | What Scaling Laws For Hybrid LLMs Can Tell Us About Pretraining Mixtures |
|---|---|
| Jun 06, 2026 | What Hybrid Models Mean for Scaling |
| Apr 06, 2026 | From Muon to Spectra |
| Feb 10, 2026 | A Simple Toy Model Bridging HTSR & $\alpha$-REQ |
| Jan 24, 2026 | Muon and Manifold Versions |