Depen Morwani
Anything but SGD: Evaluating Optimizers for LLM Training
July 12, 2024
Where Do Features Come From?
November 15, 2023