"Hello everybody. In this episode I’ll be speaking with Neel Nanda. After graduating from Cambridge, Neel did a variety of research internships, including one with me, before joining Anthropic where he worked with Chris Olah on the Transformer Circuits agenda. As we record this, he’s pursuing independent research and producing resources to help build the field of mechanistic interpretability. But around when this episode will likely be released, he’ll be joining the language model interpretability team at DeepMind."
no subject
Neel Nanda solved the mystery of "Grokking" as an independent researcher.
Before that he co-authored other cool papers about internal mechanisms of Transformer-based models while at Anthropic AI.
no subject
and https://github.com/anhinga/2022-notes/tree/main/Grokking-is-solved
no subject