This is a good starting point:
"A Mathematical Framework for Transformer Circuits", Dec 2021
transformer-circuits.pub/2021/framework/index.html
Date: 2023-10-30 04:57 pm (UTC)
ATTENTION: The correction (bug fix) to the calculation of compositions is a relatively recent addition: according to the Wayback Machine, it was added between May 21 and May 24, 2023.
Date: 2023-10-30 05:13 pm (UTC)
An interactive interface: https://transformer-circuits.pub/2021/framework/2L_HP_normal.html
Date: 2023-10-30 05:22 pm (UTC)
But one should really study the next paper: https://transformer-circuits.pub/2022/in-context-learning-and-induction-heads/index.html