Dataflow matrix machines (by Anhinga anhinga) (
dmm
) wrote
2023-10-30 05:22 pm (UTC)
no subject
They say that compositions often copy sequences (exactly or approximately).
But one should really study the next paper:
https://transformer-circuits.pub/2022/in-context-learning-and-induction-heads/index.html
(
27 comments
)
Post a comment in response:
From:
Anonymous
This account has disabled anonymous posting.
OpenID
Identity URL:
Log in?
Dreamwidth account
Account name
Password
Log in?
If you don't have an account you can
create one now
.
Subject
HTML doesn't work in the subject.
Formatting type
Casual HTML
Markdown
Raw HTML
Rich Text Editor
Message
[
Home
|
Post Entry
|
Log in
|
Search
|
Browse Options
|
Site Map
]
no subject
But one should really study the next paper: https://transformer-circuits.pub/2022/in-context-learning-and-induction-heads/index.html