Account name:
Password
(OpenID?)
(Forgot it?)
Remember Me
You're viewing
dmm
's journal
Create a Dreamwidth Account
Learn More
Interest
Region
Site and Account
FAQ
Email
Reload page in style:
site
light
Dataflow matrix machines (by Anhinga anhinga)
WIRED published a story on Transformer invention
WIRED published a story on Transformer invention
Mar
.
21st
,
2024
03:59 pm
dmm
The history of the creation of "Attention Is All You Need",
arxiv.org/abs/1706.03762
It's pretty intense; it's very interesting what it took to achieve that.
Flat
|
Top-Level Comments Only
no subject
Date:
2024-03-21 09:05 pm (UTC)
From:
dmm
I think we'll do more sophisticated things again soon.
But I was stunned with their description of the last few weeks of working on that paper; that was intense...
no subject
Date:
2024-03-22 08:05 am (UTC)
From:
chaource
Пусть Jakob Uszkoreit тамъ все ускоряетъ!
6 comments
Reply
Flat
|
Top-Level Comments Only
Profile
Dataflow matrix machines (by Anhinga anhinga)
Neuromorphic Computations with Linear Streams
Recent Entries
Archive
Reading
Network
Tags
Memories
Profile
September
2025
S
M
T
W
T
F
S
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
Most Popular Tags
ai art
-
5 uses
ai safety
-
11 uses
anthropic ai
-
4 uses
artificial intelligence
-
28 uses
biology
-
3 uses
category theory
-
3 uses
climate
-
2 uses
compact ml models
-
2 uses
computer art
-
3 uses
conference
-
20 uses
covid-19
-
2 uses
dall-e 3
-
5 uses
dataflow matrix machines
-
3 uses
differentiable programming
-
3 uses
february 24 2022
-
13 uses
github copilot
-
5 uses
gpt-4
-
17 uses
images as matrices
-
2 uses
julia
-
15 uses
large language models
-
10 uses
literature
-
5 uses
logic
-
3 uses
machine learning
-
13 uses
manin
-
2 uses
mathematics
-
10 uses
my talks
-
2 uses
neural networks
-
7 uses
openai
-
5 uses
openai codex
-
4 uses
philosophy
-
10 uses
physics
-
11 uses
politics
-
5 uses
program synthesis
-
7 uses
programming languages
-
2 uses
qualia
-
2 uses
quantum computing
-
2 uses
remember
-
4 uses
scifi
-
2 uses
sparsity
-
2 uses
technological singularity
-
9 uses
this blog
-
2 uses
transformers
-
23 uses
twitter
-
6 uses
understanding internals of ai
-
15 uses
visual art
-
4 uses
voevodsky
-
2 uses
zzznah
-
3 uses
арестович
-
2 uses
фашизм в рф
-
18 uses
🇺🇦
-
15 uses
Page Summary
dmm
-
(no subject)
Active Entries
1:
Helion details
2:
"Narrow AGI" this year?
3:
Tao on coordinate vs coordinate-free math reasoning
4:
"Aging as a loss of goal-directedness"
5:
New integrated mode for GPT-4 in ChatGPT+
6:
Китайский новый год начнётся 10-го февраля
7:
Automating the Search for Artificial Life with Foundation Models
8:
"Anatomy of a Formal Proof"
Style Credit
Style:
Neutral Good
for
Practicality
by
timeasmymeasure
Expand Cut Tags
No cut tags
Page generated Dec. 29th, 2025 12:50 pm
Powered by
Dreamwidth Studios
no subject
Date: 2024-03-21 09:05 pm (UTC)But I was stunned with their description of the last few weeks of working on that paper; that was intense...
no subject
Date: 2024-03-22 08:05 am (UTC)