Conferences; research updates
This week, Nov 17-18, Thu-Fri, 8am-11:45am Boston time, "Quantum physics and the first-person perspective": www.essentiafoundation.org/quantum-physics-and-the-first-person-perspective/seeing/
JuliaCon 2023, juliacon.org/2023/ the call for proposals is posted, deadline Dec 18: pretalx.com/juliacon2023/cfp
I've spent more quality time focusing of two breakthroughs in understanding the nature and the behavior of machine learning models which came from the "penumbra" of "prosaic alignment" start-ups and which I wrote about in my previous two posts.
"Grokking is (more or less) solved." I took brief notes between Oct 21 and Oct 23: github.com/anhinga/2022-notes/tree/main/Grokking-is-solved
"Generative autoregressive models are similators." I took extensive notes between Oct 5 and Oct 23: github.com/anhinga/2022-notes/tree/main/Generative-autoregressive-models-are-similators
I am continuing to develop thoughts related to these topics, I am going to gradually write more about those topics in the comments.
JuliaCon 2023, juliacon.org/2023/ the call for proposals is posted, deadline Dec 18: pretalx.com/juliacon2023/cfp
I've spent more quality time focusing of two breakthroughs in understanding the nature and the behavior of machine learning models which came from the "penumbra" of "prosaic alignment" start-ups and which I wrote about in my previous two posts.
"Grokking is (more or less) solved." I took brief notes between Oct 21 and Oct 23: github.com/anhinga/2022-notes/tree/main/Grokking-is-solved
"Generative autoregressive models are similators." I took extensive notes between Oct 5 and Oct 23: github.com/anhinga/2022-notes/tree/main/Generative-autoregressive-models-are-similators
I am continuing to develop thoughts related to these topics, I am going to gradually write more about those topics in the comments.
no subject
Three main directions of thoughts here are:
* How this can be used to control model's behavior better (or to train models better)
* What can happen inside those models (e.g. is it true that the most interesting and consequential things in the AI development will soon be happening INSIDE simulations which unfold when one is running those kinds of models or their successors?)
* Philosophy: how might this new "Janus' paradigm" illuminate the nature of "our reality"
Also, the sociology of this is interesting. How fast and wide the knowledge of this new way of thinking about GPT-3-like models would spread, and how much of short-to-medium-term impact on the AI development trajectory it would have?
no subject
Wow, thank you.
no subject
The organization has a YouTube channel, so there is a hope that the recording will be put there eventually: https://www.youtube.com/channel/UCHKZdDf09_8vVHm102fu0sg
(no subject)
(no subject)
(no subject)
(no subject)
(no subject)
(no subject)
(no subject)
(no subject)
(no subject)
(no subject)
(no subject)