dmm: (Default)
[personal profile] dmm
"WorldCoder, a Model-Based LLM Agent: BuildingWorld Models by Writing Code and Interacting with the Environment", arxiv.org/abs/2402.12275

Not a widely known paper (the authors don't promote it), but pretty spectacular (a friend of mine said, "Is it AGI already?").

I think I mostly understand how this works and I made some notes yesterday.

A meta-note here: GPT-4-level models mostly understand what they are doing, but are unreliable; so the question is, can one organize a process which reliably produces needed results based on that. There are plenty of papers trying to push in this direction, but this one is very elegant, and the results are quite good.

******

www.lesswrong.com/posts/jGuXSZgv6qfdhMCuJ/refusal-in-llms-is-mediated-by-a-single-direction - very elegant and simple


******

May 9, 2024 update: Since this is access-list-only at the moment (although this post is likely to become public eventually), it's a good place for my notes on switching to Twitter "X Premium" experience (in comments).

May 13: let's move this post to being public.


Date: 2024-05-09 03:27 am (UTC)
anhinga_anhinga: (Default)
From: [personal profile] anhinga_anhinga
testing comment notifications

Profile

dmm: (Default)
Dataflow matrix machines (by Anhinga anhinga)

May 2025

S M T W T F S
    123
456 78910
11 121314151617
18192021222324
25262728293031

Most Popular Tags

Style Credit

Expand Cut Tags

No cut tags
Page generated Jun. 17th, 2025 02:15 am
Powered by Dreamwidth Studios