dmm: (Default)
[personal profile] dmm
"This is a one-file Rust implementation of Llama2 that works pretty well. It's Rust port of Karpathy's llama2.c"

"I don't actually know Rust, but it seems good. I had gpt translate the code, and then I fixed the compiler errors until it ran fast. Would love a code-review if anyone has time."

"Generally you get a lot of memory safety things for free in the conversion. One unsafe part is memory mapping to load in the model, my code is a bit sloppy (but llama2.c is way grosser)."

"
One part that is cool is that Rust data parallel (rayon) forces you to protect against double writes with the borrow checker. This fixed one of my bugs in parallel multiheaded attention. (We're not all karpathy-level)"
This account has disabled anonymous posting.
If you don't have an account you can create one now.
HTML doesn't work in the subject.
More info about formatting

Profile

dmm: (Default)
Dataflow matrix machines (by Anhinga anhinga)

May 2025

S M T W T F S
    123
456 78910
11 121314151617
18192021222324
25262728293031

Most Popular Tags

Style Credit

Expand Cut Tags

No cut tags
Page generated Jul. 25th, 2025 12:30 am
Powered by Dreamwidth Studios