dmm: (Default)
[personal profile] dmm
"This is a one-file Rust implementation of Llama2 that works pretty well. It's Rust port of Karpathy's llama2.c"

"I don't actually know Rust, but it seems good. I had gpt translate the code, and then I fixed the compiler errors until it ran fast. Would love a code-review if anyone has time."

"Generally you get a lot of memory safety things for free in the conversion. One unsafe part is memory mapping to load in the model, my code is a bit sloppy (but llama2.c is way grosser)."

"One cool part is that Rust's data parallelism (rayon) forces you to protect against double writes through the borrow checker. This fixed one of my bugs in parallel multi-headed attention. (We're not all Karpathy-level.)"
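
The borrow-checker point can be illustrated with a small sketch. This is not the code from the port: it swaps rayon for the standard library's `std::thread::scope` to stay dependency-free, the function name `scale_heads_parallel` and the per-head "math" are made up for illustration, and rayon's `par_chunks_mut` plays an analogous role of handing each task its own disjoint mutable slice.

```rust
use std::thread;

/// Hypothetical per-head computation: each head writes only its own output slice.
fn scale_heads_parallel(input: &[f32], output: &mut [f32], head_size: usize) {
    assert_eq!(input.len(), output.len());
    assert!(head_size > 0 && input.len() % head_size == 0);

    thread::scope(|s| {
        // chunks_mut yields non-overlapping &mut slices, one per head,
        // so no two threads can ever write the same element.
        for (h, (out_head, in_head)) in output
            .chunks_mut(head_size)
            .zip(input.chunks(head_size))
            .enumerate()
        {
            s.spawn(move || {
                let weight = (h + 1) as f32; // stand-in for real attention math
                for (o, x) in out_head.iter_mut().zip(in_head) {
                    *o = weight * *x;
                }
            });
        }
        // Capturing the whole `output` buffer mutably in every closure would
        // not compile: the borrow checker rejects overlapping mutable borrows.
    });
}

fn main() {
    let input = vec![1.0_f32; 6];
    let mut output = vec![0.0_f32; 6];
    scale_heads_parallel(&input, &mut output, 2);
    println!("{:?}", output); // prints [1.0, 1.0, 2.0, 2.0, 3.0, 3.0]
}
```

In C the equivalent loop would compile whether or not two heads wrote to the same region; here the only way to get the parallel version past the compiler is to split the buffer into disjoint pieces first.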

Date: 2023-08-01 02:40 pm (UTC)
juan_gandhi: (Default)
From: [personal profile] juan_gandhi

Yes, yes, interesting. Rust is really taking off.

Dataflow matrix machines (by Anhinga anhinga)