dmm | C to safe Rust automatic translation using LLMs and dynamic analysis

C to safe Rust automatic translation using LLMs and dynamic analysis

This is a non-trivial angle: x.com/StringChaos/status/1869571815548596308 (UC Berkeley)

https://syzygy-project.github.io/

https://syzygy-project.github.io/assets/paper.pdf

> combine the generative capabilities of LLMs with semantic execution information collected by dynamic analyses on the source C codebase

> test translation approach for reliable equivalence testing

What I understood so far: The starting point must be an existing C codebase with good unit tests. The LLM is used as a possibly buggy code translator that also has the ability to sometimes fix errors and repeat. It is hoped that the feedback loop converges.

From their paper:

We restrict the input C program to the following conditions:
(1) Acyclic data structures: data structures [should have no] pointer cycles ...
(2) No multithreading ...
(3) No type punning: performing raw memory accesses on untyped or multi-typed memory regions would hinder the best effort type analysis that our approach aims to perform.

Edited 2024-12-19 11:14 (UTC)

They do what they can. But still, a good step in the right direction.

Flat | Top-Level Comments Only

C to safe Rust automatic translation using LLMs and dynamic analysis

no subject

no subject

no subject