|国家预印本平台
首页|PR2: Peephole Raw Pointer Rewriting with LLMs for Translating C to Safer Rust

PR2: Peephole Raw Pointer Rewriting with LLMs for Translating C to Safer Rust

PR2: Peephole Raw Pointer Rewriting with LLMs for Translating C to Safer Rust

来源:Arxiv_logoArxiv
英文摘要

There has been a growing interest in translating C code to Rust due to Rust's robust memory and thread safety guarantees. Tools such as C2RUST enable syntax-guided transpilation from C to semantically equivalent Rust code. However, the resulting Rust programs often rely heavily on unsafe constructs--particularly raw pointers--which undermines Rust's safety guarantees. This paper aims to improve the memory safety of Rust programs generated by C2RUST by eliminating raw pointers. Specifically, we propose a peephole raw pointer rewriting technique that lifts raw pointers in individual functions to appropriate Rust data structures. Technically, PR2 employs decision-tree-based prompting to guide the pointer lifting process. Additionally, it leverages code change analysis to guide the repair of errors introduced during rewriting, effectively addressing errors encountered during compilation and test case execution. We implement PR2 as a prototype and evaluate it using gpt-4o-mini on 28 real-world C projects. The results show that PR2 successfully eliminates 13.22% of local raw pointers across these projects, significantly enhancing the safety of the translated Rust code. On average, PR2 completes the transformation of a project in 5.44 hours, at an average cost of $1.46.

Yifei Gao、Chengpeng Wang、Pengxiang Huang、Xuwei Liu、Mingwei Zheng、Xiangyu Zhang

计算技术、计算机技术

Yifei Gao,Chengpeng Wang,Pengxiang Huang,Xuwei Liu,Mingwei Zheng,Xiangyu Zhang.PR2: Peephole Raw Pointer Rewriting with LLMs for Translating C to Safer Rust[EB/OL].(2025-05-07)[2025-05-22].https://arxiv.org/abs/2505.04852.点此复制

评论