Apr 2026
13 Mon
14 Tue
15 Wed
16 Thu
17 Fri
18 Sat 09:00 AM – 06:00 PM IST
19 Sun 09:00 AM – 06:00 PM IST
Submitted Mar 18, 2026
Now that LLMs are capable of generating working code in pretty much any programming language, it raises the question of which language to use? Is an LLM equally good at generating safe C code versus Rust? Which language helps LLMs correct themselves the best?
Previously, I’d presented the shortcomings of using LLMs for Systems programming at the FSTTCS conference. Models from OpenAI, Anthropic and Google had a tendency to produce C code that looked correct, but had crash rates approaching ~30% for Sanitizers (ASAN, UBSAN, INTSAN). Allowing the model to learn from the feedback of the sanitizers, varying the temperature, the kind of prompt and giving secure examples didnt help much. What did help was changing the language - Rust had far fewer runtime crashes.
In this talk I demonstrate the results of a new series of experiments to show the impact of using Rust over more traditional languages like C. I explore projects like Anthropic’s CCC, and try to push LLMs to their limit.
Asutosh Pandey is a Compiler Engineer at AMD working with the CPU Compiler team. He co-organizes the LLVM Social Bangalore Meetup, the Innovations In Compiler Technology Workshop and the Segfault Hackathon.
LinkedIn: https://www.linkedin.com/in/ashupdsce
Twitter: https://x.com/ashpande18
Hosted by
{{ gettext('Login to leave a comment') }}
{{ gettext('Post a comment…') }}{{ errorMsg }}
{{ gettext('No comments posted yet') }}