Memory Safe Inline Assembly

(fil-c.org)

50 points | by pizlonator 2 days ago

5 comments

dataflow 2 minutes ago
[delayed]
anitil 20 minutes ago
I find it charming that to distinguish Fil-C from the K&R language they use the term 'Yolo-C'. I have never used inline asm before, I actually didn't realise how much behaviour it's specifying! (When I've needed asm it was on non-gcc compilers)
Edit to add: If I'm understanding this correctly we should be able to run this against projects and detect asm violations, I feel like this would be very valuable to be able to feed these back to maintainers
anitil 15 minutes ago
Do we have a sense for how many of the programs that work [0] are now detected as having asm violations?
[0] https://fil-c.org/programs_that_work
[-]
- pizlonator 5 minutes ago
  Zero, since I made those programs work back when all inline asm was an error.
  So currently most of those still have the hacks to go down the no-inlineasm path when building with Fil-C
  For the few where I reinstated the inline assembly, there were no bugs found.
  It would be a good experiment to try to reinstate the inlineasm paths in all of the programs that had them. I suspect there’s a low chance of finding a bug if it’s in inline assembly that’s on the critical path.
jdw64 1 hour ago
What is more frightening about this than safe C assembly is that this level of implementation is achievable not with a SOTA model, but with a cost effective model like KIMI. There was human judgment involved in the middle, but reading the article, My reading of the process is as follows:
1.A developer identified the necessity of inline assembly.
2.Defined the safety boundaries for 'memory-safe' inline assembly.
3.Established strict policies for memory access.
4.Curated an allowlist of permissible instructions.
5.Set rigorous test criteria and 'done' conditions.
In short, with the overall guardrails in place, a sub agent loop was run, and this level of code was produced. This raises a number of interesting points about how we should use AI. I haven't looked at all the code, but the idea of passing assembly through safe zones without memory access, and using that as a foundation to achieve this level of implementation through AI, is quite impressive
IAmLiterallyAB 1 hour ago
I wonder if an adversarial user could bypass the checks and achieve memory corruption / code execution. Maybe not a practical attack in most situations but a fun exercise.
> This includes things like asm volatile("" : : : "memory"), which is an old-school way of saying atomic_signal_fence(memory_order_seq_cst).
Not quite. AIUI, the first is just a barrier for the compiler, while the second is also a CPU memory barrier. Godbolt seems to confirm that.
https://godbolt.org/z/a844zKej8
[-]
- pizlonator 1 hour ago
  Your godbolt code used atomic_thread_fence
  The quote uses atomic_signal_fence.
  If you find a way to bypass my checks, file a bug. I tried very hard to break it. My agent loops tried even harder
  [-]
  - jancsika 4 minutes ago
    > My agent loops tried even harder
    What happens if you ask to find the strings that will erroneously return True from validateSafeInlineAsm for disallowed asm? :)
  - IAmLiterallyAB 56 minutes ago
    Oops, you're right. I was thinking of those as nearly interchangeable but they're actually pretty different.
    I might give it a try when I have a chance, I'll let you know if anything comes of it.