cfi: Simpler launder implementation for common types #1714

swenson · 2024-10-10T23:55:50Z

cfi_launder is meant to prevent the Rust compiler from optimizing a value away.

Our current implementation uses core::hint::black_box(), which is the recommended way in Rust. The problem is, this appears to often force the argument to spill into memory and to be reloaded, which can be a lot of extra instructions.

The original inspiration for this function is from, I believe, OpenTitan's launder* functions. There, they use an LLVM-specific trick of a blank inline assembly block to force the compiler to keep the argument in a register.

After reviewing our code and speaking with @vsonims, it sounds like the intention of the launder in our code is to prevent the compiler from optimizing the value away (as the comments suggest), so the simpler inline assembly trick may be sufficient (since we use the official Rust compiler, which uses LLVM).

The biggest problem is that we launder many types of values in our code and not all of them fit into a register.

So, this PR represents an incremental change: for u32s and similar small types, we implement cfi_launder using the inline assembly trick from OpenTitan. For any other types, we have a trait that can be derived that will call core::hint::black_box in the same way as today.

We can do future follow-up PRs to try to try to clean up some of those other uses of cfi_launder to hopefully shrink the code more.

I also slipped in avoid a few extra copies in the verifier by using references instead of copies (this saves ~80 bytes of instruction space).

This PR appears to shrink the ROM code size by 1232 bytes and the runtime firmware by 700 bytes.

swenson · 2024-10-11T00:04:17Z

cfi/lib/src/cfi.rs

    } else {
        val
    }
 }

+pub trait LaunderTrait<T> {


These Rust type acrobatics are so that we can have special, smaller implementations for register-sized but default to core::hint::black_box for everything else.

I'm open to suggestions on how to make this simpler or avoid a derive.

jhand2 · 2024-10-11T16:49:51Z

cfi/lib/src/cfi.rs

+    fn launder(&self, val: u32) -> u32 {
+        let mut val = val;
+        unsafe {
+            core::arch::asm!(


I think this is magic enough to be worth a comment about why it works 😄 Also can we be confident that this will continue to work in future versions of LLVM? I wonder if we can write a test that proves the compiled binary didn't optimizer out values passed to cfi_launder.

I'll definitely write a comment.

It's quite difficult to test this, I think, and it certainly is not guaranteed by LLVM in the future (nor would be basically any trick, probably, since all of these are meant to fool the compiler without actually doing anything). I verified it by loading code into Compiler Explorer for now.

There are some ways to test this by invoking the compiler in a test and checking the emitted assembly and, for example, counting instructions with and without this function being applied.

If this stops working in the future, this doesn't stop the code using it from working, it just means that one layer of protection is removed. And theoretically we'd notice because the code size would change.

I added a test that invokes rustc --emit asm and does a simple check if a functions is optimized enough (with and without laundering).

Nice :) Thanks!

I talked with the OT folks and they do manual pre-TO validation for things like this. Might be worth doing something at ROM release time (or adding a note to pre-TO guidance for integrators) to spot check important sections.

I think we're mostly safe here, since we pin the version of Cargo (and presumably rustc and LLVM). So we mostly need to validate when we update our toolchain.

@vsonims

`cfi_launder` is meant to prevent the Rust compiler from optimizing a value away. Our current implementation uses `core::hint::black_box()`, which is the recommended way in Rust. The problem is, this appears to often force the argument to spill into memory and to be reloaded, which can be a lot of extra instructions. The original inspiration for this function is from, I believe, [OpenTitan's launder* functions](https://github.com/lowRISC/opentitan/blob/master/sw/device/lib/base/hardened.h#L193). There, they use an LLVM-specific trick of a blank inline assembly block to force the compiler to keep the argument in a register. After reviewing our code and speaking with @vsonims, it sounds like the intention of the launder in our code is to prevent the compiler from optimizing the value away (as the comments suggest), so the simpler inline assembly trick may be sufficient (since we use the official Rust compiler, which uses LLVM). The biggest problem is that we launder many types of values in our code and not all of them fit into a register. So, this PR represents an incremental change: for `u32`s and similar small types, we implement `cfi_launder` using the inline assembly trick from OpenTitan. For any other types, we have a trait that can be derived that will call `core::hint::black_box` in the same way as today. We can do future follow-up PRs to try to try to clean up some of those other uses of `cfi_launder` to hopefully shrink the code more. This PR appears to shrink the ROM code size by 1152 bytes and the runtime firmware by 616 bytes.

This saves an additional 80 and 84 bytes in the ROM and runtime, respectively.

…es additional 12 bytes

swenson · 2024-10-14T18:26:18Z

(I've also looked at removing Copy traits from a few other types and trying to use more references so that the laundering could be more effective without copying, but I was surprised to find that this increased the code size.

There are more likely a few more types though that would be worth streamlining so we can save more code space, but I leave that to a future PR.)

swenson requested review from FerralCoder, rusty1968, bluegate010, mhatrevi, vsonims, ajisaxena, korran and JohnTraverAmd as code owners October 10, 2024 23:55

swenson mentioned this pull request Oct 11, 2024

[draft] Constant-time equality checks for sensitive values #1712

Draft

11 tasks

swenson requested a review from jhand2 October 11, 2024 00:00

swenson commented Oct 11, 2024

View reviewed changes

jhand2 reviewed Oct 11, 2024

View reviewed changes

swenson force-pushed the cfi-launder branch from 640b617 to dc8c048 Compare October 11, 2024 17:20

swenson added 8 commits October 14, 2024 09:15

Clippy keeping me honest

fc5f1b9

Avoid some unnecessary copies in the verifier

f466b9f

This saves an additional 80 and 84 bytes in the ROM and runtime, respectively.

Add comments; actually use the result of the assembly for slices; sav…

153db1f

…es additional 12 bytes

Fix clippy

969221d

More fix clippy

17fe7f2

Add a test for laundering

608c607

Header fix

05a5902

swenson force-pushed the cfi-launder branch from 1921e07 to 05a5902 Compare October 14, 2024 16:15

Move CFI assembly tests to derive

40e768c

jhand2 previously approved these changes Oct 14, 2024

View reviewed changes

Revert dpe (not sure how that got in there)

ea51249

swenson dismissed jhand2’s stale review via ea51249 October 14, 2024 18:11

jhand2 approved these changes Oct 14, 2024

View reviewed changes

rusty1968 approved these changes Oct 24, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

cfi: Simpler launder implementation for common types #1714

cfi: Simpler launder implementation for common types #1714

swenson commented Oct 10, 2024 •

edited

Loading

swenson Oct 11, 2024

jhand2 Oct 11, 2024 •

edited

Loading

swenson Oct 11, 2024

swenson Oct 11, 2024

jhand2 Oct 14, 2024

swenson Oct 14, 2024

swenson commented Oct 14, 2024

cfi: Simpler launder implementation for common types #1714

Are you sure you want to change the base?

cfi: Simpler launder implementation for common types #1714

Conversation

swenson commented Oct 10, 2024 • edited Loading

swenson Oct 11, 2024

Choose a reason for hiding this comment

jhand2 Oct 11, 2024 • edited Loading

Choose a reason for hiding this comment

swenson Oct 11, 2024

Choose a reason for hiding this comment

swenson Oct 11, 2024

Choose a reason for hiding this comment

jhand2 Oct 14, 2024

Choose a reason for hiding this comment

swenson Oct 14, 2024

Choose a reason for hiding this comment

swenson commented Oct 14, 2024

swenson commented Oct 10, 2024 •

edited

Loading

jhand2 Oct 11, 2024 •

edited

Loading