Re: [PATCH] x86/ibt: Implement FineIBT

From: Joao Moreira
Date: Tue Oct 18 2022 - 15:59:55 EST



o IBT WAIT-FOR-ENDBR state is a speculation stop; by placing
the hash validation in the immediate instruction after
the branch target there is a minimal speculation window
and the whole is a viable defence against SpectreBHB.

I still think it's worth noting it does technically weaken the
"attacker-controlled executable memory content injection" attack
requirements, too. While an attacker needs to make sure they place an
ENDBR at the start of their injected code, they no longer need to also
learn and inject the CFI hash too, as the malicious code can just not
do the check at all. The difference in protection currently isn't much.

It's not a very difficult requirement to get attacker-controlled bytes
into executable memory, as there are already existing APIs that provide
this to varying degrees of reachability, utility, and discoverability --
for example, BPF JIT when constant blinding isn't enabled (the unfortunate
default). And with the hashes currently being deterministic, there's no
secret that needs to be exposed first; an attack can just calculate it.
An improvement for kCFI would be to mutate all the hashes both at build
time (perhaps using the same seed infrastructure that randstruct depends
on for sharing a seed across compilation units), and at boot time, so
an actual .text content exposure is needed to find the target hash value.

If we look back at how well ASLR did over the years I think we can't really rely that randomizing the hashes will solve anything. So what you are suggesting is that we flip a "viable defence against SpectreBHB" for a randomization-based scheme, when what we really should be doing is getting constant blinding enabled by default.

In fact, even if an attacker is able to inject an ENDBR at the target through operation constants as you suggest, there is still the need for an info-leak to figure out the address of the ENDBR. I bet this is not a problem for any skilled attacker as much as figuring out the randomized hashes shouldn't be. Unfortunately no CFI scheme I know that relies on anything at the callee-side is fully reliable if an attacker can manipulate executable pages, and randomizing hashes won't change that. So I don't think there is a strong enough difference here. ClangCFI perhaps could be better in that perspective, but as we know it would bring many other drawbacks.

At this point I feel like going on is a bit of bike-shedding, but if this really matters, below is how to use randomization on FineIBT. Maybe with lot less entropy, but just ideas thrown that could be improved over time (don't take this as a serious proposal):

Assuming we got 16 bytes padding to play with on each function prologue, you can randomize between 0-11 in which offset you emit the ENDBR instruction. Caller/Callee would look like (hopefully I did not mess-up offset):

<caller>:
and 0xf3, r11b
call *r11

<callee>:
nop
nop
nop
endbr // <- this position is randomized/patched during boot time.
nop
nop
...

And of course, you get more entropy as you increase the padding nop area.