Re: [RFC] Bridging the gap between the Linux Kernel Memory Consistency Model (LKMM) and C11/C++11 atomics

From: Olivier Dion
Date: Thu Jul 06 2023 - 12:38:05 EST


On Tue, 04 Jul 2023, Alan Stern <stern@xxxxxxxxxxxxxxxxxxx> wrote:
> On Tue, Jul 04, 2023 at 01:19:23PM -0400, Olivier Dion wrote:
>> On Mon, 03 Jul 2023, Alan Stern <stern@xxxxxxxxxxxxxxxxxxx> wrote:
>> > On Mon, Jul 03, 2023 at 03:20:31PM -0400, Olivier Dion wrote:
[...]
> Oh, is that it? Then I misunderstood entirely; I thought you were
> talking about augmenting the set of functions or macros made available
> in liburcu. I did not realize you intended to change the compilers.

Yes. We want to extend the atomic builtins API of the toolchains.

>> Indeed, our intent is to discuss the Userspace RCU uatomic API by extending
>> the toolchain's atomic builtins and not the LKMM itself. The reason why
>> we've reached out to the Linux kernel developers is because the
>> original Userspace RCU uatomic API is based on the LKMM.
>
> But why do you want to change the compilers to better support urcu?
> That seems like going about things backward; wouldn't it make more sense
> to change urcu to better match the facilities offered by the current
> compilers?

The initial motivation for migrating the Userspace RCU atomics API from
custom inline assembler (mimicking the LKMM) to the C11/C++11 memory
model was to support userspace tools such as TSAN.

We did that by porting everything to the compiler's atomic builtins API.
However, because of the "fully-ordered" atomic semantics of the LKMM, we
had no choice but to add memory fences, which are redundant on some
strongly ordered architectures.
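
To make the redundancy concrete, here is a minimal sketch of what such a
port can look like. The helper name is hypothetical and this is not the
actual liburcu code; it only assumes the GCC/Clang __atomic builtins.
On x86-64 the RMW itself is typically a lock-prefixed instruction, which
already acts as a full barrier, so the two explicit fences add nothing
there.

```c
#include <stdint.h>

/*
 * Hypothetical helper, for illustration only (not the liburcu
 * implementation): an add-and-return that aims for LKMM-style
 * "fully-ordered" semantics using only the compiler's atomic builtins.
 */
static inline int64_t fully_ordered_add_return(int64_t *addr, int64_t v)
{
	int64_t ret;

	/* Order all earlier memory accesses before the RMW. */
	__atomic_thread_fence(__ATOMIC_SEQ_CST);

	/* The RMW itself; the surrounding fences provide the ordering. */
	ret = __atomic_add_fetch(addr, v, __ATOMIC_RELAXED);

	/* Order the RMW before all later memory accesses. */
	__atomic_thread_fence(__ATOMIC_SEQ_CST);

	return ret;
}
```

Whether the RMW should be relaxed or SEQ_CST in such a wrapper is an
assumption of this sketch, not something we have settled.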

> What if everybody started to do this: modifying the compilers to better
> support their pet projects? The end result would be chaos!

This is why we are starting this discussion, which involves members of
the kernel and toolchain communities. We have prior experience with
this, e.g. asm goto, which was implemented in GCC, and afterward in
Clang, in response to the requirements of Linux kernel tracepoints.

Note that the motivation for supporting TSAN in Userspace RCU comes from
the requirements of the ISC for the BIND 9 project.

[...]
>> If we go for the grouping in a), we have to take into account that the
>> barriers emitted need to cover the worst-case scenario. As an example,
>> Clang can emit a store for an exchange with SEQ_CST on x86-64, if the
>> returned value is not used.
>>
>> Therefore, for the grouping in a), all RMW would need to emit a memory
>> barrier (with Clang on x86-64). But with the scheme in b), we can emit
>> the barrier explicitly for the exchange operation. We however question
>> the usefulness of this kind of optimization made by the compiler, since
>> a user should use a store operation instead.
>
> So in the end you settled on a compromise?

We have not settled on anything yet. Choosing between options a) and b)
is open to discussion.
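
For reference, the exchange case from the quoted text can be sketched as
follows. The function names are hypothetical, and the explicit fence is
only one possible way to attach a barrier to a single operation; it is
not a statement of what either option would finally look like.

```c
#include <stdint.h>

/* Hypothetical names, for illustration only. */

/*
 * Exchange whose result is discarded. As noted in the quoted text,
 * Clang can lower this to a store on x86-64, so the barrier is emitted
 * explicitly for this operation instead of relying on the lowering.
 */
static inline void set_with_xchg(int64_t *addr, int64_t v)
{
	(void) __atomic_exchange_n(addr, v, __ATOMIC_SEQ_CST);
	__atomic_thread_fence(__ATOMIC_SEQ_CST);
}

/*
 * What a caller that does not need the previous value would arguably
 * write in the first place: a plain store.
 */
static inline void set_with_store(int64_t *addr, int64_t v)
{
	__atomic_store_n(addr, v, __ATOMIC_SEQ_CST);
}
```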

[...]


Thanks,
Olivier
--
Olivier Dion
EfficiOS Inc.
https://www.efficios.com