Re: [PATCH 0/24] make atomic_read() behave consistently across allarchitectures

From: Nick Piggin
Date: Thu Aug 16 2007 - 23:16:58 EST

Next message: Paul Mackerras: "Re: [PATCH 0/24] make atomic_read() behave consistently across allarchitectures"
Previous message: Linus Torvalds: "Re: [PATCH 0/24] make atomic_read() behave consistently across allarchitectures"
In reply to: Paul E. McKenney: "Re: [PATCH 0/24] make atomic_read() behave consistently across all architectures"
Next in thread: Paul Mackerras: "Re: [PATCH 0/24] make atomic_read() behave consistently across allarchitectures"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]

Paul E. McKenney wrote:

On Thu, Aug 16, 2007 at 06:42:50PM +0800, Herbert Xu wrote:

In fact, volatile doesn't guarantee that the memory gets
read anyway. You might be reading some stale value out
of the cache. Granted this doesn't happen on x86 but
when you're coding for the kernel you can't make such
assumptions.

So the point here is that if you don't mind getting a stale
value from the CPU cache when doing an atomic_read, then
surely you won't mind getting a stale value from the compiler
"cache".

Absolutely disagree. An interrupt/NMI/SMI handler running on the CPU
will see the same value (whether in cache or in store buffer) that
the mainline code will see. In this case, we don't care about CPU
misordering, only about compiler misordering. It is easy to see
other uses that combine communication with handlers on the current
CPU with communication among CPUs -- again, see prior messages in
this thread.

I still don't agree with the underlying assumption that everybody
(or lots of kernel code) treats atomic accesses as volatile.

Nobody that does has managed to explain my logic problem either:
loads and stores to long and ptr have always been considered to be
atomic, test_bit is atomic; so why are another special subclass of
atomic loads and stores? (and yes, it is perfectly legitimate to
want a non-volatile read for a data type that you also want to do
atomic RMW operations on)

Why are people making these undocumented and just plain false
assumptions about atomic_t? If they're using lockless code (ie.
which they must be if using atomics), then they actually need to be
thinking much harder about memory ordering issues. If that is too
much for them, then they can just use locks.

So, the architecture guys can implement atomic_read however they want
--- as long as it cannot be optimized away.*

They can implement it however they want as long as it stays
atomic.

Precisely. And volatility is a key property of "atomic". Let's please
not throw it away.

It isn't, though (at least not since i386 and x86-64 don't have it).
_Adding_ it is trivial, and can be done any time. Throwing it away
(ie. making the API weaker) is _hard_. So let's not add it without
really good reasons. It most definitely results in worse code
generation in practice.

I don't know why people would assume volatile of atomics. AFAIK, most
of the documentation is pretty clear that all the atomic stuff can be
reordered etc. except for those that modify and return a value.

It isn't even intuitive: `*lp = value` is like the most fundamental
atomic operation in Linux.

--
SUSE Labs, Novell Inc.
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/

Next message: Paul Mackerras: "Re: [PATCH 0/24] make atomic_read() behave consistently across allarchitectures"
Previous message: Linus Torvalds: "Re: [PATCH 0/24] make atomic_read() behave consistently across allarchitectures"
In reply to: Paul E. McKenney: "Re: [PATCH 0/24] make atomic_read() behave consistently across all architectures"
Next in thread: Paul Mackerras: "Re: [PATCH 0/24] make atomic_read() behave consistently across allarchitectures"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]