Re: [PATCH v2] rcu: Add necessary WRITE_ONCE()

From: Will Deacon
Date: Mon Jun 26 2023 - 08:25:07 EST


On Sat, Jun 24, 2023 at 02:31:12AM +0800, Alan Huang wrote:
>
> > 2023年6月23日 下午1:17,Paul E. McKenney <paulmck@xxxxxxxxxx> 写道:
> >
> > On Wed, Jun 21, 2023 at 10:08:28AM +0800, Alan Huang wrote:
> >>
> >>> 2023年6月21日 06:26,Paul E. McKenney <paulmck@xxxxxxxxxx> 写道:
> >>>
> >>> On Tue, Jun 20, 2023 at 05:13:46PM +0000, Alan Huang wrote:
> >>>> Commit c54a2744497d("list: Add hlist_unhashed_lockless()") and
> >>>> commit 860c8802ace1("rcu: Use WRITE_ONCE() for assignments to
> >>>> ->pprev for hlist_nulls") added various WRITE_ONCE() to pair with
> >>>> the READ_ONCE() in hlist_unhashed_lockless(), but there are still
> >>>> some places where WRITE_ONCE() was not added, this commit adds that.
> >>>>
> >>>> Also add WRITE_ONCE() to pair with the READ_ONCE() in hlist_empty().
> >>>>
> >>>> Signed-off-by: Alan Huang <mmpgouride@xxxxxxxxx>
> >>>
> >>> On hlist_nulls_add_tail_rcu(), good catch, thank you!
> >>>
> >>> On the others, are there really cases where a lockless read races with
> >>> the update? At first glance, that sounds like a usage bug. For example,
> >>> as I understand it, when you use something like hlist_del(), you are
> >>> supposed to ensure that there are no concurrent readers. Which is the
> >>> point of the assignment of the special value LIST_POISON2, right?
> >>
> >> Do you mean there are cases where a lockless read races with hlist_add_head/hlist_add_before
> >> hlist_add_behind/__hlist_del, but there is no real case where a lockless read races with the hlist_del_init/hlist_del
> >> hlist_move_list?
> >>
> >> There may be no real case where a lockless read races with the hlist_del_init/hlist_del
> >> hlist_move_list. But for the sake of completeness, I added those WRITE_ONCE, after all, if there is WRITE_ONCE
> >> in __hlist_del, why not add WRITE_ONCE in its caller, like hlist_del()?
> >
> > You might well have located a larger issue. We want to be able to use
> > KCSAN to find unintended data races, but as you noted, there might
> > be different requirements for RCU-protected linked lists and for
> > lock-protected linked lists. If there are, then there is probably
> > existing linked-list code that is using the wrong primitive, for
> > example, using (or failing to use) the one that Eric Dumazet provided.
> > For example, mismatched API usage might be causing the differences in
> > uses of _ONCE() primitives that you are calling out.
>
> I noticed a thread:
>
> https://lore.kernel.org/lkml/20200324153643.15527-2-will@xxxxxxxxxx/
>
> It seems like Will wanted to remove that hlist_unhashed_lockless()?
> But I can’t find any further updates.
>
> Will: Can you tell me what happened later?

IIRC, there were potential correctness issues with accesses being torn
(possibly by the compiler) which meant that some additional surgery was
needed to make some of the list accesses safe without locks.

I then ran into problems understanding how list_empty_careful() is supposed
to work which weren't resolved. I think the best summary of where I got
stuck (and moved onto more pressing things) is:

https://lore.kernel.org/lkml/20200424173932.GK21141@willie-the-truck/

Will