Re: [PATCH stable] memcg: add refcnt for pcpu stock to avoid UAF problem in drain_all_stock()

From: Michal Hocko
Date: Wed Feb 21 2024 - 05:08:15 EST


On Wed 21-02-24 17:50:27, Gong Ruiqi wrote:
>
> On 2024/02/21 16:38, Michal Hocko wrote:
> > On Wed 21-02-24 16:18:01, GONG, Ruiqi wrote:
> >> commit 1a3e1f40962c445b997151a542314f3c6097f8c3 upstream.
> >
> > I think it would be good to mention that this is only a partial backport
> > and also explain why to do a partial rather than the full one.
> >
>
> Okay. I think to fix this problem we should add refcnt relation between
> memcg and stock, and since higher versions have achieved this, maybe
> it's better to use the same code and align with them. So I put a "commit
> xxx upstream" here, as requested in kernel docs[1]. So yes it's a
> partial backport as we only need the stock part.

I think it is sufficient to mention that this is a partial backport to
minimize the fix to the bare minimum.

[...]
> > What does prevent from the following?
> >
> > refill_stock(memcgC) drain_all_stock(memcgB)
> > drain_stock(memcgA) rcu_read_lock()
> > css_put(old->css) memcgA = stock->cached
> > mem_cgroup_is_descendant(memcgA, memcgB) UAF
> > stock->cached = NULL
> >
>
> I think it's not a problem since refill_stock() has disabled irq before
> calling drain_stock():
>
> refill_stock(memcgC)
> local_irq_save
> drain_stock(memcgA)
> css_put(old->css)
> <1>
> stock->cached = NULL
> local_irq_restore
> <2>
>
> And since css_put(old->css) is an RCU free, memcgA would not be freed at
> <1> as it's still in grace period. The actual release of memcgA could
> happen only after irq is enabled (at <2>).
>
> And for CPU2, the access to stock->cached in drain_all_stock() is
> protected by rcu_read_lock(), so from stock->cached we get either NULL,
> or a memcgA that is still not freed.
>
> Please correct me if I have some wrong understanding to RCU.

You are right. Thanks! IRQ disabling is there in one form or the other
since db2ba40c277d ("mm: memcontrol: make per-cpu charge cache IRQ-safe
for socket accounting") so 4.8+ is safe. Backports to older kernels
would nee to pull this one as well.

> >> Cc: stable@xxxxxxxxxxxxxxx # 4.19 5.4
> >> Fixes: cdec2e4265df ("memcg: coalesce charging via percpu storage")
> >> Signed-off-by: GONG, Ruiqi <gongruiqi1@xxxxxxxxxx>

Acked-by: Michal Hocko <mhocko@xxxxxxxx>

Thanks!

--
Michal Hocko
SUSE Labs