Re: [PATCH AUTOSEL 4.4 03/16] fs/btrfs: Add cond_resched() for try_release_extent_mapping() stalls

From: Paul E. McKenney
Date: Tue Aug 11 2020 - 09:48:03 EST


On Tue, Aug 11, 2020 at 09:57:20AM +0200, David Sterba wrote:
> On Mon, Aug 10, 2020 at 03:14:30PM -0400, Sasha Levin wrote:
> > From: "Paul E. McKenney" <paulmck@xxxxxxxxxx>
> >
> > [ Upstream commit 9f47eb5461aaeb6cb8696f9d11503ae90e4d5cb0 ]
> >
> > Very large I/Os can cause the following RCU CPU stall warning:
> >
> > RIP: 0010:rb_prev+0x8/0x50
> > Code: 49 89 c0 49 89 d1 48 89 c2 48 89 f8 e9 e5 fd ff ff 4c 89 48 10 c3 4c =
> > 89 06 c3 4c 89 40 10 c3 0f 1f 00 48 8b 0f 48 39 cf 74 38 <48> 8b 47 10 48 85 c0 74 22 48 8b 50 08 48 85 d2 74 0c 48 89 d0 48
> > RSP: 0018:ffffc9002212bab0 EFLAGS: 00000287 ORIG_RAX: ffffffffffffff13
> > RAX: ffff888821f93630 RBX: ffff888821f93630 RCX: ffff888821f937e0
> > RDX: 0000000000000000 RSI: 0000000000102000 RDI: ffff888821f93630
> > RBP: 0000000000103000 R08: 000000000006c000 R09: 0000000000000238
> > R10: 0000000000102fff R11: ffffc9002212bac8 R12: 0000000000000001
> > R13: ffffffffffffffff R14: 0000000000102000 R15: ffff888821f937e0
> > __lookup_extent_mapping+0xa0/0x110
> > try_release_extent_mapping+0xdc/0x220
> > btrfs_releasepage+0x45/0x70
> > shrink_page_list+0xa39/0xb30
> > shrink_inactive_list+0x18f/0x3b0
> > shrink_lruvec+0x38e/0x6b0
> > shrink_node+0x14d/0x690
> > do_try_to_free_pages+0xc6/0x3e0
> > try_to_free_mem_cgroup_pages+0xe6/0x1e0
> > reclaim_high.constprop.73+0x87/0xc0
> > mem_cgroup_handle_over_high+0x66/0x150
> > exit_to_usermode_loop+0x82/0xd0
> > do_syscall_64+0xd4/0x100
> > entry_SYSCALL_64_after_hwframe+0x44/0xa9
> >
> > On a PREEMPT=n kernel, the try_release_extent_mapping() function's
> > "while" loop might run for a very long time on a large I/O. This commit
> > therefore adds a cond_resched() to this loop, providing RCU any needed
> > quiescent states.
> >
> > Signed-off-by: Paul E. McKenney <paulmck@xxxxxxxxxx>
>
> Paul,
>
> this patch was well hidden in some huge RCU pile
> (https://lore.kernel.org/lkml/20200623002147.25750-11-paulmck@xxxxxxxxxx/)
>
> I wonder why you haven't CCed linux-btrfs, I spotted the patch queued
> for stable by incidentally. The timestamp is from June, that's quite
> some time ago. We can deal with one more patch and I tend to reply with
> acks quickly for easy patches like this to not block other peoples work
> but I'm a bit disappointed by sidestepping maintained subsystems. It's
> not just this patch, it happens from time time only to increase the
> disapointement.

My bad, and please accept my apologies. I clearly left out the
step of adding proper Cc: lines. :-/

Thanx, Paul