[PATCHSET] block, mempool, percpu: implement percpu mempool and fix blkcg percpu alloc deadlock

From: Tejun Heo
Date: Thu Dec 22 2011 - 16:45:40 EST


Hello, guys.

This patchset implements mempool for percpu memory and use it to solve
allocation deadlock problem in block cgroup paths. Percpu memory
allocator can't be called from memory reclaim path, mostly because its
on-demand chunk filling hooks into vmalloc area management, which in
turn hooks into arch pagetable code.

This usually isn't a problem but block cgroup code wants to do
opportunistic percpu allocation to track statistics in IO issue path.
Currently, it directly calls alloc_percpu() from IO path triggering
lockdep warning and falling into deadlocks under the right conditions.

This patchset adds percpu mempool which supports async refilling and
uses that as allocation buffer to solve the above problem. It solves
the problem for the next merge window but I'm not sure about what to
do about this window and -stable. The mempool updates seems a bit too
invasive. Maybe implement a stripped down allocation buffer in
block-cgroup?

This patchset contains the following seven patches.

0001-mempool-fix-and-document-synchronization-and-memory-.patch
0002-mempool-drop-unnecessary-and-incorrect-BUG_ON-from-m.patch
0003-mempool-fix-first-round-failure-behavior.patch
0004-mempool-factor-out-mempool_fill.patch
0005-mempool-separate-out-__mempool_create.patch
0006-mempool-percpu-implement-percpu-mempool.patch
0007-block-fix-deadlock-through-percpu-allocation-in-blk-.patch

0001-0003 are general mempool updates and already in -mm. Note that
Oleg has an alternative patch for 0003 which seems better to me.

0004-0006 prepare for and implement percpu mempool.

0007 uses it in block cgroup to fix the deadlock problem.

This patchset is on top of block/for-3.3/core but applies on mainline
too.

If people agree with this approach, it would probably be easier to
route everything through block, I think. Andrew, Jens, how do you
guys want to route these?

Oleg, if you can send patch with proper description and SOB, I'll
replace 0003 with yours.

The patchset is also available in the following git branch.

git://git.kernel.org/pub/scm/linux/kernel/git/tj/misc.git block-blkcg-pcpu-fix

and contains the following changes.

block/blk-cgroup.c | 37 +++++
block/blk-cgroup.h | 2
block/blk-throttle.c | 2
block/cfq-iosched.c | 2
include/linux/mempool.h | 80 ++++++++++++
mm/mempool.c | 310 +++++++++++++++++++++++++++++++++++++-----------
6 files changed, 363 insertions(+), 70 deletions(-)

Thanks.

--
tejun
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/