RE: rhashtable: ENOMEM errors when hit with a flood of insertions

From: David Laight
Date: Thu Dec 03 2015 - 10:10:04 EST


From: Herbert Xu
> Sent: 03 December 2015 12:51
> On Mon, Nov 30, 2015 at 06:18:59PM +0800, Herbert Xu wrote:
> >
> > OK that's better. I think I see the problem. The test in
> > rhashtable_insert_rehash is racy and if two threads both try
> > to grow the table one of them may be tricked into doing a rehash
> > instead.
> >
> > I'm working on a fix.
>
> While the EBUSY errors are gone for me, I can still see plenty
> of ENOMEM errors. In fact it turns out that the reason is quite
> understandable. When you pound the rhashtable hard so that it
> doesn't actually get a chance to grow the table in process context,
> then the table will only grow with GFP_ATOMIC allocations.
>
> For me this starts failing regularly at around 2^19 entries, which
> requires about 1024 contiguous pages if I'm not mistaken.

ISTM that you should always let the insert succeed, even if it pushes
the average/maximum chain length beyond some limit.
Any limit on the number of hashed items should have been enforced
earlier by the calling code.
The slight performance hit from scanning longer chains is almost
certainly more 'user friendly' than an error return.
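
A rough sketch of that policy, in plain C rather than the actual
rhashtable code (the names are made up for illustration, and
locking/RCU is left out): if the table can't be grown right now,
accept the longer chain and let a deferred resize catch up later.

#include <stdbool.h>

/* Illustrative only - not the rhashtable API.  Insert never fails;
 * when the load factor is exceeded we only *request* a grow and
 * still link the new entry into the (now longer) chain.
 */
struct ht_node {
	struct ht_node *next;
	unsigned long key;
};

struct ht {
	struct ht_node **buckets;	/* nbuckets chain heads */
	unsigned int nbuckets;		/* power of two */
	unsigned long nelems;
	bool grow_pending;		/* picked up later in process context */
};

static void ht_insert(struct ht *ht, struct ht_node *node)
{
	/* Trivial hash for illustration: mask the key into the table. */
	unsigned int hash = node->key & (ht->nbuckets - 1);

	node->next = ht->buckets[hash];
	ht->buckets[hash] = node;

	/* Past a 75% load factor?  Ask for a resize, but never return
	 * an error to the caller: a longer chain is only a slower
	 * lookup, not a failure.
	 */
	if (++ht->nelems > ht->nbuckets - ht->nbuckets / 4)
		ht->grow_pending = true;
}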

Hoping to get 1024+ contiguous VA pages does seem over-optimistic.
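For scale, assuming 8-byte bucket pointers and 4 KiB pages (which
matches the numbers quoted above):

	2^19 bucket pointers * 8 bytes = 4 MiB = 1024 contiguous 4 KiB pages

and an atomic allocation of that size is very unlikely to succeed
under memory pressure.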

With a 2-level lookup you could make all the 2nd-level tables
a fixed size (maybe 4 or 8 pages?) and extend only the first-level
table as needed.
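
Again just an illustrative sketch (reusing the hypothetical ht_node
above, locking omitted): the first-level directory is a small array of
pointers that can be extended, while every second-level leaf is a
fixed, page-sized chunk, so growing the table only ever needs small,
fixed-order allocations.

/* Illustrative 2-level bucket table: a resizable directory of
 * fixed-size, page-sized leaves.
 */
#define LEAF_BUCKETS	512	/* 512 * 8-byte pointers = one 4 KiB page */

struct ht2_leaf {
	struct ht_node *buckets[LEAF_BUCKETS];
};

struct ht2 {
	struct ht2_leaf **dir;	/* first-level table, extended as needed */
	unsigned int nleaves;	/* kept a power of two */
};

/* Map a hash value to its chain head across the two levels. */
static struct ht_node **ht2_bucket(struct ht2 *ht, unsigned int hash)
{
	unsigned int idx = hash & (ht->nleaves * LEAF_BUCKETS - 1);

	return &ht->dir[idx / LEAF_BUCKETS]->buckets[idx % LEAF_BUCKETS];
}

Doubling then means allocating nleaves new page-sized leaves (each a
far easier allocation, even atomically) plus a bigger directory; at
2^19 buckets the directory itself is only 1024 pointers, i.e. 8 KiB.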

David