Re: [PATCH v4 1/2] pid: Replace pid bitmap implementation with IDR API

From: Rik van Riel
Date: Tue Oct 10 2017 - 11:46:29 EST


On Tue, 2017-10-10 at 13:35 +0100, Gargi Sharma wrote:
> On Tue, Oct 10, 2017 at 12:50 PM, Oleg Nesterov <oleg@xxxxxxxxxx>
> wrote:
> > On 10/09, Andrew Morton wrote:
> > >
> > > > @@ -240,17 +230,11 @@ void zap_pid_ns_processes(struct
> > > > pid_namespace *pid_ns)
> > > > ÂÂÂÂÂ*
> > > > ÂÂÂÂÂ*/
> > > > ÂÂÂÂread_lock(&tasklist_lock);
> > > > -ÂÂÂnr = next_pidmap(pid_ns, 1);
> > > > -ÂÂÂwhile (nr > 0) {
> > > > -ÂÂÂÂÂÂÂÂÂÂÂrcu_read_lock();
> > > > -
> > > > -ÂÂÂÂÂÂÂÂÂÂÂtask = pid_task(find_vpid(nr), PIDTYPE_PID);
> > > > +ÂÂÂnr = 2;
> > > > +ÂÂÂidr_for_each_entry_continue(&pid_ns->idr, pid, nr) {
> > > > +ÂÂÂÂÂÂÂÂÂÂÂtask = pid_task(pid, PIDTYPE_PID);
> > > > ÂÂÂÂÂÂÂÂÂÂÂÂif (task && !__fatal_signal_pending(task))
> > > > ÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂsend_sig_info(SIGKILL, SEND_SIG_FORCED,
> > > > task);
> > > > -
> > > > -ÂÂÂÂÂÂÂÂÂÂÂrcu_read_unlock();
> > > > -
> > > > -ÂÂÂÂÂÂÂÂÂÂÂnr = next_pidmap(pid_ns, nr);
> > > > ÂÂÂÂ}
> > > > ÂÂÂÂread_unlock(&tasklist_lock);
> > >
> > > Especially here.ÂÂI don't think pidmap_lock is held.ÂÂIs that IDR
> > > iteration safe?
> >
> > Yes, this doesn't look right, we need rcu_read_lock() or
> > pidmap_lock.
> >
> > And, we also need rcu_read_lock() for another reason, to protect
> > "struct pid".
>
> Ah, I missed this. From what I understood idr_for_each_entry_continue
> should be safe because calls idr_get_next which in turn calls
> radix_tree_iter_find to find the next populated entry in the idr. If
> the pid that you are looking up the task for is deleted, task will
> get
> a NULL from pid_task and no signal to kill will be sent.
> >
> > Gargi, I suggested to use idr_for_each_entry_continue(), but now I
> > am wondering
> > if we should use idr_for_each() instead. IIUC this would be a bit
> > faster? Not
> > that I think this is really important...
>
> I can run benchmarks with idr_for_each to see how much speed up is
> achieved and then we can go with whatever we think is better. How
> does
> that sounds?

I suspect this code will not be a hot path in any
conceivable "kill off hundreds of containers"
benchmark, since the overhead of having all of the
tasks in those containers exit will dwarf any
changes in this code.

Simply making it safe for fully preemptible
kernels by adding rcu_read_lock() around the
section is what matters the most.

The choice between idr_for_each_entry_continue()
and idr_for_each() is dictated more by which
of the two results in easier to read code.

--
All rights reversed

Attachment: signature.asc
Description: This is a digitally signed message part