Re: [RFC][PATCH 0/3] x86/nmi: Print all cpu stacks from NMI safely

From: Steven Rostedt
Date: Thu Jun 19 2014 - 18:58:24 EST


On Thu, 19 Jun 2014 23:56:36 +0200 (CEST)
Jiri Kosina <jkosina@xxxxxxx> wrote:


> I agree this is less intrusive than having printk() use two versions of
> the buffers and perform merging, OTOH, it doesn't really seem to be
> fully clean and systematic solution either.

I'm not sure why you consider it not fully clean. I think it is quite
elegant ;-)

>
> I had a different idea earlier today, and Petr seems to have implemented
> it already; I guess he'll be sending it out as RFC tomorrow for
> comparision.
>
> The idea basically is to *switch* what arch_trigger_all_cpu_backtrace()
> and arch_trigger_all_cpu_backtrace_handler() are doing; i.e. use the NMI
> as a way to stop all the CPUs (one by one), and let the CPU that is
> sending the NMIs around to actually walk and dump the stacks of the CPUs
> receiving the NMI IPI.

And this is cleaner? Stopping a CPU via NMI and then what happens if
something else goes wrong and that CPU never starts back up? This
sounds like something that can cause more problems than it was
reporting on.

The code had better handle all the strange cases where an NMI comes in
after the code that sent it timed out waiting for the NMIs to return.
Then you also need to print out the data while the NMIs still spin.

>
> It's the most trivial aproach I've been able to come up with, and should
> be usable for everybody (RCU stall detector and sysrq). The only tricky
> part is: if we want pt_regs to be part of the dump as well, how to pass
> those cleanly between the 'stopped' CPU and the CPU that is doing the
> printing. Other than that, it's just moving a few lines of code around, I
> believe.

Note, I first tried to just save the data and pass it to the calling
functions, but found that rather messy, as there is a lot of stuff
being dumped, and it's different if it is x86_64 vs i386 or if the NMI
happened when in userspace or kernel. That's when I realized it would
be trivial to just switch what printk does and then have the NMI do the
printing to another NMI safe buffer.

It was even more convenient that trace_seq already existed and made it
much easier to implement.

-- Steve

--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/