Re: [PATCH] 2.6 workaround for Athlon/Opteron prefetch errata

From: Andi Kleen
Date: Thu Sep 11 2003 - 00:12:04 EST


On Thu, Sep 11, 2003 at 06:58:23AM +0200, dada1 wrote:
>
> From: "Andi Kleen" <ak@xxxxxx>
> >
> > Of course they may want to also fix it in a different way to run on older
> > kernels (e.g. handling the signal in user space or avoiding the
> conditions).
> > But doing it centrally in the kernel is a bit cleaner and at some point
> > people have to update their kernels anyways.
>
> Could you be kind enough to post here the example code for a SIGSEGV handler
> that would be necessary for old kernels ?

I don't have any. But it would be very similar to the in kernel checking
code (see the is_prefetch function in my patches). Just you feed it
the fields from sigcontext in the signal handler and replace __get_user
with a normal memory access.

>
> I do think it woul help some people like me, for the future googling on the
> prefetch errata.
>
> I do use preftechnta instructions on my programs, and this errata could
> explain some strange crashes.

The bogus faults are very easy to diagnose. When you have a core dump
and disassemble the faulting instruction (in gdb x/i $eip) and it is
a prefetch (prefetch/prefetchw/prefetchnt*) then it could be that.

If it is a different instruction it is unrelated.

It would also only happen when you prefetch ever on unmapped addresses.

>
> As the program crashing is a huge multi-threaded network application, with
> up to 300000 opened TCP sockets, the SIGSEGV fault is usually followed by a
> system crash (networks buffers using all of lowmem)

That sounds like an unrelated issue.

When user space crashes on this the kernel is unaffected.

In case the 2.6 kernel crashes on this (2.4 does not trigger it)
then you can also run the oops through ksymoops and check if the
faulting instruction is prefetch. If it isn't then it is something else.

Network buffers using up all of low mem and then crash
is likely some OOM handling problem. If you're on 2.4 try an -aa kernel,
they handle this much better than the marcelo tree. If it's 2.6 then
I would recommend posting oopses on this list, maybe someone can fix
it. I suspect 2.6's OOM handling could be still improved.

-Andi
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/