RE: Commit 34d76c41 causes linker errors on ia64 with NR_CPUS=4096

From: Jiri Kosina
Date: Thu Oct 22 2009 - 10:49:13 EST


On Wed, 21 Oct 2009, Luck, Tony wrote:

> But ... the architecturally supported page sizes go up by powers of 4,
> so next choice from 64K is 256K then 1M, 4M, etc. This is also requires
> an edit of source code and re-compile. We could easily make it a config
> option ... but that is still inconvenient.
>
> The bloat introduced by adding percpu variables is multiplied by NR_CPUS
> ... and in my case that is 4096. It is easy to just shrug this off and
> say that such big systems have plenty of memory anyway, but the case
> that led to this issue (adding a percpu object that included a [NR_CPUS]
> array) shows that, IMHO, people are do not care enough about the bloat.
>
> I suspect that if I just increase the percpu area to 256K or 1M, I'll
> see this same issue when someone adds:
>
> struct foo {
> char buf[NR_CPUS][PAGE_SIZE];
> };
> DECLARE_PER_CPU(struct foo, bar);
>
> which needs 4k * 64k = 256M of per-cpu space ... i.e. 1T total.
>
> If such code is going to be deemed acceptable, then we do need
> to move away from the ia64 TLB mapped percpu area.

Well, I must say I slightly agree that my gut feeling is that we should
try to avoid arrays which size depends on NR_CPUS as much as possible.

Now, what to do for 2.6.32? We definitely need some kind of fix for this,
otherwise the Altix guys would kill us.

Tony, is the change that will eventually have to be made to ia64 pagefault
handler too intrusive for -rc6, and should we rather go with my workaround
instead, and try to find something proper for 2.6.33?

Pure revert of 34d76c41 isn't possible as well, as it fixes a real bug.

--
Jiri Kosina
SUSE Labs, Novell Inc.
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/