sched_domains + NUMA issue

From: Anton Blanchard
Date: Sun Aug 29 2004 - 06:22:34 EST



Hi,

We are seeing errors from the sched domains debug code when SMT and NUMA
are both enabled. Nathan pointed out that the recent change to limit the
number of nodes in a scheduling group may be causing this, in particular
sched_domain_node_span.

It looks like ia64 is the only architecture implementing a reasonable
node_distance; the others just do:

#define node_distance(from,to) (from != to)

On these architectures I wonder if we should disable the
sched_domain_node_span code, since every remote node looks equally far
away and we will just get a random grouping of CPUs.
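
To make the concern concrete, here is a minimal sketch of a
nearest-nodes selection driven by the placeholder node_distance quoted
above. It is not the kernel's sched_domain_node_span; the helpers
next_best_node() and node_span(), the MAX_NODES limit and the
lowest-index tie-break are all assumptions made for illustration.

```c
#include <assert.h>
#include <limits.h>

#define MAX_NODES 32

/* Placeholder distance from the mail above (parentheses added). */
#define node_distance(from, to) ((from) != (to))

/* Hypothetical helper: closest node to @node that is not yet in @used. */
static int next_best_node(int node, int nr_nodes, unsigned long used)
{
	int best = -1, best_dist = INT_MAX;

	for (int n = 0; n < nr_nodes; n++) {
		if (used & (1UL << n))
			continue;

		int dist = node_distance(node, n);

		/* Strict '<': when distances tie, the lowest index wins. */
		if (dist < best_dist) {
			best_dist = dist;
			best = n;
		}
	}
	return best;
}

/* Hypothetical helper: bitmask of the @size nodes "nearest" to @node. */
static unsigned long node_span(int node, int nr_nodes, int size)
{
	unsigned long span = 1UL << node;

	assert(nr_nodes <= MAX_NODES && node >= 0 && node < nr_nodes);

	for (int i = 1; i < size; i++) {
		int n = next_best_node(node, nr_nodes, span);

		if (n < 0)	/* ran out of nodes */
			break;
		span |= 1UL << n;
	}
	return span;
}
```

With four nodes and a span size of two, this sketch gives 0x3 for nodes
0 and 1, 0x5 for node 2 and 0x9 for node 3. Because all remote nodes tie
at distance 1, the result depends only on scan order, so node 2's span
includes node 0 while node 0's span does not include node 2.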

Anton

CPU0: online
domain 0: span 00000000,00000000,00000000,00000003
groups: 00000000,00000000,00000000,00000001 00000000,00000000,00000000,00000002
domain 1: span 00000000,00000000,00000000,00000003
groups: 00000000,00000000,00000000,00000003
domain 2: span 00000000,00000000,00000000,000f0003
groups: 00000000,00000000,00000000,00000003 00000000,00000000,00000000,000f0000
CPU1: online
domain 0: span 00000000,00000000,00000000,00000003
groups: 00000000,00000000,00000000,00000002 00000000,00000000,00000000,00000001
domain 1: span 00000000,00000000,00000000,00000003
groups: 00000000,00000000,00000000,00000003
ERROR parent span is not a superset of domain->span
domain 2: span 00000000,00000000,00000000,000f0000
ERROR domain->span does not contain CPU1
groups: 00000000,00000000,00000000,00000003 00000000,00000000,00000000,000f0000
ERROR groups don't span domain->span
CPU16: online
domain 0: span 00000000,00000000,00000000,00030000
groups: 00000000,00000000,00000000,00010000 00000000,00000000,00000000,00020000
domain 1: span 00000000,00000000,00000000,000f0000
groups: 00000000,00000000,00000000,00030000 00000000,00000000,00000000,000c0000
domain 2: span 00000000,00000000,00000000,000f0003
groups: 00000000,00000000,00000000,000f0000 00000000,00000000,00000000,00000003
CPU17: online
domain 0: span 00000000,00000000,00000000,00030000
groups: 00000000,00000000,00000000,00020000 00000000,00000000,00000000,00010000
domain 1: span 00000000,00000000,00000000,000f0000
groups: 00000000,00000000,00000000,00030000 00000000,00000000,00000000,000c0000
domain 2: span 00000000,00000000,00000000,000f0000
groups: 00000000,00000000,00000000,000f0000 00000000,00000000,00000000,00000003
ERROR groups don't span domain->span
CPU18: online
domain 0: span 00000000,00000000,00000000,000c0000
groups: 00000000,00000000,00000000,00040000 00000000,00000000,00000000,00080000
domain 1: span 00000000,00000000,00000000,000f0000
groups: 00000000,00000000,00000000,000c0000 00000000,00000000,00000000,00030000
ERROR parent span is not a superset of domain->span
domain 2: span 00000000,00000000,00000000,00000000
ERROR domain->span does not contain CPU18
groups: 00000000,00000000,00000000,000f0000 00000000,00000000,00000000,00000003
ERROR groups don't span domain->span
CPU19: online
domain 0: span 00000000,00000000,00000000,000c0000
groups: 00000000,00000000,00000000,00080000 00000000,00000000,00000000,00040000
domain 1: span 00000000,00000000,00000000,000f0000
groups: 00000000,00000000,00000000,000c0000 00000000,00000000,00000000,00030000
ERROR parent span is not a superset of domain->span
domain 2: span 00000000,00000000,00000000,00000000
ERROR domain->span does not contain CPU19
groups: 00000000,00000000,00000000,000f0000 00000000,00000000,00000000,00000003
ERROR groups don't span domain->span
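
For anyone reading the log, the three ERROR lines correspond to three
mask invariants. The sketch below restates them under some assumptions:
check_domain() and the ERR_* flags are hypothetical names, this is not
the kernel's debug code, and the masks are truncated to the low 64 bits
of the values printed above.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Flags named after the three messages in the log above. */
enum {
	ERR_SPAN_MISSING_CPU    = 1 << 0, /* domain->span does not contain CPU */
	ERR_GROUPS_DONT_SPAN    = 1 << 1, /* groups don't span domain->span */
	ERR_PARENT_NOT_SUPERSET = 1 << 2, /* parent span is not a superset */
};

/*
 * Hypothetical checker for one domain level of one CPU.
 * @parent_span is NULL for the top-level domain.
 */
static unsigned int check_domain(int cpu, uint64_t span,
				 const uint64_t *groups, size_t nr_groups,
				 const uint64_t *parent_span)
{
	unsigned int errs = 0;
	uint64_t covered = 0;

	assert(cpu >= 0 && cpu < 64);

	/* The domain must contain the CPU it belongs to. */
	if (!(span & (UINT64_C(1) << cpu)))
		errs |= ERR_SPAN_MISSING_CPU;

	/* The union of the groups must equal the domain span. */
	for (size_t i = 0; i < nr_groups; i++)
		covered |= groups[i];
	if (covered != span)
		errs |= ERR_GROUPS_DONT_SPAN;

	/* Every CPU in this span must also be in the parent's span. */
	if (parent_span && (span & ~*parent_span))
		errs |= ERR_PARENT_NOT_SUPERSET;

	return errs;
}
```

Feeding in CPU1's values from the log: domain 1 (span 0x3, parent span
0xf0000) trips only the superset check, and domain 2 (span 0xf0000,
groups 0x3 and 0xf0000) trips the other two, which matches the three
ERROR lines printed for CPU1.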