Re: [PATCH 0/2] Simple concepts extracted from tip/numa/core.

From: Mel Gorman
Date: Fri May 03 2013 - 08:48:59 EST


On Wed, May 01, 2013 at 11:19:50PM +0530, Srikar Dronamraju wrote:
> Hi,
>
> Here is an attempt to pick a few interesting patches from tip/numa/core.
> For the initial set, I have selected the last_nidpid patch (which was
> last_cpupid) plus the "make gcc not reread the page tables" patch.
>
> Here are the performance results of running autonumabenchmark on an
> 8-node, 64-core system. Each of these tests was run for 5 iterations.
>
>
> KernelVersion: v3.9
> Testcase:                 Min      Max      Avg
> numa01:               1784.16  1864.15  1800.16
> numa01_THREAD_ALLOC:   293.75   315.35   311.03
> numa02:                 32.07    32.72    32.59
> numa02_SMT:             39.27    39.79    39.69
>
> KernelVersion: v3.9 + last_nidpid + gcc: no reread patches
> Testcase:                 Min      Max      Avg   %Change
> numa01:               1774.66  1870.75  1851.53    -2.77%
> numa01_THREAD_ALLOC:   275.18   279.47   276.04    12.68%
> numa02:                 32.75    34.64    33.13    -1.63%
> numa02_SMT:             32.00    36.65    32.93    20.53%
>
> We do see some degradation in the numa01 and numa02 cases, mostly
> because of the last_nidpid patch. However, last_nidpid helps the
> thread_alloc and smt cases and forms the basis for a few more
> interesting ideas in tip/numa/core.
>
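
For anyone skimming, my reading of the two concepts being picked up
here: last_nidpid packs the NUMA node id and the low bits of the pid of
the last task to fault on a page into spare page flag bits, analogous
to the last_cpupid encoding it is derived from, and the gcc patch is
about not rereading the page tables in the fault path. Very roughly,
and with all names below invented for illustration rather than taken
from the patches:

	/* Hypothetical helpers sketching the nid+pid packing. */
	#define LASTNIDPID_PID_BITS	8
	#define LASTNIDPID_PID_MASK	((1 << LASTNIDPID_PID_BITS) - 1)

	static inline int nidpid_encode(int nid, int pid)
	{
		/* Node id in the high bits, truncated pid in the low bits. */
		return (nid << LASTNIDPID_PID_BITS) | (pid & LASTNIDPID_PID_MASK);
	}

	static inline int nidpid_to_nid(int nidpid)
	{
		return nidpid >> LASTNIDPID_PID_BITS;
	}

	static inline int nidpid_to_pid(int nidpid)
	{
		return nidpid & LASTNIDPID_PID_MASK;
	}

Comparing nidpid_to_pid() against the pid of the currently faulting
task is what lets the fault path guess whether accesses to a page are
private or shared, at the cost of occasional false matches on the
truncated pid. The "no reread" part is in the spirit of

	pte_t entry = ACCESS_ONCE(*pte);	/* snapshot the pte once */

so that gcc cannot reload the pte from memory between checks.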

Unfortunately I did not have time to review the patches properly, but I
ran some of the same tests that were used for NUMA balancing originally.

One of the threads segfaulted when running specjbb in single JVM mode
with the patches applied, so either there is a stability issue in there
or the patches make an existing problem with migration easier to hit by
virtue of migrating more aggressively.

Specjbb in multi-JVM mode showed some performance improvements, with a
4% improvement at the peak, but the results for many-thread instances
were a lot more variable with the patches applied. System CPU time
increased by 16% and the number of pages migrated increased by 18%.

NAS-MPI showed both performance gains and losses, but again system CPU
time increased by 9.1% and 30% more pages were migrated with the
patches applied.

For autonuma, system CPU time was reduced by 40% for numa01 *but* it
increased by 70%, 34% and 9% for NUMA01_THREADLOCAL, NUMA02 and
NUMA02_SMT respectively, and 45% more pages were migrated overall.

So while there are some performance improvements, they are not
universal, there is at least one stability issue, and I'm not keen on
the large increase in system CPU cost and in the number of pages being
migrated as a result of the patches when there is no co-operation with
the scheduler to make processes a bit stickier on a node once memory
has been migrated locally.

--
Mel Gorman
SUSE Labs