Re: [PATCH] ARM : sparsemem: Crashes on ARM platform when sparsememenabled in linux-2.6.???35.13 due to pfn_valid(???) andpfn_valid_???within().

From: Mel Gorman
Date: Wed Aug 03 2011 - 06:56:20 EST


On Tue, Aug 02, 2011 at 01:36:49PM +0530, Kautuk Consul wrote:
> Hi,
>
> On my ARM machine, I have linux-2.6.35.13 installed and the total
> kernel memory is not aligned to the section size SECTION_SIZE_BITS.
>
> I observe kernel crashes in the following 3 scenarios:
> i) When we do a "cat /proc/pagetypeinfo": This happens because the
> pfn_valid() macro is not able to detect invalid PFNs in the loop in
> vmstat.c: pagetypeinfo_showblockcount_print().
> ii) When we do "echo xxxx > /proc/vm/sys/min_free_kbytes": This
> happens because the pfn_valid() macro is not able to detect invalid
> PFNs in page_alloc.c: setup_zone_migrate_reserve().
> iii) When I try to copy a really huge file: This happens because the
> CONFIG_HOLES_IN_ZONE config option is not set.
> The code then crashes in the VM_BUG_ON in loop in
> move_freepages() as pfn_valid_within() did not compile correctly to
> pfn_valid().
>
> This patch is a combination of :
> a) Back-ported changes of the patch from Will Deacon found at:
> http://git.kernel.org/?p=linux/kernel/git/stable/linux-3.0.y.git;a=commit;h=7b7bf499f79de3f6c85a340c8453a78789523f85
> b) Addition of the CONFIG_HOLES_IN_ZONE config option to
> arch/arm/Kconfig in order to prevent crashes in move_freepages()
> when/if the total kernel memory is not aligned to SECTION_SIZE_BITS.
> This also leads to
> proper compilation of the pfn_valid_within() macro which otherwise
> will always return 1 to the caller.
>

Ok, for -stable backports, it is required that each patch is a backport
of a single commit. This is to ensure that there are no accidental
forks or fixes that get lost. If this patch is a combination of two
patches, then the expected format for review would be in three mails

Mail 1: [Patch 0/2] A leader explaining why the series is required. The
information you have above is fine

Mail 2: [Patch 1/2] would be commit
[7b7bf499: ARM: 6913/1: sparsemem: allow pfn_valid to be
overridden when using SPARSEMEM]

The only difference between that commit and your commit would
be that it's being sent to stable and at the beginning of the
mail you should have

"commit: 7b7bf499f79de3f6c85a340c8453a78789523f85"

This is to the history of the patch is clear at a glance

Mail 3: [Patch 2/2] would be whatever commit introduces
CONFIG_HOLES_IN_ZONE. If this is not in mainline already,
it should be merged to mainline before backporting.

See Documentation/stable_kernel_rules.txt .

In this format for example, I would not even have to review patch 1/2
properly other than checking it's equivalent to the mainstream patch
that I already ack'd. It'd probably have been picked up by now :/ .

I don't know about patch 2 yet because I didn't go to the effort of
applying this patch, reverting the upstream commit and examining the
remainder.

I know this appears awkward but review bandwidth is extremely limited
and it's important that fixes do not get lost.

--
Mel Gorman
SUSE Labs
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/