Re: [RFCv1 3/6] PASR: mm: Integrate PASR in Buddy allocator

From: Mel Gorman
Date: Tue Jan 31 2012 - 09:01:56 EST


On Tue, Jan 31, 2012 at 01:15:55PM +0100, Maxime Coquelin wrote:
> Hello Mel,
> On 01/30/2012 05:52 PM, Maxime Coquelin wrote:
> >
> >On 01/30/2012 04:22 PM, Mel Gorman wrote:
> >
> >>You may be able to use the existing arch_alloc_page() hook and
> >>call PASR on architectures that support it if and only if PASR is
> >>present and enabled by the administrator but even this is likely to be
> >>unpopular as it'll have a measurable performance impact on platforms
> >>with PASR (not to mention the PASR lock will be even heavier as it'll
> >>now be also used for per-cpu page allocations). To get the hook you
> >>want, you'd need to show significant benefit before they were happy with
> >>the hook.
> >Your proposal sounds good.
> >AFAIK, per-cpu allocation maximum size is 32KB. Please correct me
> >if I'm wrong.
> >Since pasr_kget/kput() calls the PASR framework only on MAX_ORDER
> >allocations, we wouldn't add any locking risks nor contention
> >compared to current patch.
> >I will update the patch set using arch_alloc/free_page().
> >
> I just had a deeper look at when arch_alloc_page() is called. I
> think it does not fit with PASR framework needs.
> pasr_kget() calls pasr_get() only for max order pages (same for
> pasr_kput()) to avoid overhead.
>

I see. My bad.

> In current patch set, pasr_kget() is called when pages are removed
> from the free lists, and pasr_kput() when pages are inserted in the
> free lists.
> So, pasr_get() is called in case of :
> - allocation of a max order page
> - split of a max order page into lower order pages to fulfill
> allocation of pages smaller than max order
> And pasr_put() is called in case of:
> - release of a max order page
> - coalescence of two "max order -1" pages when smaller pages are
> released
>
> If we call the PASR framework in arch_alloc_page(), we have two
> possibilities:
> 1) using pasr_kget(): the PASR framework will only be notified
> of max order allocations, so the coalesce/split of free pages case
> will not be taken into account.
> 2) using pasr_get(): the PASR framework will be called for every
> orders of page allocation/release. The induced overhead is not
> acceptable.
>
> To avoid calling pasr_kget/kput() directly in page_alloc.c, do you
> think adding some arch specific hooks when a page is inserted or
> removed from the free lists could be acceptable?

It's not the name that is the problem, I'm strongly against any hook
that can delay the page allocator for arbitrary lengths of time like
this. I am open to being convinced otherwise but for me PASR would
need to demonstrate large savings for a wide variety of machines and
the alternatives would have to be considered and explained why they
would be far inferior or unsuitable.

For example - it seems like this could be also be done with a
balloon driver instead of page allocator hooks. A governer would
identify when the machine was under no memory pressure or triggered
from userspace. To power down memory, it would use page reclaim and
page migration to allocate large contiguous ranges of memory - CMA
could potentially be adapted when it gets merged to save a lot of
implementation work. The governer should register a slab shrinker
so that under memory pressure it gets called so it can shrink the
ballon, power the DIMMS back up and free the memory back to the
buddy allocator. This would keep all the cost out of the allocator
paths and move the cost to when the machine is either idle (in the
case of powering down) or under memory pressure (where the cost of
powering up will be small in comparison to the overall cost of the
page reclaim operation).

--
Mel Gorman
SUSE Labs
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/