Re: PG_zero

From: Andrew Morton
Date: Tue Nov 02 2004 - 16:08:15 EST


"Martin J. Bligh" <mbligh@xxxxxxxxxxx> wrote:
>
> > The idea that the quicklist is meant to take the lock once every X pages
> > is limiting. The object is to never ever have to enter the buddy, not
> > just to "buffer" allocations.
>
> That'd be nicer, yes.
>
> > The two separated cold/hot lists prevents that. As far as there's a single
> > page available we should use it since bouncing the cacheline is very costly.
>
> Well, it doesn't really prevent it ... it used to work, I think. However,
> the current code appears to be broken - what it's meant to do is refill
> the hot list from the cold list if the hot list was completely empty.
> I could've sworn it used to do that, but no matter

We discussed it, but iirc we worked out that it wouldn't have any useful
effect on the frequency at which we take the buddy lock.

It adds more code and it adds special knowledge of hotness and coldness to
the core page allocator. The intention was that the argument called `cold'
could be renamed to `pcp_index' or whatever so that we could add more head
arrays in the future. For known-to-be-zero pages.

> ... I think we just
> need to fix up the bit in buffered_rmqueue() here:
>
> if (pcp->count <= pcp->low)
> pcp->count += rmqueue_bulk(zone, 0, pcp->batch, &pcp->list);
>
> To say something more like:
>
> if (pcp->count <= pcp->low && !cold)
> <shift some pages from cold to hot>
> if (pcp->count <= pcp->low)
> pcp->count += rmqueue_bulk(zone, 0, pcp->batch, &pcp->list);
>
> Though I'm less than convinced in retrospect that there was any point in
> having low watermarks, rather than running it down to zero. Andrew, can
> you recall why we did that?

Nope.

> > It's really a question if you believe the cache effects are going to be
> > more significant than the cacheline bouncing on the zone lock.
>
> Exactly. The disadvantage of the single list is that cold allocs can steal
> hot pages, which we believe are precious, and as CPUs get faster, will only
> get more so (insert McKenney's bog roll here).

The cold pages are mainly intended to be the pages which will be placed
under DMA transfers. We should never return hot pages in response for a
request for a cold page.
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/