Re: [PATCHSET] workqueue: concurrency managed workqueue, take#4

From: Tejun Heo
Date: Tue Mar 16 2010 - 12:02:12 EST


Hello,

On 03/16/2010 11:38 PM, David Howells wrote:
>> Well, you can RR queue them but in general I don't think things like
>> that would be much of a problem for IO bounded works.
>
> "RR queue"? Do you mean realtime?

I meant round-robin as a last resort, but if fscache really needs
such a workaround, cmwq is probably a bad fit for it.
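
To make "RR queue" concrete, it would amount to something like the
following untested sketch on top of the existing queue_work_on() API
(illustrative only, not something cmwq or fscache actually does;
locking around rr_next_cpu omitted for brevity):

#include <linux/workqueue.h>
#include <linux/cpumask.h>

static int rr_next_cpu = -1;

static void rr_queue_work(struct workqueue_struct *wq,
			  struct work_struct *work)
{
	/* advance to the next online cpu, wrapping at the end */
	rr_next_cpu = cpumask_next(rr_next_cpu, cpu_online_mask);
	if (rr_next_cpu >= nr_cpu_ids)
		rr_next_cpu = cpumask_first(cpu_online_mask);

	/* queue on that cpu instead of the local one */
	queue_work_on(rr_next_cpu, wq, work);
}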

>> If it becomes bad, scheduler will end up moving the source around
>
> "The source"? Do you mean the process that's loading the deferred
> work items onto the workqueue? Why should it get moved? Isn't it
> pinned to a CPU?

Whatever the source may be. If a cpu gets loaded heavily by the
fscache workload, tasks which aren't pinned to that cpu will be
distributed to other cpus. But again, I have a difficult time
imagining cpu loading being an actual issue for fscache even in
pathological cases. It's almost strictly IO bound, and CPU-intensive
stuff sitting in the IO path already has, or should grow, mechanisms
to schedule properly anyway (see the sketch below).
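
The usual mechanism for that is an explicit cond_resched() between
units of work; a hypothetical example (the helpers here are made up,
this is not fscache code):

#include <linux/workqueue.h>
#include <linux/sched.h>

static void example_work_fn(struct work_struct *work)
{
	/* CPU-heavy loop inside an otherwise IO-bound work item */
	while (more_units_to_process()) {	/* made-up helper */
		process_one_unit();		/* made-up helper */
		cond_resched();	/* let the scheduler run other tasks */
	}
}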

>> and for most common cases, those group queued works are gonna hit similar
>> code paths over and over again during their short CPU burn durations so it's
>> likely to be more efficient.
>
> True.
>
>> Are you seeing maleffects of cpu affine work scheduling during
>> fscache load tests?
>
> Hard to say. Here are some benchmarks:

Yay, some numbers. :-) I reorganized them for easier comparison.

(*) cold/cold-ish server, cold cache:

         SLOW-WORK    CMWQ
  real   2m0.974s     1m5.154s
  user   0m0.492s     0m0.628s
  sys    0m15.593s    0m14.397s

(*) hot server, cold cache (two runs each):

         SLOW-WORK               CMWQ
  real   1m31.230s   1m13.408s   1m1.240s    1m4.012s
  user   0m0.612s    0m0.652s    0m0.732s    0m0.576s
  sys    0m17.845s   0m15.641s   0m13.053s   0m14.133s

(*) hot server, warm cache (two runs each):

         SLOW-WORK               CMWQ
  real   3m22.108s   3m52.557s   3m10.949s   4m9.805s
  user   0m0.636s    0m0.588s    0m0.636s    0m0.648s
  sys    0m13.317s   0m16.101s   0m14.065s   0m13.505s

(*) hot server, hot cache (two runs each):

         SLOW-WORK               CMWQ
  real   1m54.331s   2m2.745s    1m22.511s   2m57.075s
  user   0m0.596s    0m0.608s    0m0.612s    0m0.604s
  sys    0m11.457s   0m12.625s   0m11.629s   0m12.509s

(*) hot server, no cache:

         SLOW-WORK    CMWQ
  real   1m1.508s     0m54.973s
  user   0m0.568s     0m0.712s
  sys    0m15.457s    0m13.969s

> Note that it took me several goes to get a second result for this
> case: it kept failing in a way that suggested that the
> non-reentrancy stuff you put in there failed somehow, but it's
> difficult to say for sure.

Sure, there could be a bug in the non-reentrancy implementation, but
I'm leaning more towards a bug in the flush-work-before-freeing
logic, which also seems to show up in the debugfs path. I'll try to
reproduce the problem here and debug it.
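
For reference, the pattern I mean is the usual flush-before-free
rule; a generic sketch with a hypothetical struct, not the actual
fscache code:

#include <linux/workqueue.h>
#include <linux/slab.h>

struct my_object {			/* hypothetical */
	struct work_struct work;
	/* ... payload ... */
};

static void my_object_destroy(struct my_object *obj)
{
	/*
	 * Make sure any queued or running instance of obj->work has
	 * finished (and can't requeue itself) before the memory
	 * embedding it goes away; otherwise the worker can run
	 * against freed memory.
	 */
	cancel_work_sync(&obj->work);
	kfree(obj);
}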

That said, the numbers look generally favorable to CMWQ although the
sample size is too small to draw conclusions. I'll try to get things
fixed up so that testing can be smoother.

Thanks a lot for testing.

--
tejun