Re: [PATCH] sched: select 'idle' cfs_rq per task-group to prevent tg-internal imbalance

From: Michael wang
Date: Wed Jul 02 2014 - 22:10:11 EST


On 07/02/2014 08:49 PM, Peter Zijlstra wrote:
> On Wed, Jul 02, 2014 at 10:47:34AM +0800, Michael wang wrote:
>> The opinion on features actually make me a little confusing... I used to
>> think the scheduler is willing on providing kinds of way to adapt itself
>> to different situation, and some features do help on that, make them
>> only a debug option make the scheduler more static to users, but I know
>> that's not what I should touched...
>>
>> Anyway, the problem is still there and seems like we have to adopt some
>> optional solution to address it, we'll still working and practice on
>> that, please let us know if you have any ideas we could help on ;-)
>
> No, its very much not intended as a user knob to adapt.
>
> The problem with that approach to scheduling is that one set of knobs
> runs workload A, while another set of knobs runs workload B, but what
> happens if you need to run both A and B on the same machine?

That's actually what make me think the scheduler should be able to adapt
itself, our performance testing mostly based on static kinds of
workload, and that's too far away from the real world.

We tested A, we tested B, we tested A + B, but what if C joined? what if
folks only like A, don't care B?

The key point here is, scheduler still can't promise that all kinds of
situation is taking good care under it's static policy... it just
running there, won't get any feedback from user, thinking it's doing the
right thing, but sometimes not from admin's view.

>
> So optional things and knobs are a complete fail and will not happen.

But how could we promise that our static policy is the best?

For example the 'sysctl_sched_min_granularity', why it should be 750k?
is that the best on any kinds of platform to any kinds of workloads at
any time?

Regards,
Michael Wang

>
> Clearly I need to go take out all these things because people don't seem
> to know this and SCHED_DEBUG isn't a big enough hint. Tedious.
>

--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/