Re: [PATCH v10 18/20] timers: Implement the hierarchical pull model

From: Frederic Weisbecker
Date: Thu Jan 25 2024 - 09:31:06 EST


On Mon, Jan 15, 2024 at 03:37:41PM +0100, Anna-Maria Behnsen wrote:
> +/**
> + * tmigr_quick_check() - Quick forecast of next tmigr event when CPU wants to
> + * go idle
> + *
> + * Returns KTIME_MAX, when it is probable that nothing has to be done (not the
> + * only one in the level 0 group; and if it is the only one in level 0 group,
> + * but there are more than a single group active in top level)
> + *
> + * Returns first expiry of the top level group, when it is the only one in level
> + * 0 and top level also only has a single active child.
> + */
> +u64 tmigr_quick_check(void)
> +{
> + struct tmigr_cpu *tmc = this_cpu_ptr(&tmigr_cpu);
> + struct tmigr_group *topgroup;
> + struct list_head lvllist;
> +
> + if (tmigr_is_not_available(tmc))
> + return KTIME_MAX;

Offline CPUs are supposed to handle their own global timers.

So instead of returning KTIME_MAX here, shouldn't we pass instead
tevt->global as a parameter and return that value?

Otherwise the quick check will simply ignore the next global event of this CPU
if it's before the next local event.

> +
> + if (WARN_ON_ONCE(tmc->idle))
> + return KTIME_MAX;

Same here I guess...

> +
> + if (!tmigr_check_migrator_and_lonely(tmc->tmgroup, tmc->childmask))
> + return KTIME_MAX;

This one makes sense.

> +
> + for (int i = tmigr_hierarchy_levels; i > 0 ; i--) {
> + lvllist = tmigr_level_list[i - 1];
> + if (list_is_singular(&lvllist)) {
> + topgroup = list_first_entry(&lvllist, struct
> tmigr_group, list);

Is it safe against concurrent allocation failure in hotplug?

If the list is seen singular, then concurrently a CPU comes up and creates/add
a new group. The current CPU actually fetches it instead of the current group
because it's not singular anymore. But then some higher level group
allocation fails and the newly added first entry is removed.

list_is_singular() looks safe. But list_first_entry isn't. You can create
list_first_entry_rcu:

#define list_first_entry_rcu(ptr, type, member) \
list_entry_rcu((ptr)->next, type, member)

Protected inside rcu_read_lock() until the below READ_ONCE().

And then use list_del_rcu/list_add_rcu/kfree_rcu on the update side.

Isn't it possible to walk through group->parent instead?

> +
> + if (tmigr_check_lonely(topgroup))
> + return READ_ONCE(topgroup->next_expiry);
> + } else {
> + continue;
> + }
> + }
> +
> + return KTIME_MAX;

I'm less sure about that return value.

Thanks.