Re: [PATCH v2 2/2] x86/hyperv: remove on-stack cpumask from hv_send_ipi_mask_allbutself

From: Wei Liu
Date: Wed Oct 06 2021 - 07:39:20 EST


Hi Thomas and Vitaly

Sorry for the late reply. I was buried in my other work.

On Tue, Oct 05, 2021 at 02:53:29PM +0200, Vitaly Kuznetsov wrote:
> Thomas Gleixner <tglx@xxxxxxxxxxxxx> writes:
>
> > Wei!
> >
>
> Not Wei here but I don't see the question answered on the mailing list
> so let me give my thoughts.
>
> > On Fri, Sep 10 2021 at 18:57, Wei Liu wrote:
> >> -static bool __send_ipi_mask_ex(const struct cpumask *mask, int vector)
> >> +static bool __send_ipi_mask_ex(const struct cpumask *mask, int vector,
> >> + bool exclude_self)
> >> {
> >> struct hv_send_ipi_ex **arg;
> >> struct hv_send_ipi_ex *ipi_arg;
> >> @@ -123,7 +124,10 @@ static bool __send_ipi_mask_ex(const struct cpumask *mask, int vector)
> >>
> >> if (!cpumask_equal(mask, cpu_present_mask)) {
> >
> > Not part of that patch, but is checking cpu_present_mask correct here?
> > If so then this really lacks a comment for the casual reader.
>
> It seems it *was* correct prior to 'exclude_self': the idea is that for
> everything but 'cpu_present_mask' we use HV_GENERIC_SET_SPARSE_4K
> format, for 'cpu_present_mask' we just use 'all' (HV_GENERIC_SET_ALL)
> to avoid specifying individual CPUs.

Yes, that's the intent.

It was correct before because cpumask would have been filtered to
exclude "self" when it came to this function.

>
> >
> >> ipi_arg->vp_set.format = HV_GENERIC_SET_SPARSE_4K;
> >> - nr_bank = cpumask_to_vpset(&(ipi_arg->vp_set), mask);
> >> + if (exclude_self)
> >> + nr_bank = cpumask_to_vpset_noself(&(ipi_arg->vp_set), mask);
> >> + else
> >> + nr_bank = cpumask_to_vpset(&(ipi_arg->vp_set), mask);
> >> }
> >
> > But, what happens in the case that mask == cpu_present_mask and
> > exclude_self == true?
> >
> > AFAICT it ends up sending the IPI to all CPUs including self:
> >
> > if (!nr_bank)
> > ipi_arg->vp_set.format = HV_GENERIC_SET_ALL;
> >
> > Not entirely correct, right?
>
> It's not, I think we need something like (completely untested)
>
> diff --git a/arch/x86/hyperv/hv_apic.c b/arch/x86/hyperv/hv_apic.c
> index 32a1ad356c18..80b7660208e4 100644
> --- a/arch/x86/hyperv/hv_apic.c
> +++ b/arch/x86/hyperv/hv_apic.c
> @@ -122,17 +122,17 @@ static bool __send_ipi_mask_ex(const struct cpumask *mask, int vector,
> ipi_arg->reserved = 0;
> ipi_arg->vp_set.valid_bank_mask = 0;
>
> - if (!cpumask_equal(mask, cpu_present_mask)) {
> + if (!cpumask_equal(mask, cpu_present_mask) || exclude_self) {
> ipi_arg->vp_set.format = HV_GENERIC_SET_SPARSE_4K;
> if (exclude_self)
> nr_bank = cpumask_to_vpset_noself(&(ipi_arg->vp_set), mask);
> else
> nr_bank = cpumask_to_vpset(&(ipi_arg->vp_set), mask);
> - }
> - if (nr_bank < 0)
> - goto ipi_mask_ex_done;
> - if (!nr_bank)
> + if (nr_bank =< 0)
> + goto ipi_mask_ex_done;
> + } else {
> ipi_arg->vp_set.format = HV_GENERIC_SET_ALL;
> + }
>
> status = hv_do_rep_hypercall(HVCALL_SEND_IPI_EX, 0, nr_bank,
> ipi_arg, NULL);
>
> here. Wei, I can test and send this out if you're not on it already.
>

Please turn this into a patch and send it out. Thank you so much for
looking into it.

Wei.

> --
> Vitaly
>