RE: [PATCH next v4 1/5] minmax: Add umin(a, b) and umax(a, b)

From: David Laight
Date: Fri Jan 12 2024 - 09:27:51 EST


From: Dan Carpenter
> Sent: 12 January 2024 14:03
>
> On Fri, Jan 12, 2024 at 01:40:30PM +0000, David Laight wrote:
> > From: Dan Carpenter
> > > Sent: 12 January 2024 12:50
> > >
> > > On Mon, Sep 18, 2023 at 08:16:30AM +0000, David Laight wrote:
> > > > +/**
> > > > + * umin - return minimum of two non-negative values
> > > > + * Signed types are zero extended to match a larger unsigned type.
> > > > + * @x: first value
> > > > + * @y: second value
> > > > + */
> > > > +#define umin(x, y) \
> > > > + __careful_cmp((x) + 0u + 0ul + 0ull, (y) + 0u + 0ul + 0ull, <)
> > >
> > > Why do we match "a larger unsigned type" instead of ULL_MAX? Presumably
> > > it helps performance somehow... I agree that it's probably fine but I
> > > would be more comfortable if it skipped UINT_MAX and jumped directly to
> > > ULONG_MAX. These days 4 gigs is small potatoes. The vmalloc() function
> > > can allocate 4G so we've had integer overflow bugs with this before.
> >
> > The '+ 0ul*' carefully zero extend signed values without changing
> > unsigned values.
> > The compiler detects when it has zero-extended both sides and
> > uses the smaller compare.
> > In essence:
> > x + 0u converts 'int' to 'unsigned int'.
> > Avoids the sign extension adding 0ul on 64bit.
> > x + 0ul converts a 'long' to 'unsigned long'.
> > Avoids the sign extension adding 0ull on 32bit
> > x + 0ull converts a 'long long' to 'unsigned long long'.
> > You need all three to avoid sign extensions and get an unsigned
> > compare.
>
> So unsigned int compares are faster than unsigned long compares?
>
> It's just sort of weird how it works.
>
> min_t(unsigned long, -1, 10000000000)); => 10000000000
> umin(umin(-1, 10000000000)); => UINT_MAX
>
> UINT_MAX is just kind of a random value. I would have prefered
> ULONG_MAX, it's equally random but it's more safe because nothing can
> allocate ULONG_MAX bytes.

umin() is only defined for non-negative values.
So that example is really outside the domain of the function.

Consider:
int x = some_positive_value;
unsigned long long y;
then:
min_t(unsigned long long, x, y);
Does (unsigned long long)x which is (unsigned long long)(long long)x
and requires that x be sign extended to 64bits.
On 32bit that is quite horrid.
whereas:
umin(x, y);
Only has to zero extend x.
So is compiled as:
y:hi || y:lo > x ? x ; y

If both values are 32bit the compiler generates a 32bit compare
(even on 64bit).

David

-
Registered Address Lakeside, Bramley Road, Mount Farm, Milton Keynes, MK1 1PT, UK
Registration No: 1397386 (Wales)