Re: [PATCH] get_maintainer: correctly parse UTF-8 encoded names in files

From: Joe Perches
Date: Mon Oct 16 2023 - 18:18:04 EST


On Mon, 2023-10-16 at 16:37 +0200, Duje Mihanović wrote:
> On Saturday, October 14, 2023 7:22:44 PM CEST Alvin Šipraga wrote:
> > From: Alvin Šipraga <alsi@xxxxxxxxxxxxxxx>
> >
> > While the script correctly extracts UTF-8 encoded names from the
> > MAINTAINERS file, the regular expressions damage my name when parsing
> > from .yaml files. Fix this by replacing the Latin-1-compatible regular
> > expressions with the unicode property matcher \p{Latin}.

Well, OK

> > It's also
> > necessary to instruct Perl to open all files with UTF-8 encoding.

But I'm not at all sure this is actually desired.