Re: [PATCH] get_maintainer: correctly parse UTF-8 encoded names in files

From: Duje Mihanović
Date: Mon Oct 16 2023 - 10:37:48 EST


On Saturday, October 14, 2023 7:22:44 PM CEST Alvin Šipraga wrote:
> From: Alvin Šipraga <alsi@xxxxxxxxxxxxxxx>
>
> While the script correctly extracts UTF-8 encoded names from the
> MAINTAINERS file, the regular expressions damage my name when parsing
> from .yaml files. Fix this by replacing the Latin-1-compatible regular
> expressions with the unicode property matcher \p{Latin}. It's also
> necessary to instruct Perl to open all files with UTF-8 encoding.
>
> The issue was also identified on the b4 mailing list [1]. This should
> solve the observed side effects there as well.
>
> Link: https://lore.kernel.org/all/20230726-gush-slouching-a5cd41@meerkat/
[1]
> Signed-off-by: Alvin Šipraga <alsi@xxxxxxxxxxxxxxx>
> ---
> scripts/get_maintainer.pl | 7 ++++---
> 1 file changed, 4 insertions(+), 3 deletions(-)

Tested-by: Duje Mihanović <duje.mihanovic@xxxxxxxx>