Re: [PATCH 0/3] scripts/spelling.txt: add some spelling pairs and reorder

From: Leizhen (ThunderTown)
Date: Fri Jun 11 2021 - 04:12:54 EST




On 2021/6/11 16:02, Andy Shevchenko wrote:
> On Fri, Jun 11, 2021 at 10:19 AM Zhen Lei <thunder.leizhen@xxxxxxxxxx> wrote:
>>
>> Add spelling_sanitizer.sh and use it to reorder, then add some spelling
>> "mistake||correction" pairs.
>
> The sorting idea is good, but the order is not.
> What you really need is to use language corpus [1] instead. So in such
> case you will eliminate false positives (to some extent).

Yes, by default the sort command orders entries according to the current
locale, which differs from the byte-wise order of strcmp(). "export LC_ALL=C"
needs to be added to the script. This is mentioned in "man sort":

*** WARNING *** The locale specified by the environment affects sort order.
Set LC_ALL=C to get the traditional sort order that uses native byte values.

Let me add it.
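
As a rough illustration (the three entries below are made-up samples in the
"mistake||correction" format, not taken from the patch, and the variable names
are only for this sketch), forcing the C locale on the sort call should give
the byte-wise order that strcmp() would produce, with uppercase ahead of
lowercase:

```shell
#!/bin/sh
# Hypothetical sample entries; not the real contents of scripts/spelling.txt.
entries='accross||across
Accross||across
abandonning||abandoning'

# Force byte-value collation for this one sort call, as "man sort" advises.
# In the C locale 'A' (0x41) sorts before 'a' (0x61), matching strcmp().
sorted=$(printf '%s\n' "$entries" | LC_ALL=C sort)

printf '%s\n' "$sorted"
# Prints:
#   Accross||across
#   abandonning||abandoning
#   accross||across
```

Setting LC_ALL only on the sort command line keeps the change local to that
call; "export LC_ALL=C" at the top of the script would apply the same
collation to every command in it.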

>
> [1]: https://en.wikipedia.org/wiki/Corpus_of_Contemporary_American_English
>