From: | Tatsuo Ishii <ishii(at)postgresql(dot)org> |
---|---|
To: | nagata(at)sraoss(dot)co(dot)jp |
Cc: | daniel(at)yesql(dot)se, pgsql-hackers(at)postgresql(dot)org |
Subject: | Re: Doc: typo in config.sgml |
Date: | 2024-09-30 11:07:31 |
Message-ID: | 20240930.200731.666191321972616000.ishii@postgresql.org |
Views: | Raw Message | Whole Thread | Download mbox | Resend email |
Thread: | |
Lists: | pgsql-hackers |
>> I wonder if it would be worth to add a check for this like we have to tabs?
+1.
>> The attached adds a rule to "make -C doc/src/sgml check" for trapping nbsp
>> (doing so made me realize we don't have an equivalent meson target).
>
> Your patch couldn't detect 0xA0 in config.sgml in my machine, but it works
> when I use `grep -P "[\xA0]"` instead of `grep -e "\xA0"`.
>
> However, it also detects the following line in charset.sgml.
> (https://www.postgresql.org/docs/current/collation.html)
>
> For example, locale und-u-kb sorts 'àe' before 'aé'.
>
> This is not non-breaking space, so should not be detected as an error.
That's because non-breaking space (nbsp) is not encoded as 0xa0 in
UTF-8. nbsp in UTF-8 is "0xc2 0xa0" (2 bytes) (A 0xa0 is a nbsp's code
point in Unicode. i.e. U+00A0).
So grep -P "[\xC2\xA0]" should work to detect nbsp.
Best reagards,
--
Tatsuo Ishii
SRA OSS K.K.
English: http://www.sraoss.co.jp/index_en/
Japanese:http://www.sraoss.co.jp
From | Date | Subject | |
---|---|---|---|
Next Message | Daniel Westermann (DWE) | 2024-09-30 11:10:09 | pg_basebackup and error messages dependent on the order of the arguments |
Previous Message | Alexander Lakhin | 2024-09-30 11:00:00 | Re: msys inet_pton strangeness |