Quick Links

Re: UTF-8 encoding problem w/ libpq

From:	Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
To:	Heikki Linnakangas <hlinnakangas(at)vmware(dot)com>
Cc:	"ktm(at)rice(dot)edu" <ktm(at)rice(dot)edu>, Martin Schäfer <Martin(dot)Schaefer(at)cadcorp(dot)com>, "pgsql-hackers(at)postgresql(dot)org" <pgsql-hackers(at)postgresql(dot)org>
Subject:	Re: UTF-8 encoding problem w/ libpq
Date:	2013-06-03 18:28:30
Message-ID:	4852.1370284110@sss.pgh.pa.us
Views:	Whole Thread \| Raw Message \| Download mbox \| Resend email
Thread:
Lists:	pgsql-hackers

Heikki Linnakangas <hlinnakangas(at)vmware(dot)com> writes:
> He *is* using UTF-8. Or trying to, anyway :-). The downcasing in the
> backend is supposed to leave bytes with the high-bit set alone, ie. in
> UTF-8 encoding, it's supposed to leave and alone.

Well, actually, downcase_truncate_identifier() is doing this:

unsigned char ch = (unsigned char) ident[i];

if (ch >= 'A' && ch <= 'Z')
ch += 'a' - 'A';
else if (IS_HIGHBIT_SET(ch) && isupper(ch))
ch = tolower(ch);

There's basically no way that that second case can give pleasant results
in a multibyte encoding, other than by not doing anything. I suspect
that Windows' libc has fewer defenses than other implementations and
performs some transformation that we don't get elsewhere. This may also
explain the gripe yesterday in -general about funny results in OS X.

We talked about this before and went off into the weeds about whether
it was sensible to try to use towlower() and whether that wouldn't
create undesirably platform-sensitive results. I wonder though if we
couldn't just fix this code to not do anything to high-bit-set bytes
in multibyte encodings.

regards, tom lane

In response to

Re: UTF-8 encoding problem w/ libpq at 2013-06-03 16:22:59 from Heikki Linnakangas

Responses

Re: UTF-8 encoding problem w/ libpq at 2013-06-03 18:41:37 from Heikki Linnakangas
Re: UTF-8 encoding problem w/ libpq at 2013-06-03 18:41:44 from Andrew Dunstan

Browse pgsql-hackers by date

	From	Date	Subject
Next Message	Jim Nasby	2013-06-03 18:41:17	Re: Optimising Foreign Key checks
Previous Message	Andres Freund	2013-06-03 18:12:00	Re: Vacuum, Freeze and Analyze: the big picture