Quick Links

Re: More Code Page wierdness

From:	"Magnus Hagander" <mha(at)sollentuna(dot)net>
To:	"Tom Lane" <tgl(at)sss(dot)pgh(dot)pa(dot)us>
Cc:	"Simon Riggs" <simon(at)2ndquadrant(dot)com>, <pgsql-hackers-win32(at)postgresql(dot)org>
Subject:	Re: More Code Page wierdness
Date:	2005-01-11 16:01:25
Message-ID:	6BCB9D8A16AC4241919521715F4D8BCE47657B@algol.sollentuna.se
Views:	Whole Thread \| Raw Message \| Download mbox \| Resend email
Thread:
Lists:	pgsql-hackers-win32

> > Not sure what would be involved in requiring the file to be UTF8.
> > Notepad can certainly handle UTF8, but I wonder how much
> would need to
> > bec hanged in pg...
>
> I think it would Just Work, since UTF8 is an ASCII superset,

Ok.

> whereas apparently some of Windows' code pages are not :-(
> (which is proof of brain death in Redmond if I ever saw it).

I've never seen one that changes any chars <= 127. But it's possible the
editor in this case wrote down a file in the wrong charset.
Looking at http://www.microsoft.com/globaldev/reference/wincp.mspx, "*"
is 002A in *every single one*, inlcuding japanese.

Looking at http://www.microsoft.com/globaldev/reference/oem/437.htm, it
shows that it's 002A in the very encoding Simon said was used in this
case. So it is in
http://www.microsoft.com/globaldev/reference/oem/850.htm, which is the
OEM codepage used in Swedish windows.

I'm having trouble seeing why '*' would have a different value in
different codepages. I'm wondering if we're going down the wrong road
completely here?

//Magnus

Responses

Re: More Code Page wierdness at 2005-01-11 16:22:08 from Bruce Momjian
Re: More Code Page wierdness at 2005-01-11 16:29:17 from Andrew Dunstan
Re: More Code Page wierdness at 2005-01-11 16:49:11 from Tom Lane

Browse pgsql-hackers-win32 by date

	From	Date	Subject
Next Message	Bruce Momjian	2005-01-11 16:22:08	Re: More Code Page wierdness
Previous Message	Magnus Hagander	2005-01-11 15:54:15	Re: More Code Page wierdness