Quick Links

Re: UTF8 national character data type support WIP patch and list of open issues.

From:	"MauMau" <maumau307(at)gmail(dot)com>
To:	"Valentine Gogichashvili" <valgog(at)gmail(dot)com>, <ishii(at)postgresql(dot)org>
Cc:	"Robert Haas" <robertmhaas(at)gmail(dot)com>, "Tom Lane" <tgl(at)sss(dot)pgh(dot)pa(dot)us>, "Boguk, Maksym" <maksymb(at)fast(dot)au(dot)fujitsu(dot)com>, "Heikki Linnakangas" <hlinnakangas(at)vmware(dot)com>, "PostgreSQL Hackers" <pgsql-hackers(at)postgresql(dot)org>
Subject:	Re: UTF8 national character data type support WIP patch and list of open issues.
Date:	2013-09-21 00:07:13
Message-ID:	F98A2B6D87404A7D8134F6C50BAACAD5@maumau
Views:	Whole Thread \| Raw Message \| Download mbox \| Resend email
Thread:
Lists:	pgsql-hackers

From: "Valentine Gogichashvili" <valgog(at)gmail(dot)com>
> the whole NCHAR appeared as hack for the systems, that did not have it
> from
> the beginning. It would not be needed, if all the text would be magically
> stored in UNICODE or UTF from the beginning and idea of character would be
> the same as an idea of a rune and not a byte.

I guess so, too.

> PostgreSQL has a very powerful possibilities for storing any kind of
> encoding. So maybe it makes sense to add the ENCODING as another column
> property, the same way a COLLATION was added?

Some other people in this community suggested that. ANd the SQL standard
suggests the same -- specifying a character encoding for each column:
CHAR(n) CHARASET SET ch.

> Text operations should work automatically, as in memory all strings will
> be
> converted to the database encoding.
>
> This approach will also open a possibility to implement custom ENCODINGs
> for the column data storage, like snappy compression or even BSON, gobs or
> protbufs for much more compact type storage.

Thanks for your idea that sounds interesting, although I don't understand
that well.

Regards
MauMau

In response to

Re: UTF8 national character data type support WIP patch and list of open issues. at 2013-09-20 01:20:02 from Valentine Gogichashvili

Responses

Re: UTF8 national character data type support WIP patch and list of open issues. at 2013-09-22 21:55:40 from Valentine Gogichashvili

Browse pgsql-hackers by date

	From	Date	Subject
Next Message	MauMau	2013-09-21 00:32:27	Re: UTF8 national character data type support WIP patch and list of open issues.
Previous Message	MauMau	2013-09-20 23:58:15	Re: UTF8 national character data type support WIP patch and list of open issues.