Re: [GENERAL] Czech2ASCII with --mb=Latin2

From: Peter Eisentraut <peter_e(at)gmx(dot)net>
To: Robert <robert(at)robert(dot)cz>
Cc: pgsql-general(at)postgresql(dot)org
Subject: Re: [GENERAL] Czech2ASCII with --mb=Latin2
Date: 1999-12-17 00:31:07
Message-ID: Pine.LNX.4.21.9912161929560.5199-100000@localhost.localdomain
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-general

On 1999-12-15, Robert mentioned:

> I have a database in Latin2 encoding (Czech stuff) and Latin2/Win1250
> on-the-fly recoding with 'set client_encoding' works smoothly. Now, when
> I set client encoding to SQL_ASCII, accented characters are converted to
> (hexa) codes. Is there any (simple) way to make this recoding convert
> accented characters to just the chars themselves but without accents?

I think this sort of thing has been the dream of many folks using
internationalized software, but it's not that easy. Perhaps one could
write a function that does this sort of conversion, which would have to
keep a gigantic table internally.

However, perhaps in your language it's customary to just leave off the
diacritic marks if they're not available, but in other languages such as
Swedish or German there are rules about converting those to sequences of
other letters. And if you start encoding rules of natural languages into
software, oh boy ...

--
Peter Eisentraut Sernanders väg 10:115
peter_e(at)gmx(dot)net 75262 Uppsala
http://yi.org/peter-e/ Sweden

In response to

Browse pgsql-general by date

  From Date Subject
Next Message Peter Eisentraut 1999-12-17 00:32:19 Re: [HACKERS] \copy problem
Previous Message Jose Soares 1999-12-16 16:27:34 \copy problem