Re: Best practices for moving UTF8 databases

From: Sam Mason <sam(at)samason(dot)me(dot)uk>
To: pgsql-general(at)postgresql(dot)org
Subject: Re: Best practices for moving UTF8 databases
Date: 2009-07-22 23:04:04
Message-ID: 20090722230402.GW5407@samason.me.uk
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-general

On Wed, Jul 22, 2009 at 05:26:37PM +0800, Phoenix Kiula wrote:
> I tried this. Get an error.
>
> mypg=# select * from interesting WHERE NOT description ~ ( '^('||
> mypg(# $$[\09\0A\0D\x20-\x7E]|$$|| -- ASCII
> mypg(# $$[\xC2-\xDF][\x80-\xBF]|$$|| -- non-overlong 2-byte
> mypg(# $$\xE0[\xA0-\xBF][\x80-\xBF]|$$|| -- excluding overlongs
> mypg(# $$[\xE1-\xEC\xEE\xEF][\x80-\xBF]{2}|$$|| -- straight 3-byte
> mypg(# $$\xED[\x80-\x9F][\x80-\xBF]|$$|| -- excluding surrogates
> mypg(# $$\xF0[\x90-\xBF][\x80-\xBF]{2}|$$|| -- planes 1-3
> mypg(# $$[\xF1-\xF3][\x80-\xBF]{3}|$$|| -- planes 4-15
> mypg(# $$\xF4[\x80-\x8F][\x80-\xBF]{2}$$|| -- plane 16
> mypg(# '*)$' )

doh, I put the * in the wrong place! that last line should be:

')*$' )

at least that's what looks strange to me now--not sure how it got moved
though!

--
Sam http://samason.me.uk/

In response to

Browse pgsql-general by date

  From Date Subject
Next Message Andreas Wenk 2009-07-22 23:05:22 Re: Can LIKE under utf8 use INDEXes?
Previous Message Sam Mason 2009-07-22 22:58:16 Re: Select Column Auditing/Logging