| From: | Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us> |
|---|---|
| To: | Håvard Wahl Kongsgård <haavard(dot)kongsgaard(at)gmail(dot)com> |
| Cc: | pgsql-general <pgsql-general(at)postgresql(dot)org> |
| Subject: | Re: UTF-8 and Regular expression |
| Date: | 2011-05-31 21:26:21 |
| Message-ID: | 911.1306877181@sss.pgh.pa.us |
| Views: | Whole Thread | Raw Message | Download mbox | Resend email |
| Thread: | |
| Lists: | pgsql-general |
=?ISO-8859-1?Q?H=E5vard_Wahl_Kongsg=E5rd?= <haavard(dot)kongsgaard(at)gmail(dot)com> writes:
> Hi, in 8.4 how does the regular expression functions in postgresql handle
> special UTF-8 characters?
Badly :-(
> for example:
> SELECT name,substring(name from E'\\w+\\s(\\w+)$') from nodes;
> fails to select characters like
Should work in 9.0, but no chance in earlier releases. You need to use
a single-byte encoding such as LATIN1 if you need to do this in older
releases. In any release, make sure you're using an LC_COLLATE setting
that's appropriate for the language and encoding.
regards, tom lane
| From | Date | Subject | |
|---|---|---|---|
| Next Message | Merlin Moncure | 2011-05-31 21:42:27 | Re: Function Column Expansion Causes Inserts To Fail |
| Previous Message | Pete Chown | 2011-05-31 20:52:04 | Consistency of distributed transactions |