From: | pg(at)thetdh(dot)com |
---|---|
To: | "David E(dot) Wheeler" <david(at)kineticode(dot)com>, "PG Hackers" <pgsql-hackers(at)postgresql(dot)org> |
Cc: | "Hudson, T(dot) David" <pg1(at)thetdh(dot)com> |
Subject: | Re: Unicode Normalization |
Date: | 2009-09-24 13:24:07 |
Message-ID: | W979322959270161253798647@webmail34 |
Views: | Raw Message | Whole Thread | Download mbox | Resend email |
Thread: | |
Lists: | pgsql-hackers |
In a context using normalization, wouldn't you typically want to store a normalized-text type that could perhaps (depending on locale) take advantage of simpler, more-efficient comparison functions? Whether you're doing INSERT/UPDATE, or importing a flat text file, if you canonicalize characters and substrings of identical meaning when trivial distinctions of encoding are irrelevant, you're better off later. User-invocable normalization functions by themselves don't make much sense. (If Postgres now supports binary- or mixed-binary-and-text flat files, perhaps for restore purposes, the same thing applies.)
David Hudson
From | Date | Subject | |
---|---|---|---|
Next Message | Jan Urbański | 2009-09-24 14:21:47 | Re: [PATCH] DefaultACLs |
Previous Message | ning | 2009-09-24 13:17:12 | Re: "BEGIN TRANSACTION" and "START TRANSACTION": different error handling |