Do LIKE and regular-expression matching work with all encodings in Postgres if the locale is set to C?
(This would of course break sorting for any language except English, if I understand things correctly.)
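To illustrate what I mean about sorting: as far as I understand it, the C locale compares strings by raw byte value. Below is a rough sketch in Python, not Postgres itself, that assumes the strings are stored as UTF-8 and mimics that ordering by sorting on the encoded bytes. The word list and the function name are made up for the example.

```python
# Sketch: approximate C-locale ordering by comparing the UTF-8 bytes
# of each string. Assumes a UTF-8 server encoding (an assumption).
words = ["zebra", "apple", "Zulu", "éclair", "Äpfel"]


def c_locale_sort(items):
    """Sort strings by their UTF-8 byte sequences (byte-wise comparison)."""
    return sorted(items, key=lambda s: s.encode("utf-8"))


byte_sorted = c_locale_sort(words)
print(byte_sorted)
# ['Zulu', 'apple', 'zebra', 'Äpfel', 'éclair']
```

Accented words land after "zebra" because their first byte is above 0x7F, and uppercase sorts before lowercase, so even English text is only ordered by byte value rather than dictionary order.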
Although it's not going to happen because of memory constraints, it'd be a LOT simpler if everything internal to Postgres - disk, tables, and memory - were UTF-32.
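For a sense of the trade-off, here is a small Python sketch (again just an illustration, nothing to do with the Postgres internals) comparing how many bytes the same text takes in UTF-8 and in UTF-32. The sample strings and the helper name are made up.

```python
# Sketch: compare the storage size of the same text in UTF-8 and UTF-32.
samples = ["hello", "héllo", "日本語", "😀"]


def encoded_sizes(text):
    """Return (utf8_bytes, utf32_bytes) for a string."""
    utf8 = len(text.encode("utf-8"))
    # "utf-32-le" is used so no byte-order mark is added to the count.
    utf32 = len(text.encode("utf-32-le"))
    return utf8, utf32


for s in samples:
    print(s, encoded_sizes(s))
# hello (5, 20)
# héllo (6, 20)
# 日本語 (9, 12)
# 😀 (4, 4)
```

UTF-32 always uses 4 bytes per code point, so the Nth character sits at byte offset 4*N, which is the simplicity I'm after. The cost is that plain ASCII text takes four times the space.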