How to find broken UTF-8 characters ?

From: Andreas <maps(dot)on(at)gmx(dot)net>
To: pgsql-sql(at)postgresql(dot)org
Subject: How to find broken UTF-8 characters ?
Date: 2010-04-26 00:12:43
Message-ID: 4BD4DA7B.3000603@gmx.net
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-sql

Hi,

I regularly have to import from Excel files, that hold rather simple
text and number columns.
That works.

Occasionally there are unwanted special characters at the end of
text-columns that Exel and pgAdmin either show as a upward arrow with a
short leg on top 90° to the right or others are invisible with UTF-8
aware programs or get dispayed as ? by Access.
A text viewer shows "ÔÇÄ" or E2 80 8E in Hex for the invisible thingy.

My database is unicode so it doesn't mind those freak-chars.
The problem rises when I need to export those records to CSV with
pgAdmin. pgAdmin complains about not beeing able to store those lines in
the local charset.

How can I find those broken UTF-8 characters?
How can I get rid of them?

Responses

Browse pgsql-sql by date

  From Date Subject
Next Message silly sad 2010-04-26 10:12:26 Re: [SPAM]-D] How to find broken UTF-8 characters ?
Previous Message junaidmalik14 2010-04-23 10:45:08 Re: count function alternative in postgres