Re: I cannot insert bengali character in UTF8

From: Christian Ullrich <chris(at)chrullrich(dot)net>
To: AI Rumman <rummandba(at)gmail(dot)com>
Cc: pgsql-general General <pgsql-general(at)postgresql(dot)org>
Subject: Re: I cannot insert bengali character in UTF8
Date: 2012-07-20 13:11:48
Message-ID: 50095914.7080801@chrullrich.net
Lists: pgsql-general

* AI Rumman wrote:

> I am using database with UTF8 and LC_CTYPE set as default value in
> Postgresql 9.1.
> But I cannot insert bengali character in a column.
>
> Query Failed:INSERT into tracker (user_id, module_name, item_id,
> item_summary) values ('1','Leads','353','বাংলা টেস্��...')::ADODB
> error::->ERROR: invalid byte sequence for encoding "UTF8": 0xe0a62e

E0 A6 2E is not valid UTF-8: 11100000 10100110 00101110

The lead byte (1110....) indicates that the code point is encoded in
three bytes, but only the very next byte is a trail byte (10......).
The third byte is a complete single-byte character of its own, a
period ("."), to be exact.

Setting the MSB of the third byte gives us

11100000 10100110 10101110 = E0 A6 AE,

which is a valid UTF-8 encoding of U+09AE BENGALI LETTER MA.
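
The bit-pattern reasoning above can be reproduced with a short Python
sketch. The helper name classify() is made up for this illustration;
it only inspects the high bits of each byte and does not attempt full
UTF-8 validation (overlong forms, surrogates and so on are ignored).

```python
import unicodedata

def classify(byte):
    """Return the role a single byte plays in UTF-8, judged by its high bits."""
    if byte & 0b10000000 == 0b00000000:
        return "single"   # 0....... : complete one-byte character
    if byte & 0b11000000 == 0b10000000:
        return "trail"    # 10...... : continuation byte
    if byte & 0b11100000 == 0b11000000:
        return "lead2"    # 110..... : starts a two-byte sequence
    if byte & 0b11110000 == 0b11100000:
        return "lead3"    # 1110.... : starts a three-byte sequence
    if byte & 0b11111000 == 0b11110000:
        return "lead4"    # 11110... : starts a four-byte sequence
    return "invalid"

# The sequence from the error message.
bad = bytes([0xE0, 0xA6, 0x2E])
roles = [classify(b) for b in bad]

# Set the most significant bit of the third byte: 0x2E -> 0xAE.
fixed = bytes([bad[0], bad[1], bad[2] | 0x80])
char = fixed.decode("utf-8")
name = unicodedata.name(char)

print(roles)                      # roles of E0, A6, 2E
print(fixed.hex(), hex(ord(char)), name)
```

The lead byte promises two trail bytes, but the third byte classifies
as a single-byte character, which is exactly why the server rejects
the sequence.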

Check your input data.
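
One way to do that check before the string ever reaches the server is
to try decoding the raw bytes on the client. The following is a
minimal Python sketch; first_invalid_utf8() is a hypothetical helper,
and the sample data is constructed for illustration (some Bengali
text, a three-byte sequence cut off after two bytes, then "..."), not
taken from the actual application.

```python
def first_invalid_utf8(data):
    """Return (offset, offending bytes) for the first invalid UTF-8
    sequence in data, or None if data decodes cleanly."""
    try:
        data.decode("utf-8")
    except UnicodeDecodeError as err:
        return err.start, data[err.start:err.end]
    return None

# Illustrative sample: valid text, then a truncated sequence, then "...".
prefix = "টেস্".encode("utf-8")
sample = prefix + b"\xe0\xa6" + b"..."

result = first_invalid_utf8(sample)
clean = first_invalid_utf8(prefix + b"...")

if result is not None:
    offset, offending = result
    print("invalid UTF-8 at byte offset", offset, "->", offending.hex())
```

A truncated sequence like this typically appears when text is cut to
a fixed number of bytes rather than characters before an ellipsis is
appended, so that is one place worth looking.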

--
Christian
