| From: | Ramanarayana <raam(dot)soft(at)gmail(dot)com> |
|---|---|
| To: | PostgreSQL Developers <pgsql-hackers(at)lists(dot)postgresql(dot)org> |
| Cc: | Hugh Ranalli <hugh(at)whtc(dot)ca>, Thomas Munro <thomas(dot)munro(at)enterprisedb(dot)com>, Michael Paquier <michael(at)paquier(dot)xyz> |
| Subject: | Unaccent extension python script Issue in Windows |
| Date: | 2019-03-11 16:24:45 |
| Message-ID: | CAKm4Xs7_61XMyOWmHs3n0mmkS0O4S0pvfWk=7cQ5P0gs177f7A@mail.gmail.com |
| Views: | Whole Thread | Raw Message | Download mbox | Resend email |
| Thread: | |
| Lists: | pgsql-hackers |
Hi Hackers,
In master branch, unaccent extension is having issue with the below python
script.This issue is only in windows 10 and python 3.
python generate_unaccent_rules.py --unicode-data-file UnicodeData.txt
--latin-ascii-file Latin-ASCII.xml > unaccent.rules
I am getting the following error
UnicodeEncodeError: 'charmap' codec can't encode character '\u0100' in
position 0: character maps to <undefined>
I went through the python script and found that the stdout encoding is set
to utf-8 only if python version is <=2. The same needs to be done for
python 3
--
Cheers
Ram 4.0
| From | Date | Subject | |
|---|---|---|---|
| Next Message | Alvaro Herrera | 2019-03-11 16:27:47 | Re: move hash_any to utils/hash/hashfn.c |
| Previous Message | Andrew Dunstan | 2019-03-11 15:59:50 | Re: pgbench MAX_ARGS |