Re: Could not Store French Accent Marks Correctly in Postgres

From: Ludwig Kniprath <ludwig(at)kni-online(dot)de>
To: "Wang, Mary Y" <mary(dot)y(dot)wang(at)boeing(dot)com>
Cc: "pgsql-general(at)postgresql(dot)org" <pgsql-general(at)postgresql(dot)org>
Subject: Re: Could not Store French Accent Marks Correctly in Postgres
Date: 2010-08-21 11:21:50
Message-ID: 4C6FB6CE.2080404@kni-online.de
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-general

Am 20.08.2010 23:10, schrieb Wang, Mary Y:
> Hi,
> I'm having a problem right now. Some of our French users uploaded
> some files with file names that had French accent marks, and those
> file names were inserted into the Postgres database. When I examined
> the value of those file names, they all had some weird characters (the
> weird characters were in the same position where the accent marks were
> entered). I do not know how to handle this kind of situation. Most
> of my users are US based, but I have been told that there will be more
> international users in the future.
> So my questions are:
> (1) What is the best character encoding that would work for most
> of those languages that have accent marks?
> (2) I assume that I also need to do some kind of conversion in the
> front end (PHP) as well.
> I'm running on Linux and Postgres 8.3.8.
> Any ideas?
> Thanks in advance.
> Mary Wang
Hi,
our solution for storing uploaded files in database/filesystem with php
uses utf-8 for the filenames in the database in combination with
string-replacement for some special characters in php. These are in our
case the german "Umlaute" (ä,ö,ü,ß), because otherwise we get the
problem of strange translations of these characters (php uses utf-8,
german windows uses cp-1250), that made them unusable for
download-links. You can use the function below, just add your special
characters to the $trans-array. As another benefit this function returns
unique filenames that can be used for storing the files in a
target-directory.

<SNIP>
public static function get_unique_file_name($target_dir,
$current_file_name){
$trans = array ("ä" => "ae", "ö" => "oe", "ü" => "ue", "ß" => "ss",
"Ä" => "Ae", "Ö" => "Oe", "Ü" => "Ue");
target_file_name = strtr($current_file_name, $trans);
$i = 0;
$old_target_file_name = $target_file_name;
while(file_exists($target_dir . '/' . $target_file_name)){
$i++;
$target_file_name = $i . $old_target_file_name;
}
return $target_file_name;
}
</SNIP>

Ludwig

In response to

Browse pgsql-general by date

  From Date Subject
Next Message Jonathan Bond-Caron 2010-08-21 15:13:21 Re: Could not Store French Accent Marks Correctly in Postgres
Previous Message Filip Rembiałkowski 2010-08-21 09:08:07 Re: searchable book database