Quick Links

Re: default_text_search_config

From:	Tatsuo Ishii <ishii(at)postgresql(dot)org>
To:	itagaki(dot)takahiro(at)oss(dot)ntt(dot)co(dot)jp
Cc:	tgl(at)sss(dot)pgh(dot)pa(dot)us, pgsql-hackers(at)postgresql(dot)org
Subject:	Re: default_text_search_config
Date:	2007-10-05 07:12:58
Message-ID:	20071005.161258.56048359.t-ishii@sraoss.co.jp
Views:	Whole Thread \| Raw Message \| Download mbox \| Resend email
Thread:
Lists:	pgsql-hackers

> Tatsuo Ishii <ishii(at)postgresql(dot)org> wrote:
>
> > For me the idea that a text-search configuration maps to a
> > locale/language seems to be totally wrong. IMO an encoding/charset
> > could include several languages and a text-search configuration should
> > be mapped to an encoding/charset, rather than a language.
>
> I think mapping by encoding/charset *is* totally wrong and by locale is
> reasonable. How do you treat LATIN1? It can be used in French and German,
> etc. Moreover, UTF-8 can be used in almost all languages.
>
> The tight mapping of EUC_jp <=> Japanese is a special case in the world.

What? I didn't say that an encoding/charset is mapped to single
language. Actually EUC_JP includes Japanese, English(ascii), Greek,
Cyrillic and so on. So for the full text search being able to process
EUC_JP text properly, it should be able to process multiple languages
at a time.

You know that PostgreSQL allows only one locale for a PostgreSQL
cluster, and the fact that text-search being depending on locale
prevent it from processing multi language text.

The only solution I can think of today is creating new parser which
can process EUC_JP properly (I mean it can process not only Japanese
but also English) and use it on C locale/EUC_JP cluster. I would do
this for 8.4 if I have time.
--
Tatsuo Ishii
SRA OSS, Inc. Japan

In response to

Re: default_text_search_config at 2007-10-05 06:45:16 from ITAGAKI Takahiro

Responses

Re: default_text_search_config at 2007-10-05 07:40:25 from ITAGAKI Takahiro
Re: default_text_search_config at 2007-10-05 15:03:04 from Tom Lane

Browse pgsql-hackers by date

	From	Date	Subject
Next Message	ITAGAKI Takahiro	2007-10-05 07:40:25	Re: default_text_search_config
Previous Message	ITAGAKI Takahiro	2007-10-05 06:45:16	Re: default_text_search_config