From: | Oleg Bartunov <oleg(at)sai(dot)msu(dot)su> |
---|---|
To: | Pavel Stehule <stehule(at)kix(dot)fsv(dot)cvut(dot)cz> |
Cc: | pgsql-general(at)postgresql(dot)org, openfts discussion <openfts-general(at)lists(dot)sourceforge(dot)net> |
Subject: | Re: questions about tsearch2 (for czech language) |
Date: | 2003-12-22 13:55:47 |
Message-ID: | Pine.GSO.4.58.0312221651500.14104@ra.sai.msu.su |
Views: | Raw Message | Whole Thread | Download mbox | Resend email |
Thread: | |
Lists: | pgsql-general |
Pavel,
did you restart psql session after modifying tsearch2 configuration ?
btw, there is czech dictionary available from http://lingucomponent.openoffice.org/download_dictionary.html
We have utility to convert myspell dicts to ispell one. It's included
in 7.5 development. Patch for 7.4 could be downloaded from
http://www.sai.msu.su/~megera/postgres/gist/tsearch/V2/
Also, historically, we use openfts mailing list for discussion of
tsearch2.
Oleg
On Mon, 22 Dec 2003, Pavel Stehule wrote:
> > > result. Why? Have I problem with my configuration?
> >
> > did you specify stop words in dictionaries configuration ?
> >
> > select * from pg_ts_dict;
> >
> tsearch2=# select * from pg_ts_dict where dict_name ='cz_ispell';
> -[ RECORD 1
> ]---+--------------------------------------------------------------------------------------------------------------------------
> dict_name | cz_ispell
> dict_init | 173405
> dict_initoption |
> DictFile="/usr/lib/ispell/czech",AffFile="/usr/lib/ispell/czech.aff",StopFile="/usr/local/pgsql/share/contrib/czech.stop"
> dict_lexize | 173406
> dict_comment |
>
> [postgres(at)usop root]$ cat /usr/local/pgsql/share/contrib/czech.stop|grep -e "^[sv]."
> se
> sem
> si
> svЫj
> ve
> vАm
> vА╧
> viz
> vy
>
> > >
> > > 2. I use small czech dictionary. I need don't erase words which aren't in
> > > dictionary (in my sample StЛhule). Can I set it somewhere? I tryed add
> > > simple dict into cfg map, but witout sucess
> > >
> >
> > Example, please ! What do you mean 'erase words' ?
> >
> >
> > > tsearch2=# select * from ts_debug('jmenuji se Pavel StЛhule a bydlМm ve
> > > Skalici.'); ts_name | tok_type | description | token |
> > > dict_name | tsvector
> > > ---------------+----------+-------------+---------+--------------------+-----------
> > > default_czech | word | Word | StЛhule | {cz_ispell,simple} |
> > > default_czech | lword | Latin word | a | {cz_ispell,simple} |
> > > default_czech | word | Word | bydlМm | {cz_ispell,simple} |
> > > 'bydlet'
> > >
> > >
>
> If tsearch didn't find word in dictionary, then erase this from result.
> True? My surname, fo example isn't in dictionary, but I wont save this
> word in result (tsvector).
>
> I use
>
> tsearch2=# select version();
> version
> -------------------------------------------------------------------------------------------------------
> PostgreSQL 7.4RC2 on i686-pc-linux-gnu, compiled by GCC gcc (GCC) 3.3
> 20030715 (Red Hat Linux 3.3-14)
>
>
Regards,
Oleg
_____________________________________________________________
Oleg Bartunov, sci.researcher, hostmaster of AstroNet,
Sternberg Astronomical Institute, Moscow University (Russia)
Internet: oleg(at)sai(dot)msu(dot)su, http://www.sai.msu.su/~megera/
phone: +007(095)939-16-83, +007(095)939-23-83
From | Date | Subject | |
---|---|---|---|
Next Message | Mike Mascari | 2003-12-22 14:48:21 | Re: Tables Referencing themselves As Foreign Keys |
Previous Message | Dave Cramer | 2003-12-22 13:48:14 | Re: MySQL Gets Functions in Java - Enlightenment Please |