Re: TSearch2: Auto identify document language?

From: Mark Mitchenall <mark(at)mitchenall(dot)com>
To: pgsql-general(at)postgresql(dot)org
Subject: Re: TSearch2: Auto identify document language?
Date: 2005-12-12 11:48:33
Message-ID: 439D6391.8070006@mitchenall.com
Lists: pgsql-general

On 11/12/05, Hannes Dorbath <light(at)theendofthetunnel(dot)de> wrote:
> Is there a practical way to guess which language a document is
> written in and automagically use the appropriate TSearch config? I
> thought of looking up the document's words in various dictionaries and
> using the one with the most matches. It doesn't matter if performance
> is bad.

Would it be possible to use something like TextCat:

http://odur.let.rug.nl/~vannoord/TextCat/

from a PL/Perl script?
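
For comparison, the dictionary-matching idea from the quoted message could be sketched roughly as below. This is only an illustration in Python, not TextCat's method and not tested against TSearch2: the stopword lists are tiny stand-ins for real dictionaries, and the configuration names in TS_CONFIG are hypothetical placeholders for whatever configs are actually installed.

```python
import re

# Tiny illustrative word lists (assumption); a real setup would load
# full stopword files or dictionaries for each language.
STOPWORDS = {
    "english": {"the", "and", "is", "of", "to", "in", "that", "it"},
    "german": {"der", "die", "das", "und", "ist", "nicht", "ein", "zu"},
    "french": {"le", "la", "les", "et", "est", "une", "des", "que"},
}

# Hypothetical mapping from detected language to a TSearch2 config name.
TS_CONFIG = {
    "english": "default",
    "german": "default_german",
    "french": "default_french",
}


def guess_language(text, default="english"):
    """Return the language whose word list matches the most words in text."""
    # Split into lowercase runs of letters (Unicode-aware, no digits).
    words = re.findall(r"[^\W\d_]+", text.lower())
    scores = {
        lang: sum(1 for w in words if w in stops)
        for lang, stops in STOPWORDS.items()
    }
    best = max(scores, key=scores.get)
    # Fall back to the default when nothing matched at all.
    return best if scores[best] > 0 else default


def guess_ts_config(text):
    """Pick the (hypothetical) TSearch2 config name for a document."""
    return TS_CONFIG[guess_language(text)]
```

The returned name would then be passed as the configuration argument when building the tsvector. Short documents, or ones mixing languages, will be guessed unreliably with word lists this small.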

Best,
Mark
--
Mark Mitchenall, Standingwave Ltd
(Complete Hosting and Development Services)

Tel/Fax := +44 (0)845 612 0699
Email := mark(at)standingwave(dot)co(dot)uk mark(at)mitchenall(dot)com
Home := http://www.standingwave.co.uk http://www.mitchenall.com

