From: | Mark Mitchenall <mark(at)mitchenall(dot)com> |
---|---|
To: | pgsql-general(at)postgresql(dot)org |
Subject: | Re: TSearch2: Auto identify document language? |
Date: | 2005-12-12 11:48:33 |
Message-ID: | 439D6391.8070006@mitchenall.com |
Views: | Raw Message | Whole Thread | Download mbox | Resend email |
Thread: | |
Lists: | pgsql-general |
On 11/12/05, Hannes Dorbath <light(at)theendofthetunnel(dot)de> wrote:
> Is there a practical way to make a guess what language a document is
> written in and auto magically use the adequate TSearch config? I thought
> of looking up the document's words in various dicts and use the one with
> the most matches.. doesn't matter if performance will be bad.
Is it possible to use something like....
http://odur.let.rug.nl/~vannoord/TextCat/
... from a plPerl script?
Best,
Mark
--
Mark Mitchenall, Standingwave Ltd
(Complete Hosting and Development Services)
Tel/Fax := +44 (0)845 612 0699
Email := mark(at)standingwave(dot)co(dot)uk mark(at)mitchenall(dot)com
Home := http://www.standingwave.co.uk http://www.mitchenall.com
Attachment | Content-Type | Size |
---|---|---|
mark.vcf | text/x-vcard | 193 bytes |
From | Date | Subject | |
---|---|---|---|
Next Message | Richard Huxton | 2005-12-12 12:02:14 | Re: ODBC connection problems! |
Previous Message | A. Kretschmer | 2005-12-12 11:42:46 | Re: postgreSQL 8.0.4 - Windows driver |