| From: | Mark Mitchenall <mark(at)mitchenall(dot)com> |
|---|---|
| To: | pgsql-general(at)postgresql(dot)org |
| Subject: | Re: TSearch2: Auto identify document language? |
| Date: | 2005-12-12 11:48:33 |
| Message-ID: | 439D6391.8070006@mitchenall.com |
| Views: | Whole Thread | Raw Message | Download mbox | Resend email |
| Thread: | |
| Lists: | pgsql-general |
On 11/12/05, Hannes Dorbath <light(at)theendofthetunnel(dot)de> wrote:
> Is there a practical way to make a guess what language a document is
> written in and auto magically use the adequate TSearch config? I thought
> of looking up the document's words in various dicts and use the one with
> the most matches.. doesn't matter if performance will be bad.
Is it possible to use something like....
http://odur.let.rug.nl/~vannoord/TextCat/
... from a plPerl script?
Best,
Mark
--
Mark Mitchenall, Standingwave Ltd
(Complete Hosting and Development Services)
Tel/Fax := +44 (0)845 612 0699
Email := mark(at)standingwave(dot)co(dot)uk mark(at)mitchenall(dot)com
Home := http://www.standingwave.co.uk http://www.mitchenall.com
| Attachment | Content-Type | Size |
|---|---|---|
| mark.vcf | text/x-vcard | 193 bytes |
| From | Date | Subject | |
|---|---|---|---|
| Next Message | Richard Huxton | 2005-12-12 12:02:14 | Re: ODBC connection problems! |
| Previous Message | A. Kretschmer | 2005-12-12 11:42:46 | Re: postgreSQL 8.0.4 - Windows driver |