From: | Pavel Stehule <pavel(dot)stehule(at)gmail(dot)com> |
---|---|
To: | Radek Novotný <radek(dot)novotny(at)mediawork(dot)cz> |
Cc: | pgsql-general(at)postgresql(dot)org |
Subject: | Re: integration of fulltext search in bytea/docs |
Date: | 2009-07-29 15:38:31 |
Message-ID: | 162867790907290838q5cf5a640x241fb14a99d269f2@mail.gmail.com |
Views: | Raw Message | Whole Thread | Download mbox | Resend email |
Thread: | |
Lists: | pgsql-general |
Hello
Dne 29. červenec 2009 16:46 Radek Novotný <radek(dot)novotny(at)mediawork(dot)cz>
napsal(a):
> Hello,
> is there in the roadmap of postgre integration of fulltext searching in
> documents saved in blobs (bytea)?
>
What I know, no. PostgreSQL doesn't know about others binary formats,
so it cannot do it.
> For example linux antiword can export fine text output that can be inserted
> into varchar field.
I understand it well. Three years ago, we used antiword. We stored two
values - bytea - original word doc, and text - antiword output. The
overhead is minimal, and this solution worked very well.
>
> Would be very very nice (postgre users can be proud to be first) to save
> documents into bytea and search that field via to_tsvector, to_tsquery ...
>
It's should be very slow. You have to do repeated transformation.
Pavel Stehule
> --
>
> www.publicstream.cz - vytvořili jsme produkt pro živé přenosy (online
> streaming) a videoarchív ve formátu FLASH.
>
> Bc. Radek Novotný
> jednatel Mediawork group s.r.o.
>
> tel.: +420 724 020 361
> email: radek(dot)novotny(at)mediawork(dot)cz
> http://www.mediawork.cz
>
>
From | Date | Subject | |
---|---|---|---|
Next Message | Jennifer Trey | 2009-07-29 15:40:31 | How do I run PG Tuning Wizard on Linux? |
Previous Message | Tom Lane | 2009-07-29 15:25:35 | Re: OID in $_TD->{new}/$_TD->{old} |