Re: tsearch2 and pdf files

From: "philip johnson" <philip(dot)johnson(at)atempo(dot)com>
To: pgsql-general(at)postgresql(dot)org
Subject: Re: tsearch2 and pdf files
Date: 2006-12-11 23:08:46
Message-ID: 2590.82.230.159.163.1165878526.squirrel@webmail.atempo.com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-general

>> 1. Convert PDF to file with e.g xpdf
>> 2. Insert parsed text to a table of your choice.
>> 3. Make vectors from the text.
>
> Actually, if you're not going to use the headline() function, you cna
> just store it directly in a vector, cutting down on the size
> requirements.
What size requirements ?

> Just insert to the to_tsvector() result. The full text is
> required for headline() though, so you can't cheat on that.
>
> //Magnus
>
> ---------------------------(end of broadcast)---------------------------
> TIP 6: explain analyze is your friend
>

In response to

Responses

Browse pgsql-general by date

  From Date Subject
Next Message Scott Marlowe 2006-12-11 23:32:12 Re: forcing compression of text field
Previous Message SCassidy 2006-12-11 23:05:54 Re: shell script to populate array values