Re: Replacement for Oracle Text

From: s d <daku(dot)sandor(at)gmail(dot)com>
To: Postgresql General <pgsql-general(at)postgresql(dot)org>
Subject: Re: Replacement for Oracle Text
Date: 2016-02-19 13:49:16
Message-ID: CAKyoTgaat6uSofWmhZaCnDcCrHBcr+vHpyw+7dCs6dbW3pYwQA@mail.gmail.com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-general

On 19 February 2016 at 14:19, Bruce Momjian <bruce(at)momjian(dot)us> wrote:

> On Fri, Feb 19, 2016 at 11:53:26AM +0000, Simon Riggs wrote:
> > On 19 February 2016 at 11:46, Thomas Kellerer <spam_eater(at)gmx(dot)net>
> wrote:
> >
> > Daniel Westermann schrieb am 19.02.2016 um 12:41:
> > >>>> if I'd need to implement/replace Oracle Text (ww.oracle.com/
> > technetwork/testcontent/index-098492.html).
> > >>>>> What choices do I have in PostgreSQL (9.5+) ?
> > >
> > >>Postgres also has a full text search (which I find much easier to
> use
> > than Oracle's):
> > >>
> > >>http://www.postgresql.org/docs/current/static/textsearch.html
> > >
> > > Yes, i have seen this. Can this be used to index and search binary
> > documents, e.g. pdf ?
> >
> > Ah, no. That's not possible
> >
> >
> > ...not possible, Yet.
> >
> > PostgreSQL grows by adding the features people need and its changing
> rapidly.
>
> I wonder if PLPerl could be used to extract the words from a PDF
> document and create a tsvector column from it.
>
>
I don't know about PLPerl(I'm pretty sure it could be used for this
purpose, though.). On the other hand I've written code for this in Python
which should be easy to adapt for PLPython, if necessary.

Ezt az e-mailt egy Avast védelemmel rendelkező, vírusmentes számítógépről
küldték.
www.avast.com <https://www.avast.com/sig-email>
<#DDB4FAA8-2DD7-40BB-A1B8-4E2AA1F9FDF2>

In response to

Responses

Browse pgsql-general by date

  From Date Subject
Next Message Bruce Momjian 2016-02-19 13:54:02 Re: Replacement for Oracle Text
Previous Message Bruce Momjian 2016-02-19 13:19:52 Re: Replacement for Oracle Text