Re: electronic-izing unicode texts

From: Richard Huxton <dev(at)archonet(dot)com>
To: "A(dot) Cropi" <cropister(at)gmail(dot)com>
Cc: pgsql-general(at)postgresql(dot)org
Subject: Re: electronic-izing unicode texts
Date: 2005-04-21 07:16:17
Message-ID: 42675341.7030107@archonet.com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-general

A. Cropi wrote:
> my objectives: (1) allow users to make query through the web (i guess
> i will do this via PHP interacting with the postgresql)
>
> my questions are: (1) is it reasonable to put the bookcontent into the
> CONTENT column? (2) the content of the book can be very long (some of
> them have nearly 1 milloin words), so, what kind of considerations
> should i be making? (3) how should i design something like this? there
> must be someone outthere that has done somethign similar to this.. if
> so, please share your experiences.

You might be better off with a web-indexing package.
http://freshmeat.net/search/?q=web+indexing&section=projects

Since you're not structuring the content of the book, most of the
advantages of a RDBMS don't apply. If you're going to treat it as text,
just use one of the text indexing systems above.

I would convert each book into one or more web-pages (perhaps one page
per section/chapter) and then use htdig or swish.

--
Richard Huxton
Archonet Ltd

In response to

Browse pgsql-general by date

  From Date Subject
Next Message Daniel Schuchardt 2005-04-21 08:20:18 Re: Windows install/uninstall as a "service"
Previous Message Alexandre 2005-04-21 07:02:11 Re: Simplified (a-la [G|N]DBM) DB access