From: | "David G(dot) Johnston" <david(dot)g(dot)johnston(at)gmail(dot)com> |
---|---|
To: | Bruce Momjian <bruce(at)momjian(dot)us> |
Cc: | Marcelo Zabani <mzabani(at)gmail(dot)com>, Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>, "pgsql-hackers(at)postgresql(dot)org" <pgsql-hackers(at)postgresql(dot)org> |
Subject: | Re: Html parsing and inline elements |
Date: | 2016-04-29 21:07:14 |
Message-ID: | CAKFQuwad9tVwK6qdANw2eCgi5SbdfaG4MbKFhPcZsqR_jy4t-w@mail.gmail.com |
Views: | Raw Message | Whole Thread | Download mbox | Resend email |
Thread: | |
Lists: | pgsql-hackers |
On Fri, Apr 29, 2016 at 1:47 PM, Bruce Momjian <bruce(at)momjian(dot)us> wrote:
> On Wed, Apr 13, 2016 at 12:57:19PM -0300, Marcelo Zabani wrote:
> > Hi, Tom,
> >
> > You're right, I don't think one can argue that the default parser should
> know
> > HTML.
> > How about your suggestion of there being an HTML parser, is it feasible?
> I ask
> > this because I think that a lot of people store HTML documents these
> days, and
> > although there probably aren't lots of HTML with words written along
> multiple
> > inline elements, it would certainly be nice to have a proper parser for
> these
> > use cases.
> >
> > What do you think?
>
> It sounds useful.
>
It sounds like an external project/extension...
David J.
From | Date | Subject | |
---|---|---|---|
Next Message | Alvaro Herrera | 2016-04-29 21:23:40 | Re: [COMMITTERS] pgsql: Support building with Visual Studio 2015 |
Previous Message | Andrew Dunstan | 2016-04-29 21:06:52 | Re: Add jsonb_compact(...) for whitespace-free jsonb to text |