Re: Html parsing and inline elements - Mailing list pgsql-hackers

From David G. Johnston
Subject Re: Html parsing and inline elements
Date
Msg-id CAKFQuwad9tVwK6qdANw2eCgi5SbdfaG4MbKFhPcZsqR_jy4t-w@mail.gmail.com
In response to Re: Html parsing and inline elements  (Bruce Momjian <bruce@momjian.us>)
List pgsql-hackers
On Fri, Apr 29, 2016 at 1:47 PM, Bruce Momjian <bruce@momjian.us> wrote:
> On Wed, Apr 13, 2016 at 12:57:19PM -0300, Marcelo Zabani wrote:
> > Hi, Tom,
> >
> > You're right, I don't think one can argue that the default parser should know
> > HTML.
> > How about your suggestion of there being an HTML parser, is it feasible? I ask
> > this because I think that a lot of people store HTML documents these days, and
> > although there probably aren't lots of HTML with words written along multiple
> > inline elements, it would certainly be nice to have a proper parser for these
> > use cases.
> >
> > What do you think?
>
> It sounds useful.

It sounds like an external project/extension...

David J.
 
