Home > mailing lists

Re: Html parsing and inline elements - Mailing list pgsql-hackers

From	David G. Johnston
Subject	Re: Html parsing and inline elements
Date	April 29, 2016 21:07:20
Msg-id	CAKFQuwad9tVwK6qdANw2eCgi5SbdfaG4MbKFhPcZsqR_jy4t-w@mail.gmail.com Whole thread Raw
In response to	Re: Html parsing and inline elements (Bruce Momjian <bruce@momjian.us>)
List	pgsql-hackers

Tree view

On Fri, Apr 29, 2016 at 1:47 PM, Bruce Momjian <bruce@momjian.us> wrote:

On Wed, Apr 13, 2016 at 12:57:19PM -0300, Marcelo Zabani wrote:
> Hi, Tom,
>
> You're right, I don't think one can argue that the default parser should know
> HTML.
> How about your suggestion of there being an HTML parser, is it feasible? I ask
> this because I think that a lot of people store HTML documents these days, and
> although there probably aren't lots of HTML with words written along multiple
> inline elements, it would certainly be nice to have a proper parser for these
> use cases.
>
> What do you think?

It sounds useful.

It sounds like an external project/extension...

David J.

pgsql-hackers by date:

From: Andrew Dunstan
Date: 29 April 2016, 21:07:06
Subject: Re: Add jsonb_compact(...) for whitespace-free jsonb to text

From: Alvaro Herrera
Date: 29 April 2016, 21:23:54
Subject: Re: [COMMITTERS] pgsql: Support building with Visual Studio 2015

Re: Html parsing and inline elements - Mailing list pgsql-hackers

Previous

Next