Re: BUG #18274: Error 'invalid XML content' - Mailing list pgsql-bugs

From Tom Lane
Subject Re: BUG #18274: Error 'invalid XML content'
Msg-id 696008.1706242495@sss.pgh.pa.us
In response to Re: BUG #18274: Error 'invalid XML content'  (Michael Paquier <michael@paquier.xyz>)
List pgsql-bugs
Michael Paquier <michael@paquier.xyz> writes:
> On Thu, Jan 25, 2024 at 03:12:07PM +0300, Dmitry Koval wrote:
>> I agree that reverting the patch is a good idea if there are concerns about
>> server resources (XML is used by few users, and even fewer users
>> need to parse elements larger than 10MB).
>> For such users it is better to create a custom PostgreSQL build.

> And done with f2743a7d70e7.

Related to this: I just read some interesting things in libxml2 2.12's
release notes:

    Most of the known issues leading to quadratic behavior in the XML parser
    were fixed. Internal hash tables were rewritten to reduce memory
    consumption.

    A new API function xmlCtxtSetMaxAmplification was added to allow parsing
    of files that would otherwise trigger the billion laughs protection.

If we detect that this new function is available and use it, could we
accept larger inputs than we have historically?  I don't have a
whole lot of faith here, but perhaps this is worth investigating.
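
As a rough illustration of "see whether the function is available and use it", here is a minimal sketch at the C level. It assumes a compile-time check on LIBXML_VERSION (2.12.0 reports 21200) rather than a configure probe. The names sketch_has_max_amplification, sketch_set_max_amplification, SKETCH_HAVE_LIBXML2 and SKETCH_MIN_AMPL_VERSION are hypothetical and are not taken from PostgreSQL or libxml2. Only xmlCtxtSetMaxAmplification itself is the real libxml2 call.

```c
#include <assert.h>

/* Pull in libxml2 only if its headers are on the include path, so this
 * sketch still compiles (without the wrapper) where libxml2 is absent. */
#if defined(__has_include)
#if __has_include(<libxml/parser.h>)
#include <libxml/parser.h>      /* also defines LIBXML_VERSION */
#define SKETCH_HAVE_LIBXML2 1   /* hypothetical macro, for this sketch only */
#endif
#endif

/* xmlCtxtSetMaxAmplification first appeared in libxml2 2.12.0, whose
 * LIBXML_VERSION is 21200 (major*10000 + minor*100 + micro). */
#define SKETCH_MIN_AMPL_VERSION 21200

/* Pure helper: does this libxml2 version number provide the new API? */
static inline int
sketch_has_max_amplification(int libxml_version)
{
    return libxml_version >= SKETCH_MIN_AMPL_VERSION;
}

#ifdef SKETCH_HAVE_LIBXML2
/*
 * Hypothetical wrapper: raise the amplification limit on a parser context
 * when the library is new enough.  Returns 1 if the limit was changed, or
 * 0 if the call is unavailable and the library default stays in force.
 */
static inline int
sketch_set_max_amplification(xmlParserCtxtPtr ctxt, unsigned max_ampl)
{
    assert(ctxt != NULL);
#if LIBXML_VERSION >= SKETCH_MIN_AMPL_VERSION
    xmlCtxtSetMaxAmplification(ctxt, max_ampl);
    return 1;
#else
    (void) max_ampl;            /* older libxml2: nothing to adjust */
    return 0;
#endif
}
#endif
```

A caller would create its parser context as usual and invoke the wrapper before parsing. Which factor to pass, and whether raising it is safe for untrusted input, is exactly the open question here, so the sketch deliberately leaves that value to the caller.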

(BTW, 2.12 has created some annoying API breaks, which seems to be why
buildfarm member caiman is failing.  We have some work to do there in any case.)

            regards, tom lane


