Re: XML with invalid chars - Mailing list pgsql-hackers

From Andrew Dunstan
Subject Re: XML with invalid chars
Date
Msg-id 4DB868FA.7020708@dunslane.net
Whole thread Raw
In response to Re: XML with invalid chars  (Noah Misch <noah@leadboat.com>)
Responses Re: XML with invalid chars  (Noah Misch <noah@leadboat.com>)
List pgsql-hackers

On 04/26/2011 05:11 PM, Noah Misch wrote:
> On Mon, Apr 25, 2011 at 07:25:02PM -0400, Andrew Dunstan wrote:
>> I came across this today, while helping a customer. The following will
>> happily create a piece of XML with an embedded ^A:
>>
>>     select xmlelement(name foo, null, E'abc\x01def');
>>
>> Now, a ^A is totally forbidden in XML version 1.0, and allowed but only
>> as "" or equivalent in XML version 1.1, and not as a 0x01 byte
>> (see<http://en.wikipedia.org/wiki/XML#Valid_characters>)
>>
>> ISTM this is something we should definitely try to fix ASAP, even if we
>> probably can't backpatch the fix.
> +1.  Given that such a datum breaks dump+reload, it seems risky to do nothing at
> all in the back branches.



Here's a patch along the lines suggested by Peter.

I'm not sure what to do about the back branches and cases where data is
already in databases. This is fairly ugly. Suggestions welcome.

cheers

andrew

Attachment

pgsql-hackers by date:

Previous
From: David Fetter
Date:
Subject: Re: PostgreSQL Core Team
Next
From: Thom Brown
Date:
Subject: Re: PostgreSQL Core Team