Thread: Bug in jsonb minus operator

Bug in jsonb minus operator

From
Peter Geoghegan
Date:
I'm seeing the following problem on the master branch:

postgres=# select '{"foo":5}'::jsonb - 'bar'; -- okay ?column?
------------{"foo": 5}
(1 row)

postgres=# select '{"foo":{"bar":5}}'::jsonb - 'foo'; -- okay?column?
----------{}
(1 row)

postgres=# select '{"foo":{"bar":5}}'::jsonb - 'rion'; -- spurious error
ERROR:  XX000: unknown type of jsonb container to convert
LOCATION:  convertJsonbValue, jsonb_util.c:1430

This is an elog() - a "can't happen" error - due to this restriction
within convertJsonbValue():

/** A JsonbValue passed as val should never have a type of jbvBinary, and* neither should any of its sub-components.
Thosevalues will be produced* by convertJsonbArray and convertJsonbObject, the results of which will* not be passed
backto this function as an argument.*/
 

This call to convertJsonbValue() actually originates from the final
line of the new jsonb_delete() function, here:

#5  0x0000000000877e10 in jsonb_delete (fcinfo=0x160e060) at jsonfuncs.c:3389
3389 PG_RETURN_JSONB(JsonbValueToJsonb(res));

I explored writing a fix for this bug. I went back and forth on it,
but I think that the most promising approach might be to simply teach
convertJsonbValue() to care about jbvBinary-typed JsonbValue
variables. That way, the jsonb_delete() function could actually expect
this to work. I can't remember why I thought it was a good idea to
have convertJsonbValue() reject jbvBinary values, but I believe the
reason was that it simply wasn't necessary. As it says of conversion
from in memory to disk representation (at the top of json_util.c):

/** Turn an in-memory JsonbValue into a Jsonb for on-disk storage.** There isn't a JsonbToJsonbValue(), because
generallywe find it more* convenient to directly iterate through the Jsonb representation and only* really convert
nestedscalar values.  JsonbIteratorNext() does this, so that* clients of the iteration code don't have to directly deal
withthe binary* representation (JsonbDeepContains() is a notable exception, although all* exceptions are internal to
thismodule).  In general, functions that accept* a JsonbValue argument are concerned with the manipulation of scalar
values,*or simple containers of scalar values, where it would be inconvenient to* deal with a great amount of other
state.*/

jsonb_delete() is unusual in that it converts from a JsonbValue back
to the on-disk Jsonb varlena format, but with a nested jbvBinary-typed
value (in the presence of a nested object or array) -- it seems like
it wouldn't be too hard to just roll with that. jsonb_delete() makes
the mistake of not expecting to see jbvBinary values (since it doesn't
always recurse into the json structure). We shouldn't really deal with
jbvBinary-typed values specially from outside specialized utility
routines like JsonbDeepContains() as noted in the above comment,
though (so offhand I don't think we should teach jsonb_delete()
anything new, as that would be a modularity violation). Thoughts?

Note that the existence related operators (that, like the minus
operator should only work at the top level) don't go through
JsonbValue conversion of the lhs value as part of their search at all.
I don't think that their findJsonbValueFromContainer() routine (or a
similar routine) is the right way of approaching this, though - that's
a specialized routine, that doesn't care if an array value (which is,
you'll recall, a key for the purposes of existence checking) appears
once or multiple times. On that topic, I think it's sloppy that "Table
9-41. Additional jsonb Operators" isn't clear about the fact that the
"operator - text" op matches things on the same basis as the existence
operators -- notice how the existence operator notes with emphasis
that it cares about array *string* values only.

Finally, I don't think an operator implementation function that is
jsonb-only belongs anywhere other than jsonb_ops.c (for the same
reason, replacePath() should live in jsonb_util.c). And I'm
disappointed that the jsonb tests can no longer be run atomically with
'make installcheck TESTS="jsonb"' - I liked being able to do that.
-- 
Peter Geoghegan



Re: Bug in jsonb minus operator

From
Andrew Dunstan
Date:
On 05/16/2015 08:04 PM, Peter Geoghegan wrote:
> I'm seeing the following problem on the master branch:
>
> postgres=# select '{"foo":5}'::jsonb - 'bar'; -- okay
>    ?column?
> ------------
>   {"foo": 5}
> (1 row)
>
> postgres=# select '{"foo":{"bar":5}}'::jsonb - 'foo'; -- okay
>   ?column?
> ----------
>   {}
> (1 row)
>
> postgres=# select '{"foo":{"bar":5}}'::jsonb - 'rion'; -- spurious error
> ERROR:  XX000: unknown type of jsonb container to convert
> LOCATION:  convertJsonbValue, jsonb_util.c:1430
>
> This is an elog() - a "can't happen" error - due to this restriction
> within convertJsonbValue():
>
> /*
>   * A JsonbValue passed as val should never have a type of jbvBinary, and
>   * neither should any of its sub-components. Those values will be produced
>   * by convertJsonbArray and convertJsonbObject, the results of which will
>   * not be passed back to this function as an argument.
>   */
>
> This call to convertJsonbValue() actually originates from the final
> line of the new jsonb_delete() function, here:
>
> #5  0x0000000000877e10 in jsonb_delete (fcinfo=0x160e060) at jsonfuncs.c:3389
> 3389 PG_RETURN_JSONB(JsonbValueToJsonb(res));
>
> I explored writing a fix for this bug. I went back and forth on it,
> but I think that the most promising approach might be to simply teach
> convertJsonbValue() to care about jbvBinary-typed JsonbValue
> variables. That way, the jsonb_delete() function could actually expect
> this to work. I can't remember why I thought it was a good idea to
> have convertJsonbValue() reject jbvBinary values, but I believe the
> reason was that it simply wasn't necessary.


Sure. I thought we'd covered this but it's possible that we didn't, or 
that it got rebroken. There have been complaints about the limitation on 
values containing jbvBinary, so let's just remove it if that can be done 
simply, as it seems to be a not uncommonly encountered problem.

Are you going to submit a patch for that?


> jsonb_delete() is unusual in that it converts from a JsonbValue back
> to the on-disk Jsonb varlena format, but with a nested jbvBinary-typed
> value (in the presence of a nested object or array) -- it seems like
> it wouldn't be too hard to just roll with that. jsonb_delete() makes
> the mistake of not expecting to see jbvBinary values (since it doesn't
> always recurse into the json structure). We shouldn't really deal with
> jbvBinary-typed values specially from outside specialized utility
> routines like JsonbDeepContains() as noted in the above comment,
> though (so offhand I don't think we should teach jsonb_delete()
> anything new, as that would be a modularity violation). Thoughts?


Assuming the above fix we won't have to teach it anything new, right?

>
> Note that the existence related operators (that, like the minus
> operator should only work at the top level) don't go through
> JsonbValue conversion of the lhs value as part of their search at all.
> I don't think that their findJsonbValueFromContainer() routine (or a
> similar routine) is the right way of approaching this, though - that's
> a specialized routine, that doesn't care if an array value (which is,
> you'll recall, a key for the purposes of existence checking) appears
> once or multiple times.

Seems reasonable.

> On that topic, I think it's sloppy that "Table
> 9-41. Additional jsonb Operators" isn't clear about the fact that the
> "operator - text" op matches things on the same basis as the existence
> operators -- notice how the existence operator notes with emphasis
> that it cares about array *string* values only.

OK, please submit a patch that you think clears it up.

>
> Finally, I don't think an operator implementation function that is
> jsonb-only belongs anywhere other than jsonb_ops.c (for the same
> reason, replacePath() should live in jsonb_util.c).


The heading on jsonb_op.c (note spelling) suggests that it's for indexed 
operations. I don't mind rearranging the code layout to something people 
think more logical, but I also don't want to engage in unnecessary code 
churn. I'm not that religious about the organization.


> And I'm
> disappointed that the jsonb tests can no longer be run atomically with
> 'make installcheck TESTS="jsonb"' - I liked being able to do that.

It was always known that some cases would not work with TESTS - that was 
part of Tom's reservation about the whole thing. You can say
   make check-tests TESTS="create_table copy jsonb"


and while create_table will fail on an irrelevant test, copy and jsonb 
will succeed.

cheers

andrew



Re: Bug in jsonb minus operator

From
Peter Geoghegan
Date:
On Sun, May 17, 2015 at 8:04 AM, Andrew Dunstan <andrew@dunslane.net> wrote:
> Sure. I thought we'd covered this but it's possible that we didn't, or that
> it got rebroken. There have been complaints about the limitation on values
> containing jbvBinary, so let's just remove it if that can be done simply, as
> it seems to be a not uncommonly encountered problem.
>
> Are you going to submit a patch for that?

I'll try and come up with something. It's not a trivial fix.

> Assuming the above fix we won't have to teach it anything new, right?

I'm not 100% sure, but I think not.

>> On that topic, I think it's sloppy that "Table
>> 9-41. Additional jsonb Operators" isn't clear about the fact that the
>> "operator - text" op matches things on the same basis as the existence
>> operators -- notice how the existence operator notes with emphasis
>> that it cares about array *string* values only.
>
>
> OK, please submit a patch that you think clears it up.

I was going to better document the new operators anyway, so I'll get
this in passing.

> The heading on jsonb_op.c (note spelling) suggests that it's for indexed
> operations. I don't mind rearranging the code layout to something people
> think more logical, but I also don't want to engage in unnecessary code
> churn. I'm not that religious about the organization.

I see what you mean.

> It was always known that some cases would not work with TESTS - that was
> part of Tom's reservation about the whole thing. You can say

Sure, but you added it, and you mentioned jsonb in the commit message,
before jsonb was even committed to the master branch. :-)

-- 
Peter Geoghegan



Re: Bug in jsonb minus operator

From
Andrew Dunstan
Date:
On 05/17/2015 09:08 PM, Peter Geoghegan wrote:
> On Sun, May 17, 2015 at 8:04 AM, Andrew Dunstan <andrew@dunslane.net> wrote:
>> Sure. I thought we'd covered this but it's possible that we didn't, or that
>> it got rebroken. There have been complaints about the limitation on values
>> containing jbvBinary, so let's just remove it if that can be done simply, as
>> it seems to be a not uncommonly encountered problem.
>>
>> Are you going to submit a patch for that?
> I'll try and come up with something. It's not a trivial fix.



Here's a thought. in pushJsonbValue() the value pushed is called 
scalarVal, and in all our regression tests it is in fact scalar. 
However, the Asserts inside the function tell a different story:
        case WJB_VALUE:            Assert(IsAJsonbScalar(scalarVal) ||                   scalarVal->type == jbvBinary);
          appendValue(*pstate, scalarVal);            break;        case WJB_ELEM:
Assert(IsAJsonbScalar(scalarVal)||                   scalarVal->type == jbvBinary);            appendElement(*pstate,
scalarVal);           break;
 

and indeed it turns out that your delete example does push a value with 
jbvBinary, thus triggering the problem. So, could we deal with that, by, 
say, setting up an iterator for the binary datum and just pushing all of 
its values?

cheers

andrew





Re: Bug in jsonb minus operator

From
Andrew Dunstan
Date:
On 05/18/2015 08:17 AM, Andrew Dunstan wrote:
>
> On 05/17/2015 09:08 PM, Peter Geoghegan wrote:
>> On Sun, May 17, 2015 at 8:04 AM, Andrew Dunstan <andrew@dunslane.net>
>> wrote:
>>> Sure. I thought we'd covered this but it's possible that we didn't,
>>> or that
>>> it got rebroken. There have been complaints about the limitation on
>>> values
>>> containing jbvBinary, so let's just remove it if that can be done
>>> simply, as
>>> it seems to be a not uncommonly encountered problem.
>>>
>>> Are you going to submit a patch for that?
>> I'll try and come up with something. It's not a trivial fix.
>
>
>
> Here's a thought. in pushJsonbValue() the value pushed is called
> scalarVal, and in all our regression tests it is in fact scalar.
> However, the Asserts inside the function tell a different story:
>
>         case WJB_VALUE:
>             Assert(IsAJsonbScalar(scalarVal) ||
>                    scalarVal->type == jbvBinary);
>             appendValue(*pstate, scalarVal);
>             break;
>         case WJB_ELEM:
>             Assert(IsAJsonbScalar(scalarVal) ||
>                    scalarVal->type == jbvBinary);
>             appendElement(*pstate, scalarVal);
>             break;
>
> and indeed it turns out that your delete example does push a value
> with jbvBinary, thus triggering the problem. So, could we deal with
> that, by, say, setting up an iterator for the binary datum and just
> pushing all of its values?
>
>


Here's an patch along those lines. It seems to do the trick, at least
for your test case, and it has the merit of being very small, so small
I'd like to backpatch it - accepting jbvBinary as is in pushJsonbValue
seems like a bug to me.

cheers

andrew



Attachment

Re: Bug in jsonb minus operator

From
Peter Geoghegan
Date:
On Mon, May 18, 2015 at 7:11 AM, Andrew Dunstan <andrew@dunslane.net> wrote:
> Here's an patch along those lines. It seems to do the trick, at least for
> your test case, and it has the merit of being very small, so small I'd like
> to backpatch it - accepting jbvBinary as is in pushJsonbValue seems like a
> bug to me.

Isn't that for the benefit of raw scalar pseudo arrays? The existing
comments above pushJsonbValue() acknowledge such callers.

Why are you passing the skipNested variable to JsonbIteratorNext()
within jsonb_delete()? I'm not seeing a need for that.

-- 
Peter Geoghegan



Re: Bug in jsonb minus operator

From
Andrew Dunstan
Date:
On 05/18/2015 10:52 PM, Peter Geoghegan wrote:
> On Mon, May 18, 2015 at 7:11 AM, Andrew Dunstan <andrew@dunslane.net> wrote:
>> Here's an patch along those lines. It seems to do the trick, at least for
>> your test case, and it has the merit of being very small, so small I'd like
>> to backpatch it - accepting jbvBinary as is in pushJsonbValue seems like a
>> bug to me.
> Isn't that for the benefit of raw scalar pseudo arrays? The existing
> comments above pushJsonbValue() acknowledge such callers.


Umm, no, the raw scalar pseudo arrays are of type jbvArray, not 
jbvBinary. And they are pushed with WJB_BEGIN_ARRAY, not with WJB_ELEM 
or WJB_VALUE, as the comment notes. See this fragment of code from 
JsonbValueToJsonb:
        scalarArray.type = jbvArray;        scalarArray.val.array.rawScalar = true;        scalarArray.val.array.nElems
=1;
 
        pushJsonbValue(&pstate, WJB_BEGIN_ARRAY, &scalarArray);


I tested this by removing the assert test for jbvBinary in the WJB_ELEM 
and WJB_VALUE switch beranches and then running the regression suite. No 
assertion failure was triggered. While that's not guaranteed to be a 
perfect test, it doesn't seem like a bad one. Can you pose a counter 
example where this will break?


>
> Why are you passing the skipNested variable to JsonbIteratorNext()
> within jsonb_delete()? I'm not seeing a need for that.
>

If you don't the logic gets more complex, as you need to keep track of 
what level of the object you are at. The virtue of this change is that 
it will simplify a lot of such processing by removing the unnecessary 
restriction on passing jbvBinary values to pushJsonbValue().

If you have a better fix for the bug you complained about I'll be happy 
to take a look.

cheers

andrew







Re: Bug in jsonb minus operator

From
Andrew Dunstan
Date:
On 05/19/2015 07:11 PM, Andrew Dunstan wrote:
>
> On 05/18/2015 10:52 PM, Peter Geoghegan wrote:
>> On Mon, May 18, 2015 at 7:11 AM, Andrew Dunstan <andrew@dunslane.net>
>> wrote:
>>> Here's an patch along those lines. It seems to do the trick, at
>>> least for
>>> your test case, and it has the merit of being very small, so small
>>> I'd like
>>> to backpatch it - accepting jbvBinary as is in pushJsonbValue seems
>>> like a
>>> bug to me.
>> Isn't that for the benefit of raw scalar pseudo arrays? The existing
>> comments above pushJsonbValue() acknowledge such callers.
>
>
> Umm, no, the raw scalar pseudo arrays are of type jbvArray, not
> jbvBinary. And they are pushed with WJB_BEGIN_ARRAY, not with WJB_ELEM
> or WJB_VALUE, as the comment notes. See this fragment of code from
> JsonbValueToJsonb:
>
>         scalarArray.type = jbvArray;
>         scalarArray.val.array.rawScalar = true;
>         scalarArray.val.array.nElems = 1;
>
>         pushJsonbValue(&pstate, WJB_BEGIN_ARRAY, &scalarArray);
>
>
> I tested this by removing the assert test for jbvBinary in the
> WJB_ELEM and WJB_VALUE switch beranches and then running the
> regression suite. No assertion failure was triggered. While that's not
> guaranteed to be a perfect test, it doesn't seem like a bad one. Can
> you pose a counter example where this will break?
>
>
>>
>> Why are you passing the skipNested variable to JsonbIteratorNext()
>> within jsonb_delete()? I'm not seeing a need for that.
>>
>
> If you don't the logic gets more complex, as you need to keep track of
> what level of the object you are at. The virtue of this change is that
> it will simplify a lot of such processing by removing the unnecessary
> restriction on passing jbvBinary values to pushJsonbValue().
>
> If you have a better fix for the bug you complained about I'll be
> happy to take a look.
>
>




OK, here's a slightly updated version of the patch. The current
behaviour in which pushJsonbValue() happily pushes  a jbvBinary value,
only to have it fail later when we try to get a Jsonb out of it, seems
like a plain bug. The fact that none of our present 9.4 code exercises
that bug is lucky, but since this is an exposed API, and has been the
cause of past complaint, I propose, in the absence of further objections
or comments, to apply it to both master and 9.4, once the new releases
are out of the way.

cheers

andrew



Attachment