Re: JDBC driver - is "getGeneratedKeys()" guaranteed to return the ids in the same order a batch insert was made? - Mailing list pgsql-general

From David G. Johnston
Subject Re: JDBC driver - is "getGeneratedKeys()" guaranteed to return the ids in the same order a batch insert was made?
Date
Msg-id CAKFQuwZesZPEmnycdcwCYMGWJ=TVXqc=OhrwyopSKGfe=c+KSA@mail.gmail.com
Whole thread Raw
In response to Re: JDBC driver - is "getGeneratedKeys()" guaranteed to return the ids in the same order a batch insert was made?  (electrotype <electrotype@gmail.com>)
Responses Re: JDBC driver - is "getGeneratedKeys()" guaranteed to return the ids in the same order a batch insert was made?  (Dave Cramer <davecramer@postgres.rocks>)
Re: JDBC driver - is "getGeneratedKeys()" guaranteed to return the ids in the same order a batch insert was made?  (electrotype <electrotype@gmail.com>)
List pgsql-general
On Wed, Dec 9, 2020 at 1:31 PM electrotype <electrotype@gmail.com> wrote:
Agreed.

However, this isn't really the purview of JDBC - I'm doubting it does anything that would cause the order to be different than what is received, and the batch items are sent and results processed sequentially.

The main question is whether any batch items are inserting multiple records themselves - i.e., RETURNING * is producing multiple results.  Whatever order RETURNING * produces is what the driver will capture - but it isn't responsible for guaranteeing that the order of multiple inserted records going in matches what comes out.  PostgreSQL needs to make that claim.  I don't see where it does (i've sent an email to see if adding such a claim to the documentation is proper).  Done manually one can always do "WITH insert returning SELECT ORDER BY", but it doesn't seem workable for the driver to try and do that when adding the returning clause, which I presume is what is in scope here.

David J.

Thank you, it's appreciated! I'm sure this clarification would help other developers too.

My take is that there is presently no guarantee, and that with current efforts to add parallelism it is quite probable that observation of such non-orderedness is simply a matter of time.  With batching it seems best to combine its use with single inserts in order to avoid this problem.

David J.

pgsql-general by date:

Previous
From: Michael Lewis
Date:
Subject: Re: Extended statistics for correlated columns, row estimates when values are not in MCVs list
Next
From: Atul Kumar
Date:
Subject: Required checkpoints occurs too frequently