Home > mailing lists

Re: Looking for an efficient way to replace efficient NOT IN when landling very large data - Mailing list pgsql-sql

From	Shaozhong SHI
Subject	Re: Looking for an efficient way to replace efficient NOT IN when landling very large data
Date	April 11, 2023 09:44:02
Msg-id	CA+i5Jwa6evCRWGZ8C2oW4jRdXKBu0ot_rjqKCq7sVepBAkCzoA@mail.gmail.com Whole thread
In response to	Re: Looking for an efficient way to replace efficient NOT IN when landling very large data (David Rowley <dgrowleyml@gmail.com>)
List	pgsql-sql

Tree view

On Tue, 11 Apr 2023 at 10:33, David Rowley <dgrowleyml@gmail.com> wrote:

On Tue, 11 Apr 2023 at 21:28, Shaozhong SHI <shishaozhong@gmail.com> wrote:
>
> Select a.id, a.name, b.id, b.name from a_large_table a, definitive b where (a.id, b.name) not in
> (select b.id, b.name from definitive b)
>
> is very slow.
>
> Is there a faster way to do so?

It depends on what your exact requirements are for the NULL handling
that NOT IN provides. Do you need the query to return 0 rows if b.id
and b.name are null? This question is moot if none of the columns or
either table allow NULLs.

If you don't require that, then you'll give the planner more
flexibility to choose a more efficient plan if you use NOT EXISTS
instead.

David

I would like to try out an example of NOT EXISTS way and see how the replacement works.

Regards,

David

pgsql-sql by date:

From: David Rowley
Date: 11 April 2023, 09:33:42
Subject: Re: Looking for an efficient way to replace efficient NOT IN when landling very large data

From: kunwar singh
Date: 20 April 2023, 16:01:02
Subject: PostgreSQL Objects design

Re: Looking for an efficient way to replace efficient NOT IN when landling very large data - Mailing list pgsql-sql

Previous

Next