Thread: Optimizing queries
Hi, I have a query that use a function and some column test to select row. It's in the form of: SELECT * FROM TABLE WHERE TABLE.COLUMN1=something AND TABLE.COLUMN2=somethingelse AND function(TABLE.COLUMN3,TABLE.COLUMN4) > 0; The result of the function does NOT depend only from the table, but also from some other tables. Since it's long to process, I've add some output to see what's going on. I find out that the function process every row even if the row should be rejected as per the first or the second condition. Then , my question is: Is there a way to formulate a query that wont do all the check if it does not need to do it ? Meaning that, if condition1 is false then it wont check condition2 and that way the function will only be called when it's really necessary. Thanks
On Tue, 2006-08-08 at 12:49, Patrice Beliveau wrote: > Hi, > > I have a query that use a function and some column test to select row. > It's in the form of: > > SELECT * FROM TABLE > WHERE TABLE.COLUMN1=something > AND TABLE.COLUMN2=somethingelse > AND function(TABLE.COLUMN3,TABLE.COLUMN4) > 0; > > The result of the function does NOT depend only from the table, but also > from some other tables. > > Since it's long to process, I've add some output to see what's going on. > I find out that the function process every row even if the row should be > rejected as per the first or the second condition. Then , my question > is: Is there a way to formulate a query that wont do all the check if it > does not need to do it ? Meaning that, if condition1 is false then it > wont check condition2 and that way the function will only be called when > it's really necessary. What version of postgresql are you running? It might be better in later versions. The standard fix for such things is to use a subquery... select * from ( select * from table where col1='something' and col2='somethingelse' ) as a where function(a.col3,a.col4) > 0;
Scott Marlowe wrote: > On Tue, 2006-08-08 at 12:49, Patrice Beliveau wrote: > >> Hi, >> >> I have a query that use a function and some column test to select row. >> It's in the form of: >> >> SELECT * FROM TABLE >> WHERE TABLE.COLUMN1=something >> AND TABLE.COLUMN2=somethingelse >> AND function(TABLE.COLUMN3,TABLE.COLUMN4) > 0; >> >> The result of the function does NOT depend only from the table, but also >> from some other tables. >> >> Since it's long to process, I've add some output to see what's going on. >> I find out that the function process every row even if the row should be >> rejected as per the first or the second condition. Then , my question >> is: Is there a way to formulate a query that wont do all the check if it >> does not need to do it ? Meaning that, if condition1 is false then it >> wont check condition2 and that way the function will only be called when >> it's really necessary. >> > > What version of postgresql are you running? It might be better in later > versions. The standard fix for such things is to use a subquery... > > select * from ( > select * from table where > col1='something' > and col2='somethingelse' > ) as a > where function(a.col3,a.col4) > 0; > > ---------------------------(end of broadcast)--------------------------- > TIP 3: Have you checked our extensive FAQ? > > http://www.postgresql.org/docs/faq > > > Thanks for the answer, but it does not work, maybe I did something wrong First, I'm using version 8.1.3 This is what I did: select * from ( select * from sales_order_delivery where sales_order_id in ( select sales_order_id from sales_order where closed=false ) ) as a where outstandingorder(sales_order_id, sales_order_item, date_due) > 0; Some output that I've create look like INFO: so:03616 soi:1 date:1993-12-23 INFO: so:09614 soi:1 date:1998-06-04 which are the three arguments passed to the function "outstandingorder", but sales_order 03616 and 09614 are closed. What's wrong ?? Thanks
Patrice Beliveau <pbeliveau@avior.ca> writes: >>> SELECT * FROM TABLE >>> WHERE TABLE.COLUMN1=something >>> AND TABLE.COLUMN2=somethingelse >>> AND function(TABLE.COLUMN3,TABLE.COLUMN4) > 0; > I find out that the function process every row even if the row should be > rejected as per the first or the second condition. > ... I'm using version 8.1.3 PG 8.1 will not reorder WHERE clauses for a single table unless it has some specific reason to do so (and AFAICT no version back to 7.0 or so has done so either...) So there's something you are not telling us that is relevant. Let's see the exact table schema (psql \d output is good), the exact query, and EXPLAIN output for that query. regards, tom lane
Tom Lane wrote: > Patrice Beliveau <pbeliveau@avior.ca> writes: > >>>> SELECT * FROM TABLE >>>> WHERE TABLE.COLUMN1=something >>>> AND TABLE.COLUMN2=somethingelse >>>> AND function(TABLE.COLUMN3,TABLE.COLUMN4) > 0; >>>> > > >> I find out that the function process every row even if the row should be >> rejected as per the first or the second condition. >> ... I'm using version 8.1.3 >> > > PG 8.1 will not reorder WHERE clauses for a single table unless it has > some specific reason to do so (and AFAICT no version back to 7.0 or so > has done so either...) So there's something you are not telling us that > is relevant. Let's see the exact table schema (psql \d output is good), > the exact query, and EXPLAIN output for that query. > > regards, tom lane > > ---------------------------(end of broadcast)--------------------------- > TIP 9: In versions below 8.0, the planner will ignore your desire to > choose an index scan if your joining column's datatypes do not > match > > > Hi, here is my query, and the query plan that result explain select * from ( select * from sales_order_delivery where sales_order_id in ( select sales_order_id from sales_order where closed=false ) ) as a where outstandingorder(sales_order_id, sales_order_item, date_due) > 0; QUERY PLAN ---------------------------------------------------------------------------------------------------------------------- Hash IN Join (cost=498.89..8348.38 rows=34612 width=262) Hash Cond: (("outer".sales_order_id)::text = ("inner".sales_order_id)::text) -> Seq Scan on sales_order_delivery (cost=0.00..6465.03 rows=69223 width=262) Filter: (outstandingorder((sales_order_id)::text, (sales_order_item)::text, date_due) > 0::double precision) -> Hash (cost=484.90..484.90 rows=5595 width=32) -> Seq Scan on sales_order (cost=0.00..484.90 rows=5595 width=32) Filter: (NOT closed) (7 rows)
-----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 If subquerys are not working I think you should try to create a view with the subquery. Maybe it will work. Patrice Beliveau wrote: > Tom Lane wrote: >> Patrice Beliveau <pbeliveau@avior.ca> writes: >> >>>>> SELECT * FROM TABLE >>>>> WHERE TABLE.COLUMN1=something >>>>> AND TABLE.COLUMN2=somethingelse >>>>> AND function(TABLE.COLUMN3,TABLE.COLUMN4) > 0; >>>>> >> >> >>> I find out that the function process every row even if the row should >>> be rejected as per the first or the second condition. >>> ... I'm using version 8.1.3 >>> >> >> PG 8.1 will not reorder WHERE clauses for a single table unless it has >> some specific reason to do so (and AFAICT no version back to 7.0 or so >> has done so either...) So there's something you are not telling us that >> is relevant. Let's see the exact table schema (psql \d output is good), >> the exact query, and EXPLAIN output for that query. >> >> regards, tom lane >> >> ---------------------------(end of broadcast)--------------------------- >> TIP 9: In versions below 8.0, the planner will ignore your desire to >> choose an index scan if your joining column's datatypes do not >> match >> >> >> > Hi, > > here is my query, and the query plan that result > > explain select * from ( > select * from sales_order_delivery > where sales_order_id in ( > select sales_order_id from sales_order > where closed=false > ) > ) as a where outstandingorder(sales_order_id, sales_order_item, > date_due) > 0; > > > QUERY PLAN > ---------------------------------------------------------------------------------------------------------------------- > > Hash IN Join (cost=498.89..8348.38 rows=34612 width=262) > Hash Cond: (("outer".sales_order_id)::text = > ("inner".sales_order_id)::text) > -> Seq Scan on sales_order_delivery (cost=0.00..6465.03 rows=69223 > width=262) > Filter: (outstandingorder((sales_order_id)::text, > (sales_order_item)::text, date_due) > 0::double precision) > -> Hash (cost=484.90..484.90 rows=5595 width=32) > -> Seq Scan on sales_order (cost=0.00..484.90 rows=5595 width=32) > Filter: (NOT closed) > (7 rows) > > > ---------------------------(end of broadcast)--------------------------- > TIP 3: Have you checked our extensive FAQ? > > http://www.postgresql.org/docs/faq > > -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.2.2 (GNU/Linux) Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org iD8DBQFE2dMTIo1XmbAXRboRAhbIAJwJGZ+ITP0gl38A3qROrzIeNbTtUwCcDOIW eZ9NJqjL+58gyMfO95jwZSw= =4Zxj -----END PGP SIGNATURE-----
I've create a view, same query plan (some number vary a bit, but nothing significant) and same result, closed sales_order are processed Ruben Rubio wrote: > -----BEGIN PGP SIGNED MESSAGE----- > Hash: SHA1 > > If subquerys are not working I think you should try to create a view > with the subquery. > > Maybe it will work. > > Patrice Beliveau wrote: > >> Tom Lane wrote: >> >>> Patrice Beliveau <pbeliveau@avior.ca> writes: >>> >>> >>>>>> SELECT * FROM TABLE >>>>>> WHERE TABLE.COLUMN1=something >>>>>> AND TABLE.COLUMN2=somethingelse >>>>>> AND function(TABLE.COLUMN3,TABLE.COLUMN4) > 0; >>>>>> >>>>>> >>> >>> >>>> I find out that the function process every row even if the row should >>>> be rejected as per the first or the second condition. >>>> ... I'm using version 8.1.3 >>>> >>>> >>> PG 8.1 will not reorder WHERE clauses for a single table unless it has >>> some specific reason to do so (and AFAICT no version back to 7.0 or so >>> has done so either...) So there's something you are not telling us that >>> is relevant. Let's see the exact table schema (psql \d output is good), >>> the exact query, and EXPLAIN output for that query. >>> >>> regards, tom lane >>> >>> ---------------------------(end of broadcast)--------------------------- >>> TIP 9: In versions below 8.0, the planner will ignore your desire to >>> choose an index scan if your joining column's datatypes do not >>> match >>> >>> >>> >>> >> Hi, >> >> here is my query, and the query plan that result >> >> explain select * from ( >> select * from sales_order_delivery >> where sales_order_id in ( >> select sales_order_id from sales_order >> where closed=false >> ) >> ) as a where outstandingorder(sales_order_id, sales_order_item, >> date_due) > 0; >> >> >> QUERY PLAN >> ---------------------------------------------------------------------------------------------------------------------- >> >> Hash IN Join (cost=498.89..8348.38 rows=34612 width=262) >> Hash Cond: (("outer".sales_order_id)::text = >> ("inner".sales_order_id)::text) >> -> Seq Scan on sales_order_delivery (cost=0.00..6465.03 rows=69223 >> width=262) >> Filter: (outstandingorder((sales_order_id)::text, >> (sales_order_item)::text, date_due) > 0::double precision) >> -> Hash (cost=484.90..484.90 rows=5595 width=32) >> -> Seq Scan on sales_order (cost=0.00..484.90 rows=5595 width=32) >> Filter: (NOT closed) >> (7 rows) >> >> >> ---------------------------(end of broadcast)--------------------------- >> TIP 3: Have you checked our extensive FAQ? >> >> http://www.postgresql.org/docs/faq >> >> >> > > -----BEGIN PGP SIGNATURE----- > Version: GnuPG v1.4.2.2 (GNU/Linux) > Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org > > iD8DBQFE2dMTIo1XmbAXRboRAhbIAJwJGZ+ITP0gl38A3qROrzIeNbTtUwCcDOIW > eZ9NJqjL+58gyMfO95jwZSw= > =4Zxj > -----END PGP SIGNATURE----- > > >
Patrice Beliveau <pbeliveau@avior.ca> writes: > Tom Lane wrote: >> PG 8.1 will not reorder WHERE clauses for a single table unless it has >> some specific reason to do so (and AFAICT no version back to 7.0 or so >> has done so either...) So there's something you are not telling us that >> is relevant. > here is my query, and the query plan that result > explain select * from ( > select * from sales_order_delivery > where sales_order_id in ( > select sales_order_id from sales_order > where closed=false > ) > ) as a where outstandingorder(sales_order_id, sales_order_item, > date_due) > 0; So this isn't a simple query, but a join. PG will generally push single-table restrictions down to the individual tables in order to reduce the number of rows that have to be processed at the join. In this case that's not a win, but the planner doesn't know enough about the outstandingorder() function to realize that. I think what you need is an "optimization fence" to prevent the subquery from being flattened: explain select * from ( select * from sales_order_delivery where sales_order_id in ( select sales_order_id from sales_order where closed=false ) OFFSET 0 ) as a where outstandingorder(sales_order_id, sales_order_item, date_due) > 0; Any LIMIT or OFFSET in a subquery prevents WHERE conditions from being pushed down past it (since that might change the results). OFFSET 0 is otherwise a no-op, so that's what people usually use. regards, tom lane
On Tue, 2006-08-08 at 16:42 -0400, Tom Lane wrote: > Patrice Beliveau <pbeliveau@avior.ca> writes: > >>> SELECT * FROM TABLE > >>> WHERE TABLE.COLUMN1=something > >>> AND TABLE.COLUMN2=somethingelse > >>> AND function(TABLE.COLUMN3,TABLE.COLUMN4) > 0; > > > I find out that the function process every row even if the row should be > > rejected as per the first or the second condition. > > ... I'm using version 8.1.3 > > PG 8.1 will not reorder WHERE clauses for a single table unless it has > some specific reason to do so (and AFAICT no version back to 7.0 or so > has done so either...) So there's something you are not telling us that > is relevant. Let's see the exact table schema (psql \d output is good), > the exact query, and EXPLAIN output for that query. Is WHERE clause re-ordering done for 8.2, or is that still a TODO item? (Don't remember seeing that at all). -- Simon Riggs EnterpriseDB http://www.enterprisedb.com