bad plan - Mailing list pgsql-performance

From Gaetano Mendola
Subject bad plan
Date
Msg-id 422D7F23.7000109@bigfoot.com
Whole thread Raw
Responses Re: bad plan
List pgsql-performance
I posted this on hackers, but I had to post it here.


===================================================================================================================================
Hi all,
running a 7.4.5 engine, I'm facing this bad plan:


empdb=# explain analyze SELECT name,url,descr,request_status,url_status,size_mb,estimated_start,request_time_stamp
empdb-#                    FROM v_sc_user_request
empdb-#                    WHERE
empdb-#                         login = 'babinow1'
empdb-#                    LIMIT 10 ;
                                                                                                    QUERY PLAN

------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
 Limit  (cost=1716.38..1716.39 rows=1 width=232) (actual time=52847.239..52847.322 rows=10 loops=1)
   ->  Subquery Scan v_sc_user_request  (cost=1716.38..1716.39 rows=1 width=232) (actual time=52847.234..52847.301
rows=10loops=1) 
         ->  Sort  (cost=1716.38..1716.39 rows=1 width=201) (actual time=52847.219..52847.227 rows=10 loops=1)
               Sort Key: sr.id_sat_request
               ->  Nested Loop Left Join  (cost=1478.82..1716.37 rows=1 width=201) (actual time=3254.483..52847.064
rows=31loops=1) 
                     Join Filter: ("outer".id_package = "inner".id_package)
                     ->  Nested Loop  (cost=493.09..691.55 rows=1 width=193) (actual time=347.665..940.582 rows=31
loops=1)
                           ->  Nested Loop  (cost=493.09..688.49 rows=1 width=40) (actual time=331.446..505.628 rows=31
loops=1)
                                 Join Filter: ("inner".id_user = "outer".id_user)
                                 ->  Index Scan using user_login_login_key on user_login ul  (cost=0.00..4.00 rows=2
width=16)(actual time=12.065..12.071 rows=1 loops=1) 
                                       Index Cond: ((login)::text = 'babinow1'::text)
                                 ->  Materialize  (cost=493.09..531.37 rows=7656 width=28) (actual
time=167.654..481.813rows=8363 loops=1) 
                                       ->  Seq Scan on sat_request sr  (cost=0.00..493.09 rows=7656 width=28) (actual
time=167.644..467.344rows=8363 loops=1) 
                                             Filter: (request_time > (now() - '1 mon'::interval))
                           ->  Index Scan using url_pkey on url u  (cost=0.00..3.05 rows=1 width=161) (actual
time=13.994..14.000rows=1 loops=31) 
                                 Index Cond: ("outer".id_url = u.id_url)
                     ->  Subquery Scan vsp  (cost=985.73..1016.53 rows=1103 width=12) (actual time=25.328..1668.754
rows=493loops=31) 
                           ->  Merge Join  (cost=985.73..1011.01 rows=1103 width=130) (actual time=25.321..1666.666
rows=493loops=31) 
                                 Merge Cond: ("outer".id_program = "inner".id_program)
                                 ->  Sort  (cost=20.74..20.97 rows=93 width=19) (actual time=0.385..0.431 rows=47
loops=31)
                                       Sort Key: programs.id_program
                                       ->  Seq Scan on programs  (cost=0.00..17.70 rows=93 width=19) (actual
time=0.022..11.709rows=48 loops=1) 
                                             Filter: (id_program <> 0)
                                 ->  Sort  (cost=964.99..967.75 rows=1102 width=115) (actual time=14.592..15.218
rows=493loops=31) 
                                       Sort Key: sequences.id_program
                                       ->  Merge Join  (cost=696.16..909.31 rows=1102 width=115) (actual
time=79.717..451.495rows=493 loops=1) 
                                             Merge Cond: ("outer".id_package = "inner".id_package)
                                             ->  Merge Left Join  (cost=0.00..186.59 rows=1229 width=103) (actual
time=0.101..366.854rows=1247 loops=1) 
                                                   Merge Cond: ("outer".id_package = "inner".id_package)
                                                   ->  Index Scan using packages_pkey on packages p  (cost=0.00..131.04
rows=1229width=103) (actual time=0.048..163.503 rows=1247 loops=1) 
                                                   ->  Index Scan using package_security_id_package_key on
package_securityps  (cost=0.00..46.83 rows=855 width=4) (actual time=0.022..178.599 rows=879 loops=1) 
                                             ->  Sort  (cost=696.16..705.69 rows=3812 width=16) (actual
time=79.582..79.968rows=493 loops=1) 
                                                   Sort Key: sequences.id_package
                                                   ->  Seq Scan on sequences  (cost=0.00..469.42 rows=3812 width=16)
(actualtime=0.012..78.863 rows=493 loops=1) 
                                                         Filter: (estimated_start IS NOT NULL)
 Total runtime: 52878.516 ms
(36 rows)


Disabling the nestloop then the execution time become more affordable:

empdb=# set enable_nestloop = false;
SET
empdb=# explain analyze SELECT name,url,descr,request_status,url_status,size_mb,estimated_start,request_time_stamp
empdb-#                    FROM v_sc_user_request
empdb-#                    WHERE
empdb-#                         login = 'babinow1'
empdb-#                    LIMIT 10 ;
                                                                                                       QUERY PLAN

------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
 Limit  (cost=4467.64..4467.65 rows=1 width=232) (actual time=7091.233..7091.289 rows=10 loops=1)
   ->  Subquery Scan v_sc_user_request  (cost=4467.64..4467.65 rows=1 width=232) (actual time=7091.228..7091.272
rows=10loops=1) 
         ->  Sort  (cost=4467.64..4467.64 rows=1 width=201) (actual time=7091.216..7091.221 rows=10 loops=1)
               Sort Key: sr.id_sat_request
               ->  Merge Left Join  (cost=4462.07..4467.63 rows=1 width=201) (actual time=6377.732..7091.067 rows=31
loops=1)
                     Merge Cond: ("outer".id_package = "inner".id_package)
                     ->  Sort  (cost=3389.81..3389.81 rows=1 width=193) (actual time=1338.759..1338.814 rows=31
loops=1)
                           Sort Key: sr.id_package
                           ->  Merge Join  (cost=3285.05..3389.80 rows=1 width=193) (actual time=1318.877..1338.651
rows=31loops=1) 
                                 Merge Cond: ("outer".id_url = "inner".id_url)
                                 ->  Sort  (cost=1029.26..1029.26 rows=1 width=40) (actual time=703.085..703.113
rows=31loops=1) 
                                       Sort Key: sr.id_url
                                       ->  Merge Join  (cost=991.00..1029.25 rows=1 width=40) (actual
time=702.740..702.984rows=31 loops=1) 
                                             Merge Cond: ("outer".id_user = "inner".id_user)
                                             ->  Sort  (cost=986.99..1006.13 rows=7656 width=28) (actual
time=648.559..655.302rows=8041 loops=1) 
                                                   Sort Key: sr.id_user
                                                   ->  Seq Scan on sat_request sr  (cost=0.00..493.09 rows=7656
width=28)(actual time=201.968..614.631 rows=8363 loops=1) 
                                                         Filter: (request_time > (now() - '1 mon'::interval))
                                             ->  Sort  (cost=4.01..4.02 rows=2 width=16) (actual time=35.252..35.282
rows=1loops=1) 
                                                   Sort Key: ul.id_user
                                                   ->  Index Scan using user_login_login_key on user_login ul
(cost=0.00..4.00rows=2 width=16) (actual time=35.214..35.221 rows=1 loops=1) 
                                                         Index Cond: ((login)::text = 'babinow1'::text)
                                 ->  Sort  (cost=2255.79..2308.95 rows=21264 width=161) (actual time=587.664..602.490
rows=21250loops=1) 
                                       Sort Key: u.id_url
                                       ->  Seq Scan on url u  (cost=0.00..727.32 rows=21264 width=161) (actual
time=0.026..418.586rows=21264 loops=1) 
                     ->  Sort  (cost=1072.27..1075.03 rows=1103 width=12) (actual time=5015.761..5016.092 rows=493
loops=1)
                           Sort Key: vsp.id_package
                           ->  Subquery Scan vsp  (cost=985.73..1016.53 rows=1103 width=12) (actual
time=898.876..5014.570rows=494 loops=1) 
                                 ->  Merge Join  (cost=985.73..1011.01 rows=1103 width=130) (actual
time=898.869..5011.954rows=494 loops=1) 
                                       Merge Cond: ("outer".id_program = "inner".id_program)
                                       ->  Sort  (cost=20.74..20.97 rows=93 width=19) (actual time=29.669..29.708
rows=47loops=1) 
                                             Sort Key: programs.id_program
                                             ->  Seq Scan on programs  (cost=0.00..17.70 rows=93 width=19) (actual
time=0.035..29.525rows=48 loops=1) 
                                                   Filter: (id_program <> 0)
                                       ->  Sort  (cost=964.99..967.75 rows=1102 width=115) (actual
time=868.619..869.286rows=494 loops=1) 
                                             Sort Key: sequences.id_program
                                             ->  Merge Join  (cost=696.16..909.31 rows=1102 width=115) (actual
time=44.820..867.649rows=494 loops=1) 
                                                   Merge Cond: ("outer".id_package = "inner".id_package)
                                                   ->  Merge Left Join  (cost=0.00..186.59 rows=1229 width=103) (actual
time=19.563..835.352rows=1248 loops=1) 
                                                         Merge Cond: ("outer".id_package = "inner".id_package)
                                                         ->  Index Scan using packages_pkey on packages p
(cost=0.00..131.04rows=1229 width=103) (actual time=12.796..457.520 rows=1248 loops=1) 
                                                         ->  Index Scan using package_security_id_package_key on
package_securityps  (cost=0.00..46.83 rows=855 width=4) (actual time=6.703..283.944 rows=879 loops=1) 
                                                   ->  Sort  (cost=696.16..705.69 rows=3812 width=16) (actual
time=25.222..25.705rows=494 loops=1) 
                                                         Sort Key: sequences.id_package
                                                         ->  Seq Scan on sequences  (cost=0.00..469.42 rows=3812
width=16)(actual time=0.017..24.412 rows=494 loops=1) 
                                                               Filter: (estimated_start IS NOT NULL)
 Total runtime: 7104.946 ms
(47 rows)



May I know wich parameter may I tune in order to avoid to "disable" the nested loop ?

===================================================================================================================================

I tried to reduce the runtime cost adding a new column on sat_request ( expired boolean ) in order to
have a better row extimation ( I used a partial index ) but nothing changed.


I finally was able to reduce the cost putting ( just for this query ):
set cpu_tuple_cost = 0.07

is it a resonable value ?


empdb=# set cpu_tuple_cost = 0.07;
SET
empdb=# explain analyze select *
empdb-# from v_sc_user_request
empdb-# where login = 'babinow1';
                                                                                       QUERY PLAN

----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
 Subquery Scan v_sc_user_request  (cost=1978.23..1978.30 rows=1 width=364) (actual time=1612.719..1613.064 rows=31
loops=1)
   ->  Sort  (cost=1978.23..1978.23 rows=1 width=201) (actual time=1612.700..1612.728 rows=31 loops=1)
         Sort Key: sr.id_sat_request
         ->  Merge Left Join  (cost=1974.05..1978.22 rows=1 width=201) (actual time=1537.343..1612.565 rows=31 loops=1)
               Merge Cond: ("outer".id_package = "inner".id_package)
               ->  Sort  (cost=887.92..887.93 rows=1 width=193) (actual time=475.924..476.020 rows=31 loops=1)
                     Sort Key: sr.id_package
                     ->  Nested Loop  (cost=4.07..887.91 rows=1 width=193) (actual time=145.782..475.851 rows=31
loops=1)
                           ->  Hash Join  (cost=4.07..884.65 rows=1 width=40) (actual time=139.816..464.678 rows=31
loops=1)
                                 Hash Cond: ("outer".id_user = "inner".id_user)
                                 ->  Index Scan using idx_sat_request_expired on sat_request sr  (cost=0.00..838.69
rows=8363width=28) (actual time=19.696..443.702 rows=8460 loops=1) 
                                       Index Cond: (expired = false)
                                 ->  Hash  (cost=4.07..4.07 rows=2 width=16) (actual time=11.779..11.779 rows=0
loops=1)
                                       ->  Index Scan using user_login_login_key on user_login ul  (cost=0.00..4.07
rows=2width=16) (actual time=11.725..11.732 rows=1 loops=1) 
                                             Index Cond: ((login)::text = 'babinow1'::text)
                           ->  Index Scan using url_pkey on url u  (cost=0.00..3.19 rows=1 width=161) (actual
time=0.345..0.347rows=1 loops=31) 
                                 Index Cond: ("outer".id_url = u.id_url)
               ->  Sort  (cost=1086.13..1088.16 rows=813 width=12) (actual time=1060.374..1060.622 rows=390 loops=1)
                     Sort Key: vsp.id_package
                     ->  Subquery Scan vsp  (cost=676.18..1046.83 rows=813 width=12) (actual time=625.645..1059.388
rows=480loops=1) 
                           ->  Hash Join  (cost=676.18..989.92 rows=813 width=131) (actual time=625.637..1057.105
rows=480loops=1) 
                                 Hash Cond: ("outer".id_package = "inner".id_package)
                                 ->  Hash Left Join  (cost=79.67..302.87 rows=1341 width=104) (actual
time=4.336..18.549rows=1342 loops=1) 
                                       Hash Cond: ("outer".id_package = "inner".id_package)
                                       ->  Seq Scan on packages p  (cost=0.00..145.87 rows=1341 width=104) (actual
time=0.007..3.357rows=1342 loops=1) 
                                       ->  Hash  (cost=77.27..77.27 rows=961 width=4) (actual time=3.685..3.685 rows=0
loops=1)
                                             ->  Seq Scan on package_security ps  (cost=0.00..77.27 rows=961 width=4)
(actualtime=0.016..2.175 rows=974 loops=1) 
                                 ->  Hash  (cost=594.48..594.48 rows=813 width=31) (actual time=620.397..620.397 rows=0
loops=1)
                                       ->  Hash Join  (cost=20.60..594.48 rows=813 width=31) (actual
time=38.307..619.406rows=480 loops=1) 
                                             Hash Cond: ("outer".id_program = "inner".id_program)
                                             ->  Seq Scan on sequences  (cost=0.00..512.82 rows=830 width=16) (actual
time=16.858..595.262rows=480 loops=1) 
                                                   Filter: (estimated_start IS NOT NULL)
                                             ->  Hash  (cost=20.48..20.48 rows=47 width=19) (actual time=21.093..21.093
rows=0loops=1) 
                                                   ->  Seq Scan on programs  (cost=0.00..20.48 rows=47 width=19)
(actualtime=9.369..20.980 rows=48 loops=1) 
                                                         Filter: (id_program <> 0)
 Total runtime: 1614.123 ms
(36 rows)



Regards
Gaetano Mendola









pgsql-performance by date:

Previous
From: "Magnus Hagander"
Date:
Subject: Re: [pgsql-hackers-win32] Help with tuning this query (with
Next
From: Richard Huxton
Date:
Subject: Re: bad plan