Re: again on index usage - Mailing list pgsql-hackers

From: Daniel Kalchev
Subject: Re: again on index usage
Date:
Msg-id: 200201090745.JAA15141@dcave.digsys.bg
In response to: Re: again on index usage (Tom Lane <tgl@sss.pgh.pa.us>)
Responses: Re: again on index usage (Tom Lane <tgl@sss.pgh.pa.us>)
List: pgsql-hackers
>>>Tom Lane said:
> Daniel Kalchev <daniel@digsys.bg> writes:
> > Same result (sorry, should have included this originally):
>
> > Aggregate  (cost=47721.72..47721.72 rows=1 width=8)
> >   ->  Seq Scan on iplog_gate200112  (cost=0.00..47579.54 rows=56873 width=8)
>
> >> If you say "set enable_seqscan to off", does that change the plan?
>
> > Aggregate  (cost=100359.71..100359.71 rows=1 width=8)
> >   ->  Index Scan using iplog_gate200112_ipdate_idx on iplog_gate200112
> > (cost=0.00..100217.52 rows=56873 width=8)
>
> So, what we've got here is a difference of opinion: the planner thinks
> that the seqscan will be faster.  How many rows are actually selected
> by this WHERE clause?  How long does each plan actually take?
>
>             regards, tom lane
 

3-5 minutes with the sequential scan; 10-15 seconds with the index scan. The 
query returns 4062 rows out of ca. 1700000 rows.

With only the datetime constraints (which relate to the index), the number of 
rows is 51764.
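
For reference, a minimal sketch of the comparison being discussed, run from
psql. The real query text is not shown in this message, so the count(*)
aggregate and the ipdate range below are only hypothetical stand-ins for the
actual WHERE clause:

-- Planner's default choice (the Seq Scan plan quoted above). The
-- aggregate and the datetime range are hypothetical; the real query
-- is not shown in this message.
EXPLAIN SELECT count(*)
FROM iplog_gate200112
WHERE ipdate >= '2001-12-01' AND ipdate < '2002-01-01';

-- Disable sequential scans for this session and re-plan the same query;
-- this is what switches the plan to iplog_gate200112_ipdate_idx.
SET enable_seqscan TO off;
EXPLAIN SELECT count(*)
FROM iplog_gate200112
WHERE ipdate >= '2001-12-01' AND ipdate < '2002-01-01';

-- Restore the default afterwards.
SET enable_seqscan TO on;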

In any case, a sequential scan of millions of rows cannot be faster than an 
index scan here. The average number of records per index key is around 25000 - 
perhaps the planner thinks that because the number of tuples in this case is 
higher, it should prefer a sequential scan. I guess the planner would do better 
if these values were scaled against the total number of rows in the table.
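
As an aside (an assumption on my part, not something tested above): if the
cost estimates keep favouring the sequential scan even though the index scan
is an order of magnitude faster in practice, lowering random_page_cost and
refreshing the table's statistics is the usual way to nudge the planner, e.g.:

-- A sketch only: lower the planner's assumed cost of random page reads
-- (the default is 4.0) for this session, then re-check which plan is
-- chosen for the same (hypothetical) query as above.
SET random_page_cost = 2;
EXPLAIN SELECT count(*)
FROM iplog_gate200112
WHERE ipdate >= '2001-12-01' AND ipdate < '2002-01-01';

-- Refreshing the statistics gives the planner up-to-date row estimates
-- to scale against the total row count.
VACUUM ANALYZE iplog_gate200112;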

Daniel



