Home > mailing lists

Re: Index not used on group by - Mailing list pgsql-performance

From	Richard Huxton
Subject	Re: Index not used on group by
Date	September 27, 2005 09:45:32
Msg-id	4339282F.1060900@archonet.com Whole thread Raw
In response to	Re: Index not used on group by (Андрей Репко<repko@sart.must-ipra.com>)
List	pgsql-performance

Tree view

Андрей Репко wrote:
> RH> What happens if you use something like
> RH>    SELECT DISTINCT alias_id FROM ma_data;
> sart_ma=# EXPLAIN ANALYZE SELECT DISTINCT alias_id FROM ma_data;
>                                                           QUERY PLAN
>
>
-------------------------------------------------------------------------------------------------------------------------------
>  Unique  (cost=65262.63..66770.75 rows=32 width=4) (actual time=16780.214..18250.761 rows=32 loops=1)
>    ->  Sort  (cost=65262.63..66016.69 rows=301624 width=4) (actual time=16780.204..17255.129 rows=301624 loops=1)
>          Sort Key: alias_id
>          ->  Seq Scan on ma_data  (cost=0.00..37811.24 rows=301624 width=4) (actual time=6.896..15321.023 rows=301624
loops=1)
>  Total runtime: 18292.542 ms

> sart_ma=# EXPLAIN ANALYZE SELECT alias_id FROM ma_data GROUP BY alias_id;
>                                                        QUERY PLAN
>
-------------------------------------------------------------------------------------------------------------------------
>  HashAggregate  (cost=38565.30..38565.62 rows=32 width=4) (actual time=15990.863..15990.933 rows=32 loops=1)
>    ->  Seq Scan on ma_data  (cost=0.00..37811.24 rows=301624 width=4) (actual time=3.446..14572.141 rows=301624
loops=1)
>  Total runtime: 15991.244 ms

OK - the planner thinks it's doing the right thing, your cost estimates
are way off. If you look back at where you got an index-scan, it's cost
was 1.1 million.
   Index Scan using reference_9_fk on ma_data (cost=0.00..1139526.57

That's way above the numbers for seq-scan+hash/sort, so if the cost
estimate was right PG would be making the right choice. Looks like you
need to check your configuration settings. Have you read:
   http://www.powerpostgresql.com/PerfList
or
   http://www.varlena.com/varlena/GeneralBits/Tidbits/perf.html
--
   Richard Huxton
   Archonet Ltd

pgsql-performance by date:

From: Gnanavel S
Date: 27 September 2005, 09:06:04
Subject: Re: PostgreSQL overall design

From: Bruce Momjian
Date: 27 September 2005, 11:12:50
Subject: Re: PostgreSQL overall design

Re: Index not used on group by - Mailing list pgsql-performance

Previous

Next