Home > mailing lists

Re: select distinct and index usage - Mailing list pgsql-general

From	Gregory Stark
Subject	Re: select distinct and index usage
Date	April 7, 2008 23:57:28
Msg-id	87iqytfcbi.fsf@oxford.xeocode.com Whole thread Raw
In response to	Re: select distinct and index usage ("David Wilson" <david.t.wilson@gmail.com>)
Responses	Re: select distinct and index usage ("David Wilson" <david.t.wilson@gmail.com>)
List	pgsql-general

Tree view

"David Wilson" <david.t.wilson@gmail.com> writes:

> I appreciate the responses so far! I'm used to several minutes for
> some of the complex queries on this DB, but 12.5 minutes for a select
> distinct just seems wrong. :)

You could try changing it to the equivalent GROUP BY query. The planner,
unfortunately, doesn't know they're equivalent and has two separate sets of
plans available. In this case where there are only 4,000 distinct values out
of 75M original records you might find a HashAggregate plan, which the planner
doesn't know can be used for DISTINCT, best. You might have to raise work_mem
before the planner feels a hash will fit.

--
  Gregory Stark
  EnterpriseDB          http://www.enterprisedb.com
  Ask me about EnterpriseDB's RemoteDBA services!

pgsql-general by date:

From: "Joshua D. Drake"
Date: 07 April 2008, 19:26:56
Subject: Re: [pgsql-advocacy] pgus-general now up

From: Mike Ginsburg
Date: 08 April 2008, 00:50:31
Subject: Most Occurring Value

Re: select distinct and index usage - Mailing list pgsql-general

Previous

Next