Home > mailing lists

Re: Select with grouping plan question - Mailing list pgsql-performance

From	Tom Lane
Subject	Re: Select with grouping plan question
Date	November 30, 2005 16:41:28
Msg-id	815.1133383282@sss.pgh.pa.us Whole thread Raw
In response to	Select with grouping plan question ("Brad Might" <bmight@storediq.com>)
List	pgsql-performance

Tree view

"Brad Might" <bmight@storediq.com> writes:
> This seems to me to be an expensive plan and I'm wondering if there's a
> way to improve it or a better way to do what I'm trying to do here (get
> a count of distinct values for each record_id and map that value to the
> entity type) entity_type_id_mapping is 56 rows
> volume_node_entity_data_values is approx 500,000,000 rows vq_record_id
> has approx 11,000,000 different values vq_entity_type is a value in
> entity_type_id_mapping.entity_type

Hmm, what Postgres version is that?  And have you ANALYZEd
entity_type_id_mapping lately?  I'd expect the planner to realize that
there cannot be more than 56 output groups, which ought to lead it to
prefer a hashed aggregate over the sort+group method.  That's what I
get in a test case with a similar query structure, anyway.

If you're stuck on an old PG version, it might help to do the
aggregation first and then join, ie

    select ... from
    (select count(vq_entity_value) as vcount, vq_entity_type
     from data_schema.volume_queue_entities group by vq_entity_type) qe,
    volume_8.entity_type_id_mapping emap
    where qe.vq_entity_type = emap.entity_type;

            regards, tom lane

pgsql-performance by date:

From: "Franklin Haut"
Date: 30 November 2005, 16:39:37
Subject: RES: pg_dump slow

From: Ron
Date: 30 November 2005, 17:05:54
Subject: Re: RES: pg_dump slow

Re: Select with grouping plan question - Mailing list pgsql-performance

Previous

Next