Home > mailing lists

index for group by - Mailing list pgsql-sql

From	Patrick Scharrenberg
Subject	index for group by
Date	July 22, 2008 08:18:38
Msg-id	4885C206.3040800@web.de Whole thread Raw
Responses	Re: index for group by
List	pgsql-sql

Tree view

Hi,

is there a way to speedup "group by" queries with an index?

In particular if I have a table like this:

CREATE TABLE data
(  id1 integer,  id2 integer,  somedata character varying,  ts timestamp with time zone
);

where continously data is logged about "id1" and "id2" into "somedata",
together with the timestamp when it was logged.

So I have multiple rows with the same id1 and id2 but different
timestamp (and data maybe).

At the moment I have ~40.000.000 rows in that table so doing a
SELECT id1, id2 FROM data GROUP BY id1, id2;

takes some time (~10 minutes)
and return about 1.000.000 rows.

I created an index on both colums id1 and id2 (together) which takes
about 800 MB but doesn't speedup things.
In fact it even doesn't seem to be used.

Is there any way to speedup this "group by" or does it seem more likely
that I have a conceptional flaw?

regards
Patrick

pgsql-sql by date:

From: "A. Kretschmer"
Date: 22 July 2008, 06:06:24
Subject: Re: How to Select a Tupl by Nearest Date

From: "A. Kretschmer"
Date: 22 July 2008, 08:27:35
Subject: Re: index for group by

index for group by - Mailing list pgsql-sql

Previous

Next