Re: One table or many tables for data set - Mailing list pgsql-performance

From	Joe Conway
Subject	Re: One table or many tables for data set
Date	July 22, 2003 22:04:52
Msg-id	3F1DDE9D.4010108@joeconway.com
In response to	One table or many tables for data set ("Castle, Lindsay" <lindsay.castle@eds.com>)
List	pgsql-performance

Castle, Lindsay wrote:
> I'm working on a project that has a data set of approximately 6 million rows
> with about 12,000 different elements, each element has 7 columns of data.
>
> I'm wondering what would be faster from a scanning perspective (SELECT
> statements with some calculations) for this type of set up;
>     one table for all the data
>     one table for each data element (12,000 tables)
>     one table per subset of elements (eg all elements that start with
> "a" in a table)
>

I, for one, am having difficulty understanding exactly what your data
looks like, so it's hard to give advice. Maybe some concrete examples of
what you are calling "rows", "elements", and "columns" would help.

Does each of the 6 million rows have 12,000 elements, each with 7 columns? Or
do you mean that, out of 6 million rows, there are 12,000 distinct kinds
of elements?

> Can I do anything with Indexing to help with performance?  I suspect for the
> majority of scans I will need to evaluate an outcome based on 4 or 5 of the
> 7 columns of data.
>

Again, this isn't clear to me -- but maybe I'm just being dense ;-)
Does this mean you expect 4 or 5 items in your WHERE clause?
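
To make the question concrete, here is a rough sketch of what "4 or 5 items
in your WHERE clause" might look like under the single-table reading. It is
only an illustration: the table name (readings), the column names (element,
c1..c7), and the sample values are all made up, and it uses Python's built-in
sqlite3 module as a stand-in so it runs anywhere, not PostgreSQL itself.

```python
import sqlite3

# Hypothetical single-table layout: one row per observation, tagged with
# the element it belongs to. None of these names come from the thread.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE readings ("
    " element TEXT, c1 REAL, c2 REAL, c3 REAL, c4 REAL,"
    " c5 REAL, c6 REAL, c7 REAL)"
)
rows = [
    ("abc", 1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0),
    ("abc", 9.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0),
    ("xyz", 1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0),
]
conn.executemany("INSERT INTO readings VALUES (?, ?, ?, ?, ?, ?, ?, ?)", rows)

# One possible reading of "evaluate an outcome based on 4 or 5 of the
# 7 columns": several column predicates combined in the WHERE clause,
# plus a small calculation in the SELECT list.
query = (
    "SELECT element, c1 + c2 AS total FROM readings"
    " WHERE element = ? AND c1 < ? AND c2 >= ? AND c3 = ? AND c4 > ?"
)
matches = conn.execute(query, ("abc", 5.0, 2.0, 3.0, 0.0)).fetchall()
print(matches)
```

If the real queries look like this, which columns appear in the predicates
(and how selective each one is) would be the useful thing to know before
talking about indexes.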

Joe
