Re: Big wide datasets - Mailing list pgsql-novice

From Jean-Yves F. Barbier
Subject Re: Big wide datasets
Date
Msg-id 20111208142444.3184bcf8@anubis.defcon1
Whole thread Raw
In response to Big wide datasets  (Michael Lush <mjlush@gmail.com>)
List pgsql-novice
On Thu, 8 Dec 2011 13:05:19 +0000
Michael Lush <mjlush@gmail.com> wrote:

> I have dataset with ~10000 columns and ~200000 rows (GWAS data (1)) in the
> form
> sample1, A T, A A, G C, ....
> sampel2, A C, C T, A A, ....
>
> I'd like to take subsets of both columns and rows for analysis

Why do you want to write the subsets to a table instead of reading
1/1, analyse and save the result?
This way you avoid the huge temp table which don't seem necessary.

BTW just in case, in python there's large libraries dedicated to
genetics.

--
BOFH excuse #99:
SIMM crosstalk.

pgsql-novice by date:

Previous
From: Michael Lush
Date:
Subject: Big wide datasets
Next
From: "Robert D. Schnabel"
Date:
Subject: Re: Big wide datasets