Home > mailing lists

Re: Big wide datasets - Mailing list pgsql-novice

From	Jean-Yves F. Barbier
Subject	Re: Big wide datasets
Date	December 8, 2011 09:25:05
Msg-id	20111208142444.3184bcf8@anubis.defcon1 Whole thread Raw
In response to	Big wide datasets (Michael Lush <mjlush@gmail.com>)
List	pgsql-novice

Tree view

On Thu, 8 Dec 2011 13:05:19 +0000
Michael Lush <mjlush@gmail.com> wrote:

> I have dataset with ~10000 columns and ~200000 rows (GWAS data (1)) in the
> form
> sample1, A T, A A, G C, ....
> sampel2, A C, C T, A A, ....
>
> I'd like to take subsets of both columns and rows for analysis

Why do you want to write the subsets to a table instead of reading
1/1, analyse and save the result?
This way you avoid the huge temp table which don't seem necessary.

BTW just in case, in python there's large libraries dedicated to
genetics.

--
BOFH excuse #99:
SIMM crosstalk.

pgsql-novice by date:

From: Michael Lush
Date: 08 December 2011, 09:05:41
Subject: Big wide datasets

From: "Robert D. Schnabel"
Date: 08 December 2011, 09:39:37
Subject: Re: Big wide datasets

Re: Big wide datasets - Mailing list pgsql-novice

Previous

Next