On Thu, 8 Dec 2011 13:05:19 +0000
Michael Lush <mjlush(at)gmail(dot)com> wrote:
> I have a dataset with ~10000 columns and ~200000 rows (GWAS data (1)) in the
> form
> sample1, A T, A A, G C, ....
> sample2, A C, C T, A A, ....
>
> I'd like to take subsets of both columns and rows for analysis
Why do you want to write the subsets to a table instead of reading
the rows one at a time, analysing them and saving only the result?
This way you avoid the huge temp table, which doesn't seem necessary.
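
To make that concrete, here is a rough sketch in Python of reading the
data row by row and keeping only the wanted samples and columns, so no
intermediate table is ever written. It assumes a comma-separated text
file with the sample id in the first field, as in your example; the
function names (stream_subset, allele_counts) and the allele-count
"analysis" are made up for illustration, not taken from any library.

```python
import csv
import io
from collections import Counter


def stream_subset(lines, wanted_samples, wanted_columns):
    """Yield (sample_id, genotypes) for the selected rows and columns only.

    `lines` is any iterable of text lines (an open file works).
    `wanted_columns` holds 0-based genotype column numbers.
    """
    reader = csv.reader(lines, skipinitialspace=True)
    for row in reader:
        if not row:
            continue  # skip blank lines
        sample_id = row[0].strip()
        if sample_id not in wanted_samples:
            continue
        # Field 0 is the sample id, so genotype column k is field k + 1.
        yield sample_id, [row[k + 1].strip() for k in wanted_columns]


def allele_counts(lines, wanted_samples, wanted_columns):
    """Hypothetical analysis: count alleles per selected column."""
    counts = [Counter() for _ in wanted_columns]
    for _sample, genotypes in stream_subset(lines, wanted_samples, wanted_columns):
        for counter, genotype in zip(counts, genotypes):
            counter.update(genotype.split())  # "A T" -> ["A", "T"]
    return counts


# Tiny in-memory stand-in for the real file.
demo = io.StringIO(
    "sample1, A T, A A, G C\n"
    "sample2, A C, C T, A A\n"
    "sample3, T T, C C, G G\n"
)
result = allele_counts(demo, {"sample1", "sample3"}, [0, 2])
```

Only one row is held in memory at a time, so the same loop should cope
with the full ~200000 rows; for the real data you would pass an open
file instead of the StringIO object.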
BTW, just in case: in Python there are large libraries dedicated to
genetics.
--
BOFH excuse #99:
SIMM crosstalk.