General data warehousing questions

From: "Sean Davis" <sdavis2(at)mail(dot)nih(dot)gov>
To: pgsql <pgsql-general(at)postgresql(dot)org>
Subject: General data warehousing questions
Date: 2008-10-06 01:48:15
Message-ID: 264855a00810051848q3aece54dt635ca4c7139f6f1b@mail.gmail.com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-general

I am looking at the prospect of building a data warehouse of genomic
sequence data. The machine that produces the data adds about
300million rows per month in a central fact table and we will
generally want the data to be "online". We don't need instantaneous
queries, but we would be using the data for data mining purposes and
running some "real-time" queries for reporting and research purposes.
I have had the pleasure of working on an Netezza box where this type
of thing is quite standard, but we don't have that access anymore, so
I'm looking for hints on using postgres in a data warehousing/mining
environment. Any suggestions on how DDL, loading, backup, indexing,
or (to a certain extent) hardware?

Thanks,
Sean

Responses

Browse pgsql-general by date

  From Date Subject
Next Message Scott Marlowe 2008-10-06 02:07:00 Re: General data warehousing questions
Previous Message Ricardo Pinho 2008-10-05 23:43:06 GISVM - One Month old - Statistics Report