Re: large database

From: Mihai Popa <mihai(at)lattica(dot)com>
To: Jeff Janes <jeff(dot)janes(at)gmail(dot)com>
Cc: pgsql-general(at)postgresql(dot)org
Subject: Re: large database
Date: 2012-12-11 21:58:34
Message-ID: 1355263114.24819.92.camel@ozzy.lattica.com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-general

On Tue, 2012-12-11 at 10:00 -0800, Jeff Janes wrote:
> On Mon, Dec 10, 2012 at 12:26 PM, Mihai Popa <mihai(at)lattica(dot)com> wrote:
> > Hi,
> >
> > I've recently inherited a project that involves importing a large set of
> > Access mdb files into a Postgres or MySQL database.
> > The process is to export the mdb's to comma separated files than import
> > those into the final database.
> > We are now at the point where the csv files are all created and amount
> > to some 300 GB of data.
>
> Compressed or uncompressed?

uncompressed, but that's not much relief...
and it's 800GB not 300 anymore. I still can't believe the size of this
thing.

> Why did you originally choose MySQL? What has changed that causes you
> to rethink that decision? Does your team have experience with MySQL
> but not with PostgreSQL?

I did not choose it; somebody before me did. I personally have more
experience with Postgres, but not with databases as large as this one
promises to be.

>
> I like PostgreSQL, of course, but if I already had an
> already-functioning app on MySQL I'd be reluctant to change it.

...and I'm not rushing to do it; I was just asking around, maybe there
are known issues with MySQL, or with Postgres for that matter.

> My understanding is that RDS does not support Postgres, so if you go
> that route the decision is already made for you. Or am I wrong here?

That's right, but I could still get an EC2 instance and run my own
Postgres
Or use this: http://www.enterprisedb.com/cloud-database/pricing-amazon

> 1TB of storage sounds desperately small for loading 300GB of csv files.

really? that's good to know; I wouldn't have guessed

> IOPS would mostly depend on how you are using the system, not how large it is.

mostly true

--
Mihai Popa <mihai(at)lattica(dot)com>
Lattica, Inc.

In response to

Responses

Browse pgsql-general by date

  From Date Subject
Next Message Mihai Popa 2012-12-11 22:01:40 Re: large database
Previous Message rektide 2012-12-11 21:48:00 Re: NOTIFY/LISTEN on server, asynchronous processing