Quick Links

Re: Re: Data Warehouse Reevaluation - MySQL vs Postgres --

From:	<simon(at)2ndquadrant(dot)com>
To:	Joe Conway <mail(at)joeconway(dot)com>
Cc:	Chris Browne <cbbrowne(at)acm(dot)org>, <pgsql-performance(at)postgresql(dot)org>
Subject:	Re: Re: Data Warehouse Reevaluation - MySQL vs Postgres --
Date:	2004-09-15 09:10:01
Message-ID:	28292295$109523922141480635ce12a8.90913606@config17.schlund.de
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Thread:
Lists:	pgsql-performance

Joe Conway <mail(at)joeconway(dot)com> wrote on 15.09.2004, 06:30:24:
> Chris Browne wrote:
> > Might we set up the view as:
> >
> > create view combination_of_logs as
> > select * from table_1 where txn_date between 'this' and 'that'
> > union all
> > select * from table_2 where txn_date between 'this2' and 'that2'
> > union all
> > select * from table_3 where txn_date between 'this3' and 'that3'
> > union all
> > select * from table_4 where txn_date between 'this4' and 'that4'
> > union all
> > ... ad infinitum
> > union all
> > select * from table_n where txn_date > 'start_of_partition_n';
> >
> > and expect that to help, as long as the query that hooks up to this
> > has date constraints?
> >
> > We'd have to regenerate the view with new fixed constants each time we
> > set up the tables, but that sounds like it could work...
>
> That's exactly what we're doing, but using inherited tables instead of a
> union view. With inheritance, there is no need to rebuild the view each
> time a table is added or removed. Basically, in our application, tables
> are partitioned by either month or week, depending on the type of data
> involved, and queries are normally date qualified.
>
> We're not completely done with our data conversion (from a commercial
> RDBMSi), but so far the results have been excellent. Similar to what
> others have said in this thread, the conversion involved restructuring
> the data to better suit Postgres, and the application (data
> analysis/mining vs. the source system which is operational). As a result
> we've compressed a > 1TB database down to ~0.4TB, and seen at least one
> typical query reduced from ~9 minutes down to ~40 seconds.

Sounds interesting.

The performance gain comes from partition elimination of the inherited
tables under the root?

I take it the compression comes from use of arrays, avoiding the need
for additional rows and key overhead?

Best Regards, Simon Riggs

Responses

Re: Data Warehouse Reevaluation - MySQL vs Postgres -- at 2004-09-15 15:15:50 from Joe Conway

Browse pgsql-performance by date

	From	Date	Subject
Next Message	Markus Schaber	2004-09-15 09:16:44	Re: Data Warehouse Reevaluation - MySQL vs Postgres --
Previous Message	Leeuw van der, Tim	2004-09-15 06:51:00	Re: Data Warehouse Reevaluation - MySQL vs Postgres --