Re: Air-traffic benchmark

From: Arjen van der Meijden <acmmailing(at)tweakers(dot)net>
To: Lefteris <lsidir(at)gmail(dot)com>
Cc: pgsql-performance(at)postgresql(dot)org
Subject: Re: Air-traffic benchmark
Date: 2010-01-07 13:36:46
Message-ID: 4B45E36E.4040706@tweakers.net
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-performance

On 7-1-2010 13:38 Lefteris wrote:
> I decided to run the benchmark over postgres to get some more
> experience and insights. Unfortunately, the query times I got from
> postgres were not the expected ones:

Why were they not expected? In the given scenario, column databases are
having a huge advantage. Especially the given simple example is the type
of query a column database *should* excel.
You should, at the very least, compare the queries to MyISAM:
http://www.mysqlperformanceblog.com/2009/11/05/air-traffic-queries-in-myisam-and-tokutek-tokudb/

But unfortunately, that one also beats your postgresql-results.

> The hardware characteristics are:
> Platform Intel(R) Core(TM)2 Quad CPU Q6600 @ 2.40GHz with 8GB RAM and
> ample disk space (2x 500 GB SATA disk @ 7200 RPM as SW-RAID-0)

Unfortunately, the blogpost fails to mention the disk-subsystem. So it
may well be much faster than yours, although its not a new, big or fast
server, so unless it has external storage, it shouldn't be too different
for sequential scans.

> SELECT "DayOfWeek", count(*) AS c FROM ontime WHERE "Year" BETWEEN
> 2000 AND 2008 GROUP BY "DayOfWeek" ORDER BY c DESC;
>
> Reported query times are (in sec):
> MonetDB 7.9s
> InfoBright 12.13s
> LucidDB 54.8s
>
> For pg-8.4.2 I got with 3 consecutive runs on the server:
> 5m52.384s
> 5m55.885s
> 5m54.309s

Maybe an index of the type 'year, dayofweek' will help for this query.
But it'll have to scan about half the table any way, so a seq scan isn't
a bad idea.
In this case, a partitioned table with partitions per year and
constraint exclusion enabled would help a bit more.

Best regards,

Arjen

In response to

Responses

Browse pgsql-performance by date

  From Date Subject
Next Message Lefteris 2010-01-07 13:40:39 Re: Air-traffic benchmark
Previous Message A. Kretschmer 2010-01-07 13:32:25 Re: Air-traffic benchmark