Re: On columnar storage

From: Jim Nasby <Jim(dot)Nasby(at)BlueTreble(dot)com>
To: Alvaro Herrera <alvherre(at)2ndquadrant(dot)com>, Robert Haas <robertmhaas(at)gmail(dot)com>
Cc: Pg Hackers <pgsql-hackers(at)postgresql(dot)org>
Subject: Re: On columnar storage
Date: 2015-06-16 02:04:56
Message-ID: 557F8448.5070104@BlueTreble.com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-hackers

On 6/14/15 10:22 AM, Alvaro Herrera wrote:
>> >To me, it feels like there are two different features here that would
>> >be better separated. First, there's the idea of having a table that
>> >gets auto-joined to other tables whenever you access it, so that the
>> >user sees one really wide table but really the data is segregated by
>> >column groups under the hood. That's a neat idea.
> Thanks. (It also seems pretty tricky to implement.)

I look at it as a form of vertical partitioning; the big difference
being whether you normalize the columns out or not (or to use data
warehouse parlance, slow vs fast changing dimensions).

Perhaps it would be useful to vet this out as a userspace extension
first since that would presumably be much easier. I believe we actually
have all the backend infrastructure that would be needed for this now
that views are smart enough to exclude tables that aren't referenced at
all. I suspect that even a 'dumb userspace' approach would still expose
a lot of the planner problems we'll run into (join explosion and
filtering through the join come to mind).

Related to idea of an 'auto join', I do wish we had the ability to
access columns in a referenced FK table from a referring key; something
like SELECT customer_id.first_name FROM invoice (which would be
translated to SELECT first_name FROM invoice JOIN customer USING(
customer_id )).
--
Jim Nasby, Data Architect, Blue Treble Consulting, Austin TX
Data in Trouble? Get it in Treble! http://BlueTreble.com

In response to

Responses

Browse pgsql-hackers by date

  From Date Subject
Next Message Jim Nasby 2015-06-16 02:47:49 Re: pg_stat_*_columns?
Previous Message Jim Nasby 2015-06-16 00:26:26 Re: Need Multixact Freezing Docs