Re: What's the cost of a few extra columns?

From: "Jim C(dot) Nasby" <jnasby(at)pervasive(dot)com>
To: Announce <truthhurts(at)insightbb(dot)com>
Cc: pgsql-performance(at)postgresql(dot)org
Subject: Re: What's the cost of a few extra columns?
Date: 2005-10-11 00:12:49
Message-ID: 20051011001249.GJ39569@pervasive.com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-performance

What you're describing is known as vertical partitioning (think of
splitting a table vertically), and can be a good technique for
increasing performance when used properly. The key is to try and get the
average row size down, since that means more rows per page which means
less I/O. Some things to consider:

First rule of performance tuning: don't. In other words, you should be
able to verify with benchmark numbers that a) you need to do this and b)
how much it's actually helping.

How will splitting the table affect *_tstmp, especially mod_tstmp?

How will you handle inserts and joining these two tables together? Will
you always do a left join (preferably via a view), or will you have a
trigger/rule that inserts into production_info whenever a row is
inserted into productions?

On Sun, Oct 09, 2005 at 10:03:33PM -0500, Announce wrote:
> What's goin on pg-people?
>
> I have a table PRODUCTIONS that is central to the DB and ties a lot of other
> information together:
>
> PRODUCTIONS (table)
> ----------------------------------
> prod_id primary key
> type_id foreign key
> level_id foreign key
> tour_id foreign key
> show_id foreign key
> venue_id foreign key
> title varchar(255); not null indexed
> version char;
> details text
> open_date date
> close_date date
> preview_open date
> preview_close date
> perform_tot int
> preview_tot int
> park_info text
> phone_nos text
> some_other_info text
> seating_info text
> this text
> that text
> create_tstmp timestamptz; NOW()
> mod_tstmp timestamptz;triggered
> delete_tstmp timestamptz;default null
> is_complete bool
>
>
> As it stands now, there are approximately 25-30 columns on the table. Since
> this table is very central to the database, would it be more efficient to
> break some of the columns (especially the TEXT ones) out into a separate
> INFO table since some queries on the web will not care about all of these
> text columns anyway? I know that pg can handle A LOT more columns and if
> there IS no performance hit for keeping them all on the same table, I would
> like to do that because the relation between PRODUCTIONS and the INFO will
> always be 1-to-1.
>
> My implementation of this INFO table would look a little somethin' like
> this:
>
> PROD_INFO (table)
> -------------------------------
> prod_id pkey/fkey
> open_date date
> close_date date
> preview_open date
> preview_close date
> perform_tot int
> preview_tot int
> park_info text
> phone_nos text
> some_other_info text
> seating_info text
> this text
> that text
> (the rest would stay in in the original PRODUCTIONS table)
>
>
> I am open to ANY suggestions, criticisms, mockery, etc.
>
> Thanks,
>
> Aaron
>
>
> ---------------------------(end of broadcast)---------------------------
> TIP 5: don't forget to increase your free space map settings
>

--
Jim C. Nasby, Sr. Engineering Consultant jnasby(at)pervasive(dot)com
Pervasive Software http://pervasive.com work: 512-231-6117
vcard: http://jim.nasby.net/pervasive.vcf cell: 512-569-9461

In response to

Responses

Browse pgsql-performance by date

  From Date Subject
Next Message Announce 2005-10-11 02:14:12 Re: What's the cost of a few extra columns?
Previous Message Michael Stone 2005-10-10 22:48:34 Re: XFS External Log on Pg 7.4.8 Pg_xlog drives?