Re: data model one large and many small columns

From: Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
To: "Campbell, Lance" <lance(at)illinois(dot)edu>
Cc: "pgsql-admin(at)postgresql(dot)org" <pgsql-admin(at)postgresql(dot)org>
Subject: Re: data model one large and many small columns
Date: 2016-07-26 21:18:08
Message-ID: 11722.1469567888@sss.pgh.pa.us
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-admin

"Campbell, Lance" <lance(at)illinois(dot)edu> writes:
> Suppose you have a table that consists of a lot of small sized columns with one really large text column. The text column represents an average sized web page. When inserting or updating you will work with only one row at a time.

> When selecting information you will either:
> A) select all columns in one row for display or editing purposes.
> B) Or you will select hundreds of rows but NOT the very large text column.

> Question: Is there any performance to be gained from PostgreSQL by
> storing the data as two tables versus one table?

No, probably not; TOAST will effectively do that for you by off-loading
the large text values into the side table.

If you were to do an explicit join, there would be some use-cases ---
mainly where you were selecting a LOT of rows --- where in theory you
could get a smarter plan from the explicit join. TOAST will effectively
always fetch the text values via a nestloop-with-inner-indexscan plan,
but maybe hashing or merging would be smarter. Unfortunately, with two
explicit tables, TOAST would still apply to the second table, which means
that what you've really got under the hood is a three-way join, with the
intermediate table contributing nothing except overhead.

So don't bother ...

regards, tom lane

In response to

Responses

Browse pgsql-admin by date

  From Date Subject
Next Message Campbell, Lance 2016-07-26 21:24:42 Re: data model one large and many small columns
Previous Message David G. Johnston 2016-07-26 21:17:36 Re: data model one large and many small columns