Re: Client-side compression

From: Filip Rembiałkowski <plk(dot)zuber(at)gmail(dot)com>
To: Rob Sargent <robjsargent(at)gmail(dot)com>
Cc: pgsql-sql(at)postgresql(dot)org
Subject: Re: Client-side compression
Date: 2009-06-24 15:08:58
Message-ID: 92869e660906240808vc476880sa4d005fba416de88@mail.gmail.com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-sql

2009/6/23 Rob Sargent <robjsargent(at)gmail(dot)com>

>
> Not sure if this belongs here or on the admin or performance list.
> Apologies if so. (And this may be a second posting as the first was from an
> un-registered account. Further apologies)
>
> My assumption is that any de/compression done by postgres would be
> server-side.
>
> We're considering minimizing bandwidth utilization

Why? Are you hitting some limit?

> by using client-side compression on a column value that will typically be
> multi-megabyte in size. We would use ALTER TABLE SET STORAGE EXTERNAL to
> prevent the server from un-necessary compression.
>
> Is this generally worthwhile?

No general answer, sorry.

It depends on other conditions and on what you want to achieve.

1. Almost always you will get better on-disk compression ratio. That's
because, quote from pg docs:
"The compression technique used is a fairly simple and very fast member of
the LZ family of compression techniques. See
src/backend/utils/adt/pg_lzcompress.c for the details."

2. You will force all client apps to decompress data on their side.

3. To minimize bandwith utilisation, there are other ways (think compressed
tunnels, SSL and so on - but it adds per-connection overhead)

>
> Is there a great penalty for a query which delves into the value, given
> that the server will not be aware it's compressed?

not clear. do you mean something like

SELECT ... FROM table where decompress(compressed_data) LIKE 'whatever'
???

of course it will be a great penalty.

I assume we're pretty much on our own to prevent such actions (i.e. the app
> can never query against this column via sql).

certainly.

--
Filip Rembiałkowski
JID,mailto:filip(dot)rembialkowski(at)gmail(dot)com
http://filip.rembialkowski.net/

In response to

Browse pgsql-sql by date

  From Date Subject
Next Message Sandeep 2009-06-24 16:39:15 Bucketing Row Data in columns
Previous Message Jasen Betts 2009-06-24 12:13:59 Re: Client-side compression