From: | Jesper Krogh <jesper(at)krogh(dot)cc> |
---|---|
To: | Merlin Moncure <mmoncure(at)gmail(dot)com> |
Cc: | Divakar Singh <dpsmails(at)yahoo(dot)com>, Robert Haas <robertmhaas(at)gmail(dot)com>, Mladen Gogala <mladen(dot)gogala(at)vmsinfo(dot)com>, pgsql-performance(at)postgresql(dot)org |
Subject: | Re: Postgres insert performance and storage requirement compared to Oracle |
Date: | 2010-10-27 19:47:23 |
Message-ID: | 4CC881CB.8090806@krogh.cc |
Views: | Raw Message | Whole Thread | Download mbox | Resend email |
Thread: | |
Lists: | pgsql-hackers pgsql-performance |
On 2010-10-27 20:51, Merlin Moncure wrote:
>> Yes, I am quite aware of how the o/s page cache works. All else being
>> equal, I more compact database obviously would be preferred. However
>> 'all else' is not necessarily equal. I can mount my database on bzip
>> volume, that must make it faster, right? wrong. I understand the
>> postgres storage architecture pretty well, and the low hanging fruit
>> having been grabbed further layout compression is only going to come
>> as a result of tradeoffs.
>>
Or configureabillity.. Not directly related to overall space consumption
but I have been working on a patch that would make TOAST* kick in
earlier in the process, giving a "slimmer" main table with visibillity
information
and simple columns and moving larger colums more aggressively to TOAST.
The overall disadvantage of TOAST is the need for an extra disk seek if
you actually need the data. If the application rarely needs the large
columns but often do count/filtering on simple values this will eventually
lead to a better utillization of the OS-page-cache with a very small
overhead
to PG (in terms of code) and 0 overhead in the applications that benefit.
Keeping in mind that as SSD-drives get more common the "the extra disk seek"
drops dramatically, but the drive is by itself probably still 100-1000x
slower than
main memory, so keeping "the right data" in the OS-cache is also a
parameter.
If you deal with data where the individual tuple-size goes up, currently
TOAST
first kicks in at 2KB (compressed size) which leads to a very sparse
main table
in terms of visibillity information and count and selects on simple values
will drag a huge amount of data into the cache-layers thats not needed
there.
Another suggestion could be to make the compression of text columns kick in
earlier .. if thats possible. (I dont claim that its achiveable)
Unless the tuple-header is hugely bloated I have problems creating a
situation in my
head where hammering that one can change anything significantly.
* http://www.mail-archive.com/pgsql-hackers(at)postgresql(dot)org/msg159726.html
--
Jesper
From | Date | Subject | |
---|---|---|---|
Next Message | Fujii Masao | 2010-10-27 19:53:29 | Re: xlog.c: WALInsertLock vs. WALWriteLock |
Previous Message | Joshua D. Drake | 2010-10-27 19:40:12 | Re: max_wal_senders must die |
From | Date | Subject | |
---|---|---|---|
Next Message | Pierre C | 2010-10-27 19:52:49 | Re: Select count(*), the sequel |
Previous Message | Alex Hunsaker | 2010-10-27 19:45:06 | Re: Postgres insert performance and storage requirement compared to Oracle |