| From: | Andres Freund <andres(at)2ndquadrant(dot)com> |
|---|---|
| To: | Josh Berkus <josh(at)agliodbs(dot)com> |
| Cc: | pgsql-hackers(at)postgresql(dot)org |
| Subject: | Re: Why we are going to have to go DirectIO |
| Date: | 2013-12-05 00:49:48 |
| Message-ID: | 20131205004948.GB8935@awork2.anarazel.de |
| Views: | Whole Thread | Raw Message | Download mbox | Resend email |
| Thread: | |
| Lists: | pgsql-hackers |
Hi,
On 2013-12-03 10:44:15 -0800, Josh Berkus wrote:
> I don't know where we'll get the resources to implement our own storage,
> but it's looking like we don't have a choice.
As long as our storage layer is a s suboptimal as it is today, I think
it's a purely detractory to primarily blame the kernel.
We
* cannot deal with large shared_buffers, the dirty-buffer scanning is far to
expensive. The amount of memory required for locks is pretty big, and
every backend carries around a pretty huge private array for the
buffer pins.
* do not have scalability in pretty damn central datastructures like
buffer mapping.
* Our background eviction mechanism doesn't do anything in lots of
workloads but increase contention on important data structures.
* Due to the missing efficient eviction, we synchronously write out data
when acquiring a victim buffer most of the time. That's already bad if
you have a kernel buffering your writes, but if you don't...
* Due to the frequency of buffer pins in almost all workloads, our
tracking of the importance of individual buffers is far, far too
volatile.
Greetings,
Andres Freund
--
Andres Freund http://www.2ndQuadrant.com/
PostgreSQL Development, 24x7 Support, Training & Services
| From | Date | Subject | |
|---|---|---|---|
| Next Message | Tom Lane | 2013-12-05 01:02:38 | Re: Status of FDW pushdowns |
| Previous Message | Robert Haas | 2013-12-05 00:45:56 | Re: Performance optimization of btree binary search |