From: | Simon Riggs <simon(at)2ndQuadrant(dot)com> |
---|---|
To: | Andres Freund <andres(at)anarazel(dot)de> |
Cc: | Hannu Krosing <hannu(at)krosing(dot)net>, Heikki Linnakangas <heikki(dot)linnakangas(at)enterprisedb(dot)com>, Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>, jd(at)commandprompt(dot)com, Gregory Stark <stark(at)enterprisedb(dot)com>, Mark Kirkwood <markir(at)paradise(dot)net(dot)nz>, pgsql-hackers <pgsql-hackers(at)postgresql(dot)org> |
Subject: | Re: Hot Standby (v9d) |
Date: | 2009-02-03 15:29:14 |
Message-ID: | 1233674954.4500.168.camel@ebony.2ndQuadrant |
Views: | Raw Message | Whole Thread | Download mbox | Resend email |
Thread: | |
Lists: | pgsql-hackers |
On Tue, 2009-02-03 at 15:55 +0100, Andres Freund wrote:
> Hi,
>
> On 02/03/2009 02:26 PM, Hannu Krosing wrote:
> >> I don't see any way around the fact that when a tuple is removed, it's
> >> gone and can't be accessed by queries. Either you don't remove it, or
> >> you kill the query.
> > Actually we came up with a solution to this - use filesystem level
> > snapshots (like LVM2+XFS or ZFS), and redirect backends with
> > long-running queries to use fs snapshot mounted to a different
> > mountpoint.
> Isn't that really, really expensive?
>
> A single write on the master logical volume yields writes of PE size
> for _every_ single snapshot (the first time the block is touched) -
> considering that there could quite many such snapshots I don't think
> that this is really feasible - io quite possible might be saturated.
If we did that we would provide an option to select the MVCC snapshot
that went with the filesystem snapshot. There need not be one per user.
--
Simon Riggs www.2ndQuadrant.com
PostgreSQL Training, Services and Support
From | Date | Subject | |
---|---|---|---|
Next Message | Hannu Krosing | 2009-02-03 15:38:17 | Re: Hot Standby (v9d) |
Previous Message | Tom Lane | 2009-02-03 15:22:19 | Re: [NOVICE] LATIN2->UTF8 conversation with dblink |