From: | Robert Haas <robertmhaas(at)gmail(dot)com> |
---|---|
To: | Andres Freund <andres(at)anarazel(dot)de> |
Cc: | Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>, Noah Misch <noah(at)leadboat(dot)com>, Alvaro Herrera <alvherre(at)2ndquadrant(dot)com>, Thomas Munro <thomas(dot)munro(at)enterprisedb(dot)com>, "Joshua D(dot) Drake" <jd(at)commandprompt(dot)com>, Steve Kehlet <steve(dot)kehlet(at)gmail(dot)com>, Forums postgresql <pgsql-general(at)postgresql(dot)org>, Pg Hackers <pgsql-hackers(at)postgresql(dot)org> |
Subject: | Re: [HACKERS] 9.4.1 -> 9.4.2 problem: could not access status of transaction 1 |
Date: | 2015-06-05 18:26:57 |
Message-ID: | CA+TgmoYoHz7QkTLeapJaOzQCFq3DyW3RjvB5jBa6y7kA209P+g@mail.gmail.com |
Views: | Raw Message | Whole Thread | Download mbox | Resend email |
Thread: | |
Lists: | pgsql-general pgsql-hackers |
On Fri, Jun 5, 2015 at 12:00 PM, Andres Freund <andres(at)anarazel(dot)de> wrote:
> On 2015-06-05 11:43:45 -0400, Tom Lane wrote:
>> Robert Haas <robertmhaas(at)gmail(dot)com> writes:
>> > On Fri, Jun 5, 2015 at 2:20 AM, Noah Misch <noah(at)leadboat(dot)com> wrote:
>> >> I read through this version and found nothing to change. I encourage other
>> >> hackers to study the patch, though. The surrounding code is challenging.
>>
>> > Andres tested this and discovered that my changes to
>> > find_multixact_start() were far more creative than intended.
>> > Committed and back-patched with a trivial fix for that stupidity and a
>> > novel-length explanation of the changes.
>>
>> So where are we on this? Are we ready to schedule a new set of
>> back-branch releases? If not, what issues remain to be looked at?
>
> We're currently still doing bad things while the database is in an
> inconsistent state (i.e. read from SLRUs and truncate based on the
> results of that). It's quite easy to reproduce base backup startup
> failures.
>
> On the other hand, that's not new. And the fix requires, afaics, a new
> type of WAL record (issued very infrequently). I'll post a first version
> of the patch, rebased ontop of Robert's commit, tonight or tomorrow. I
> guess we can then decide what we'd like to do.
There are at least two other known issues that seem like they should
be fixed before we release:
1. The problem that we might truncate an SLRU members page away when
it's in the buffers, but not drop it from the buffers, leading to a
failure when we try to write it later.
2. Thomas's bug fix for another longstanding but that occurs when you
run his checkpoint-segment-boundary.sh script.
I think we might want to try to fix one or both of those before
cutting a new release. I'm less sold on the idea of installing
WAL-logging in this minor release. That probably needs to be done,
but right now we've got stuff that worked in early 9.3.X release and
is now broken, and I'm in favor of fixing that first.
--
Robert Haas
EnterpriseDB: http://www.enterprisedb.com
The Enterprise PostgreSQL Company
From | Date | Subject | |
---|---|---|---|
Next Message | Tom Lane | 2015-06-05 18:33:12 | Re: [HACKERS] 9.4.1 -> 9.4.2 problem: could not access status of transaction 1 |
Previous Message | Steve Crawford | 2015-06-05 18:24:34 | Re: Query running slow for only one specific id. (Postgres 9.3) version |
From | Date | Subject | |
---|---|---|---|
Next Message | Tom Lane | 2015-06-05 18:27:59 | gcc -ansi versus SSE4.2 detection |
Previous Message | Alvaro Herrera | 2015-06-05 18:11:34 | Re: Is it possible to have a "fast-write" Index? |