Quick Links

Re: Allow some recovery parameters to be changed with reload

From:	Fujii Masao <masao(dot)fujii(at)oss(dot)nttdata(dot)com>
To:	Sergei Kornilov <sk(at)zsrv(dot)org>, Kyotaro Horiguchi <horikyota(dot)ntt(at)gmail(dot)com>, "a(dot)lubennikova(at)postgrespro(dot)ru" <a(dot)lubennikova(at)postgrespro(dot)ru>
Cc:	"robertmhaas(at)gmail(dot)com" <robertmhaas(at)gmail(dot)com>, "michael(at)paquier(dot)xyz" <michael(at)paquier(dot)xyz>, "andres(at)anarazel(dot)de" <andres(at)anarazel(dot)de>, "peter(dot)eisentraut(at)2ndquadrant(dot)com" <peter(dot)eisentraut(at)2ndquadrant(dot)com>, "pgsql-hackers(at)postgresql(dot)org" <pgsql-hackers(at)postgresql(dot)org>
Subject:	Re: Allow some recovery parameters to be changed with reload
Date:	2020-10-28 12:36:24
Message-ID:	0e0d5982-ac64-818d-1ec6-18a80b9300b1@oss.nttdata.com
Views:	Whole Thread \| Raw Message \| Download mbox \| Resend email
Thread:
Lists:	pgsql-hackers

On 2020/10/28 21:02, Sergei Kornilov wrote:
> Hello
>
> Sorry for late response.
>
>>> > ... but what's the corresponding hazard here, exactly? It doesn't seem
>>> > that there's any way in which the decision one process makes affects
>>> > the decision the other process makes. There's still a race condition:
>>> > it's possible for a walsender
>>> Did you mean walreceiver here?
>>
>> It's logical walsender. restore_command is used within
>> logical_read_xlog_page() via XLogReadDetermineTimeline().
>
> Still have no idea what's the corresponding hazard here.
>
>>> > to use the old restore_command after the
>>> > startup process had already used the new one, or the other way around.
>>> > However, it doesn't seem like that should confuse anything inside the
>>> > server, and therefore I'm not sure we need to code around it.
>>> I came up with following scenario. Let's say we have xlog files 1,2,3
>>> in dir1 and files 4,5 in dir2. If startup process had only handled
>>> files 1 and 2, before we switched restore_command from reading dir1 to
>>> reading dir2, it will fail to find next file. IIUC, it will assume
>>> that recovery is done, start server and walreceiver. The walreceiver
>>> will fail as well. I don't know, how realistic is this case, though.
>>
>> That operation is somewhat bogus, if the server is not in standby
>> mode. In standby mode, startup waits for the next segment safely.
>
> I think it's pilot error. It is already possible to change anything in restore_command by wrapping real command into some script:
>
>> restore_command = '/bin/restore_wal.sh "%f" "%p"'
>
> And one can simple replace this file with something else with different logic. Or even by using some command with separate own settings. Real world example ( https://github.com/wal-g/wal-g ):
>
>> restore_command = '. /etc/wal-g/WALG_AWS_ENV; wal-g wal-fetch "%f" "%p"'
>
> And it is possible to change the real WAL source in ENV script without changing the restore_command. We can't track this, so I not see new issues here.
>
>>> Sergey, could you please attach this thread to the upcoming CF, if
>>> you're going to continue working on it.
>
> Sure, I created one: https://commitfest.postgresql.org/30/2802/

+1 to mark restore_command as PGC_SIGHUP.

Currently when restore_command is not set, archive recovery fails
at the beginning. With the patch, how should we treat the case where
retore_command is reset to empty during archive recovery? We should
reject that change of restore_command?

Regards,

--
Fujii Masao
Advanced Computing Technology Center
Research and Development Headquarters
NTT DATA CORPORATION

In response to

Re: Allow some recovery parameters to be changed with reload at 2020-10-28 12:02:33 from Sergei Kornilov

Responses

Re: Allow some recovery parameters to be changed with reload at 2020-11-06 12:36:48 from Sergei Kornilov

Browse pgsql-hackers by date

	From	Date	Subject
Next Message	k.jamison@fujitsu.com	2020-10-28 12:52:08	RE: [Patch] Optimize dropping of relation buffers using dlist
Previous Message	Fabrízio de Royes Mello	2020-10-28 12:35:21	Re: Add important info about ANALYZE after create Functional Index