Quick Links

Re: parallelizing the archiver

From:	"Bossart, Nathan" <bossartn(at)amazon(dot)com>
To:	Robert Haas <robertmhaas(at)gmail(dot)com>, Julien Rouhaud <rjuju123(at)gmail(dot)com>
Cc:	"pgsql-hackers(at)postgresql(dot)org" <pgsql-hackers(at)postgresql(dot)org>
Subject:	Re: parallelizing the archiver
Date:	2021-09-10 17:18:59
Message-ID:	0B5F73F7-5F70-45CF-BCFA-FFA96C6C27E0@amazon.com
Views:	Whole Thread \| Raw Message \| Download mbox \| Resend email
Thread:
Lists:	pgsql-hackers

On 9/10/21, 10:12 AM, "Robert Haas" <robertmhaas(at)gmail(dot)com> wrote:
> If on the other hand you imagine a system that's not very busy, say 1
> WAL file being archived every 10 seconds, then using a batch size of
> 30 would very significantly delay removal of old files. However, on
> this system, batching probably isn't really needed. The rate of WAL
> file generation is low enough that if you pay the startup cost of your
> archive_command for every file, you're probably still doing just fine.
>
> Probably, any kind of parallelism or batching needs to take this kind
> of time-based thinking into account. For batching, the rate at which
> files are generated should affect the batch size. For parallelism, it
> should affect the number of processes used.

I was thinking that archive_batch_size would be the maximum batch
size. If the archiver only finds a single file to archive, that's all
it'd send to the archive command. If it finds more, it'd send up to
archive_batch_size to the command.

Nathan

In response to

Re: parallelizing the archiver at 2021-09-10 17:09:51 from Robert Haas

Responses

Re: parallelizing the archiver at 2021-09-10 18:34:00 from Andrey Borodin

Browse pgsql-hackers by date

	From	Date	Subject
Next Message	Jaime Casanova	2021-09-10 17:31:20	Re: Numeric x^y for negative x
Previous Message	Mark Dilger	2021-09-10 17:16:48	Re: [Patch] ALTER SYSTEM READ ONLY