From: | "Bossart, Nathan" <bossartn(at)amazon(dot)com> |
---|---|
To: | Robert Haas <robertmhaas(at)gmail(dot)com>, Julien Rouhaud <rjuju123(at)gmail(dot)com> |
Cc: | "pgsql-hackers(at)postgresql(dot)org" <pgsql-hackers(at)postgresql(dot)org> |
Subject: | Re: parallelizing the archiver |
Date: | 2021-09-10 17:18:59 |
Message-ID: | 0B5F73F7-5F70-45CF-BCFA-FFA96C6C27E0@amazon.com |
Views: | Raw Message | Whole Thread | Download mbox | Resend email |
Thread: | |
Lists: | pgsql-hackers |
On 9/10/21, 10:12 AM, "Robert Haas" <robertmhaas(at)gmail(dot)com> wrote:
> If on the other hand you imagine a system that's not very busy, say 1
> WAL file being archived every 10 seconds, then using a batch size of
> 30 would very significantly delay removal of old files. However, on
> this system, batching probably isn't really needed. The rate of WAL
> file generation is low enough that if you pay the startup cost of your
> archive_command for every file, you're probably still doing just fine.
>
> Probably, any kind of parallelism or batching needs to take this kind
> of time-based thinking into account. For batching, the rate at which
> files are generated should affect the batch size. For parallelism, it
> should affect the number of processes used.
I was thinking that archive_batch_size would be the maximum batch
size. If the archiver only finds a single file to archive, that's all
it'd send to the archive command. If it finds more, it'd send up to
archive_batch_size to the command.
Nathan
From | Date | Subject | |
---|---|---|---|
Next Message | Jaime Casanova | 2021-09-10 17:31:20 | Re: Numeric x^y for negative x |
Previous Message | Mark Dilger | 2021-09-10 17:16:48 | Re: [Patch] ALTER SYSTEM READ ONLY |