Re: BUG #17005: Enhancement request: Improve walsender throughput by aggregating multiple messages in one send

From: Yura Sokolov <y(dot)sokolov(at)postgrespro(dot)ru>
To: Andres Freund <andres(at)anarazel(dot)de>
Cc: rony(dot)kurniawan(at)oracle(dot)com, pgsql-bugs(at)lists(dot)postgresql(dot)org
Subject: Re: BUG #17005: Enhancement request: Improve walsender throughput by aggregating multiple messages in one send
Date: 2021-05-17 13:21:36
Message-ID: aae9709fddecd6f6b59405c62a861dae@postgrespro.ru
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-bugs

Andres Freund писал 2021-05-17 01:44:
> Hi,
>
> On 2021-05-13 00:31:53 +0000, PG Bug reporting form wrote:
>> I measured the throughput of reading the logical replication slot and
>> found
>> that in smaller row size (512 bytes) the throughput is 50% lower
>> compared to
>> 1024 bytes.
>
> Huh, that is interesting.
>
>
>> tcpdump shows that ethernet packets sent by the replication server
>> contain
>> only one message per packet (see tcpdump output below).
>> May be this is the intended design to achieve low latency but this is
>> not
>> favorable in application that requires high throughput.
>
> What kind of network is this? I would have expected that if the network
> can't keep up the small sends would end up getting aggregated into
> larger packets anyway? Are you hitting a PPS limit due to the small
> packages, but not yet the throughput limit?

I believe the reason is more in sys-call and kernel cpu time overhead
than in
network throughput. Especially in this "after meltdown+spectre" time.

>
>> Is it possible for PostgreSQL to enable Nagle's algorithm on the
>> streaming
>> socket for replication?
>> Or aggegate the messages manually before sending them in one send()?
>
> I think we can probably do better in cases a transaction is more than a
> single change - but I don't think either enabling nagle's or
> aggregation
> are really an option in case of single row transactions. The latency
> impact for some scenarios seems too high.

We have commit_siblings and commit_delay options. Probably similar could
be introduced for replication slot?

regards
Yura

In response to

Responses

Browse pgsql-bugs by date

  From Date Subject
Next Message PG Bug reporting form 2021-05-17 14:11:06 BUG #17015: trigger + array + 'md5' -> 'domain kernel: pid ... (postgres), uid ..., was killed: out of swap spac
Previous Message Alex F 2021-05-17 09:29:56 Re: BUG #16833: postgresql 13.1 process crash every hour