From: | Andrey Borodin <x4mmm(at)yandex-team(dot)ru> |
---|---|
To: | Binguo Bao <djydewang(at)gmail(dot)com> |
Cc: | simon(at)2ndquadrant(dot)com, pgsql-hackers(at)postgresql(dot)org |
Subject: | Re: Optimize partial TOAST decompression |
Date: | 2019-06-23 09:23:54 |
Message-ID: | 7B7E4A05-F404-4083-B7F2-267869EE2910@yandex-team.ru |
Views: | Raw Message | Whole Thread | Download mbox | Resend email |
Thread: | |
Lists: | pgsql-hackers |
Hi, Binguo!
> 2 июня 2019 г., в 19:48, Binguo Bao <djydewang(at)gmail(dot)com> написал(а):
>
> Hi, hackers!
....
> This seems to have a 10x improvement. If the number of toast data chunks is more, I believe that patch can play a greater role, there are about 200 related TOAST data chunks for each entry in the case.
That's really cool that you could produce meaningful patch long before end of GSoC!
I'll describe what is going on a little:
1. We have compressed value, which resides in TOAST table.
2. We want only some fraction of this value. We want some prefix with length L.
3. Previously Paul Ramsey submitted patch that omits decompression of value beyond desired L bytes.
4. Binguo's patch tries to do not fetch compressed data which will not bee needed to decompressor. In fact it fetches L bytes from TOAST table.
This is not correct: L bytes of compressed data do not always can be decoded into at least L bytes of data. At worst we have one control byte per 8 bytes of literal bytes. This means at most we need (L*9 + 8) / 8 bytes with current pglz format.
Also, I'm not sure you use SET_VARSIZE_COMPRESSED correctly...
Best regards, Andrey Borodin.
From | Date | Subject | |
---|---|---|---|
Next Message | Thomas Munro | 2019-06-23 10:35:24 | Re: Code comment change |
Previous Message | Vik Fearing | 2019-06-23 09:21:13 | Code comment change |