Quick Links

Re: Performance Evaluation of Result Cache by using TPC-DS

From:	David Rowley <dgrowleyml(at)gmail(dot)com>
To:	Yuya Watari <watari(dot)yuya(at)gmail(dot)com>
Cc:	pgsql-hackers <pgsql-hackers(at)postgresql(dot)org>
Subject:	Re: Performance Evaluation of Result Cache by using TPC-DS
Date:	2021-05-03 23:02:22
Message-ID:	CAApHDvraTf76AVSmxivBKVG9RrfBLGdYNnbNJpHor6ZtHtMwRg@mail.gmail.com
Views:	Whole Thread \| Raw Message \| Download mbox \| Resend email
Thread:
Lists:	pgsql-hackers

Thanks for doing further analysis on this.

On Mon, 26 Apr 2021 at 20:31, Yuya Watari <watari(dot)yuya(at)gmail(dot)com> wrote:
> Thank you for running experiments on your machine and I really
> appreciate your deep analysis.
>
> Your results are very interesting. In 5 queries, the result cache is
> cheaper but slower. Especially, in query 88, although the cost with
> result cache is cheaper, it has 34.23% degradation in query execution
> time. This is big regression.

That's certainly one side of it. On the other side, it's pretty
important to also note that in 4 of 23 queries the result cache plan
executed faster but the planner costed it as more expensive.

I'm not saying the costing is perfect, but what I am saying is, as you
noted above, in 5 of 23 queries the result cache was cheaper and
slower, and, as I just noted, in 4 of 23 queries, result cache was
more expensive and faster. We know that costing is never going to be
a perfect representation of what the execution time will be However,
in these examples, we've just happened to get quite a good balance. If
we add a penalty to result cache then it'll just subtract from one
problem group and add to the other.

Overall, in my tests execution was 1.15% faster with result cache
enabled than it was without.

I could maybe get on board with adding a small fixed cost penalty. I'm
not sure exactly what it would be, maybe a cpu_tuple_cost instead of a
cpu_operator_cost and count it in for forming/deforming cached tuples.
I think the patch you wrote to add the resultcache_cost_factor is only
suitable for running experiments with.

The bigger concerns I have with the costing are around what happens
when an n_distinct estimate is far too low on one of the join columns.
I think it is more likely to be concerns like that one which would
cause us to default enable_resultcache to off.

David

In response to

Re: Performance Evaluation of Result Cache by using TPC-DS at 2021-04-26 08:32:17 from Yuya Watari

Responses

Re: Performance Evaluation of Result Cache by using TPC-DS at 2021-05-12 05:08:20 from Yuya Watari

Browse pgsql-hackers by date

	From	Date	Subject
Next Message	Peter Smith	2021-05-03 23:29:42	AlterSubscription_refresh "wrconn" wrong variable?
Previous Message	Tom Lane	2021-05-03 22:42:51	Re: Simplify backend terminate and wait logic in postgres_fdw test