| From: | Stefan Kaltenbrunner <stefan(at)kaltenbrunner(dot)cc> |
|---|---|
| To: | Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us> |
| Cc: | PostgreSQL-development <pgsql-hackers(at)postgresql(dot)org> |
| Subject: | Re: -HEAD planner issue wrt hash_joins on dbt3 ? |
| Date: | 2006-09-17 06:42:51 |
| Message-ID: | 450CEE6B.1090005@kaltenbrunner.cc |
| Views: | Whole Thread | Raw Message | Download mbox | Resend email |
| Thread: | |
| Lists: | pgsql-hackers |
[already sent a variant of that yesterday but it doesn't look like it
made it to the list]
Tom Lane wrote:
> Stefan Kaltenbrunner <stefan(at)kaltenbrunner(dot)cc> writes:
>> Tom Lane wrote:
>>> Apparently we've made the planner a bit too optimistic about the savings
>>> that can be expected from repeated indexscans occurring on the inside of
>>> a join.
>
>> effective_cache_size was set to 10GB(my fault for copying over the conf
>> from a 16GB box) during the run - lowering it just a few megabytes(!) or
>> to a more realistic 6GB results in the following MUCH better plan:
>> http://www.kaltenbrunner.cc/files/dbt3_explain_analyze2.txt
>
> Interesting. It used to be that effective_cache_size wasn't all that
> critical... what I think this report is showing is that with the 8.2
> changes to try to account for caching effects in repeated indexscans,
> we've turned that into a pretty significant parameter.
took me a while due to hardware issues on my testbox - but there are new
results(with 6GB for effective_cache_size) up at:
http://www.kaltenbrunner.cc/files/5/
there are still a few issues with the validity of the run like the rf
tests not actually being done right - but lowering effective_cache_size
gave a dramtic speedup on Q5,Q7 and Q8.
that is the explain for the 4h+ Q9:
http://www.kaltenbrunner.cc/files/analyze_q9.txt
increasing the the statistic_target up to 1000 does not seem to change
the plan btw.
disabling nested loop leads to the following (4 times faster) plan:
http://www.kaltenbrunner.cc/files/analyze_q9_no_nest.txt
since the hash-joins in there look rather slow (inappropriate hashtable
set up due to the wrong estimates?) I disabled hash_joins too:
http://www.kaltenbrunner.cc/files/analyze_q9_no_nest_no_hashjoin.txt
and amazingly this plan is by far the fastest one in runtime (15min vs
4,5h ...) except that the planner thinks it is 20 times more expensive ...
Stefan
| From | Date | Subject | |
|---|---|---|---|
| Next Message | Gevik Babakhani | 2006-09-17 07:57:13 | Re: Opinion about macro for the uuid datatype. |
| Previous Message | Dan Thomas | 2006-09-17 05:14:42 | tiny patch to make vacuumdb -a's database order match pg_dumpall |