Quick Links

Re: tweaking MemSet() performance - 7.4.5

From:	Manfred Spraul <manfred(at)colorfullife(dot)com>
To:	mcolosimo(at)smtp-bedford(dot)mitre(dot)org
Cc:	Marc Colosimo <mcolosimo(at)mitre(dot)org>, pgsql-hackers(at)postgresql(dot)org
Subject:	Re: tweaking MemSet() performance - 7.4.5
Date:	2004-09-25 21:23:13
Message-ID:	4155E1C1.4030503@colorfullife.com
Views:	Whole Thread \| Raw Message \| Download mbox \| Resend email
Thread:
Lists:	pgsql-hackers

mcolosimo(at)mitre(dot)org wrote:

>>If the memset
>>bypasses the cache then the following access will cause a cache line
>>miss, which can be so slow that using the faster memset can result in a
>>net performance loss.
>>
>>
>>
>
>Could you suggest some structs to test? If I get your meaning, I would make a loop that sets then reads from the structure.
>
>
>
Read the sources and the cpu specs. Benchmarking such problems is
virtually impossible.
I don't have OS-X, thus I checked the Linux-kernel sources: It seems
that the power architecture doesn't have the same problem as x86.
There is a special clear cacheline instruction for large memsets and the
rest is done through carefully optimized store byte/halfword/word/double
word sequences.

Thus I'd check what happens if you memset not perfectly aligned buffers.
That's another point where over-optimized functions sometimes break
down. If there is no slowdown, then I'd replace the postgres function
with the OS provided function.

I'd add some __builtin_constant_p() optimizations, but I guess Tom won't
like gcc hacks ;-)
--
Manfred

In response to

Re: tweaking MemSet() performance - 7.4.5 at 2004-09-20 22:20:12 from mcolosimo

Responses

Re: tweaking MemSet() performance - 7.4.5 at 2004-09-29 08:46:18 from Karel Zak

Browse pgsql-hackers by date

	From	Date	Subject
Next Message	Bruce Momjian	2004-09-25 22:58:01	Re: 8.0.0beta1: Ownership of implicit sequences after dump/restore
Previous Message	Tom Lane	2004-09-25 19:45:12	Re: 7.4.5 losing committed transactions