From: | Anastasia Lubennikova <a(dot)lubennikova(at)postgrespro(dot)ru> |
---|---|
To: | Aleksander Alekseev <a(dot)alekseev(at)postgrespro(dot)ru> |
Cc: | pgsql-hackers <pgsql-hackers(at)postgresql(dot)org> |
Subject: | Re: Patch: fix lock contention for HASHHDR.mutex |
Date: | 2016-01-27 13:19:43 |
Message-ID: | 56A8C3EF.7030604@postgrespro.ru |
Views: | Raw Message | Whole Thread | Download mbox | Resend email |
Thread: | |
Lists: | pgsql-hackers |
22.01.2016 13:48, Aleksander Alekseev:
>> Then, this thread became too tangled. I think it's worth to write a
>> new message with the patch, the test script, some results and brief
>> overview of how does it really works. It will make following review
>> much easier.
> Sure.
>
> HASHHDR represents a hash table. It could be usual or partitioned.
> Partitioned table is stored in a shared memory and accessed by multiple
> processes simultaneously. To prevent data corruption hash table is
> partitioned and each process has to acquire a lock for a corresponding
> partition before accessing data in it. Number of partition is determine
> by lower bits of key's hash value. Most tricky part is --- dynahash
> knows nothing about these locks, all locking is done on calling side.
>
> Since shared memory is pre-allocated and can't grow to allocate memory
> in a hash table we use freeList. Also we use nentries field to count
> current number of elements in a hash table. Since hash table is used by
> multiple processes these fields are protected by a spinlock. Which as
> it turned out could cause lock contention and create a bottleneck.
>
> After trying a few approaches I discovered that using one spinlock per
> partition works quite well. Here are last benchmark results:
>
> http://www.postgresql.org/message-id/20151229184851.1bb7d1bd@fujitsu
>
> Note that "no locks" solution cant be used because it doesn't guarantee
> that all available memory will be used in some corner cases.
>
> You can find a few more details and a test script in the first message
> of this thread. If you have any other questions regarding this patch
> please don't hesitate to ask.
InitLocks
/*
* Compute init/max size to request for lock hashtables. Note these
* calculations must agree with LockShmemSize!
*/
This comment certainly requires some changes.
BTW, could you explain why init_table_size was two times less than
max_table_size?
Although, I don't see any problems with your changes.
- hctl->nentries = 0;
- hctl->freeList = NULL;
Why did you delete these two lines? I wonder if you should rewrite them
instead?
+ * This particular number of partitions significantly reduces lock
contention
+ * in partitioned hash tables, almost if partitioned tables didn't use any
+ * locking at all
As far as I understood, this number was obtained experimentally? Maybe
you should note that in the comment.
And the last, but not least.
+ nelem_alloc = ((int) nelem) / partitions_number;
+ if (nelem_alloc == 0)
+ nelem_alloc = 1;
+
+ for (i = 0; i < partitions_number; i++)
+ if (!element_alloc(hashp, nelem_alloc, i))
+ ereport(ERROR,
+ (errcode(ERRCODE_OUT_OF_MEMORY),
+ errmsg("out of memory")));
It looks like you forgot to round the result of the division.
For example, if you have nelem=25 and partitions_number=6.
25 / 6 = 4. And then you allocate 24 nelems, while 1 is lost.
What about productivity, I did a test on my notebook. Patched version
shows small performance improvement.
pgbench -j 1 -c 1 -f pgbench.sql -T 300 postgres
base patched
127tps 145tps
pgbench -j 8 -c 8 -f pgbench.sql -T 300 postgres
base patched
247tps 270tps
But I haven't any big machine to perform tests, where the patch is
supposed to make significant changes.
So I would rely on your and the other reviewers results.
Except mentioned notes, I suppose the patch is good enough to pass it to
a reviewer.
--
Anastasia Lubennikova
Postgres Professional: http://www.postgrespro.com
The Russian Postgres Company
From | Date | Subject | |
---|---|---|---|
Next Message | Peter Geoghegan | 2016-01-27 13:20:03 | Re: Using quicksort for every external sort run |
Previous Message | Andres Freund | 2016-01-27 13:11:05 | Re: Proposal:Use PGDLLEXPORT for libpq |