Quick Links

Re: two index bitmap scan of a big table & hash_seq_search

From:	Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
To:	"Sergey E(dot) Koposov" <math(at)sai(dot)msu(dot)ru>
Cc:	pgsql-hackers(at)postgresql(dot)org
Subject:	Re: two index bitmap scan of a big table & hash_seq_search
Date:	2011-08-20 03:03:40
Message-ID:	3266.1313809420@sss.pgh.pa.us
Views:	Whole Thread \| Raw Message \| Download mbox \| Resend email
Thread:
Lists:	pgsql-hackers

"Sergey E. Koposov" <math(at)sai(dot)msu(dot)ru> writes:
> But the funny thing I noticed is that the query after running a certain
> amount of time doing I/O, starts to use 100%CPU and spend 99% the time in
> hash_seq_search. Here is the oprofile of PG during that period:
> --------
> CPU: Intel Core/i7, speed 2.268e+06 MHz (estimated)
> Counted CPU_CLK_UNHALTED events (Clock cycles when not halted) with a unit mask of 0x00 (No unit mask) count 100000
> samples % symbol name
> 303404 99.3562 hash_seq_search
> 1163 0.3808 tbm_lossify
> 639 0.2093 hash_search_with_hash_value

It seems like you've uncovered a scaling limitation in the tidbitmap
logic when it has to deal with very very large numbers of pages.

I might be reading too much into the mention of tbm_lossify, but
I wonder if the problem is repeated invocations of tbm_lossify()
as the bitmap gets larger. Maybe that function needs to be more
aggressive about how much information it deletes per call.

regards, tom lane

In response to

two index bitmap scan of a big table & hash_seq_search at 2011-08-19 17:56:06 from Sergey E. Koposov

Responses

Re: two index bitmap scan of a big table & hash_seq_search at 2011-08-20 16:50:21 from Sergey E. Koposov

Browse pgsql-hackers by date

	From	Date	Subject
Next Message	Heikki Linnakangas	2011-08-20 04:36:43	Re: the big picture for index-only scans
Previous Message	David Fetter	2011-08-20 02:14:32	Re: New copyright program