Quick Links

Re: Date indexing

From:	Dima Tkach <dmitry(at)openratings(dot)com>
To:	Ian Cass <ian(dot)cass(at)mblox(dot)com>, pgsql-general(at)postgresql(dot)org
Subject:	Re: Date indexing
Date:	2002-04-23 12:56:31
Message-ID:	3CC559FF.1030209@openratings.com
Views:	Whole Thread \| Raw Message \| Download mbox \| Resend email
Thread:
Lists:	pgsql-general pgsql-sql

Ian Cass wrote:
> Hi,
>
> I've been trying to do date range index lookups. According to EXPLAIN, my
> sql is using the appropriate index, but the result takes forever to return
> (~10mins) even tho the dataset it's referencing is reasonably small (~5000
> lines). Similar queries on a similar sized dataset on the same database that
> are not referencing a date range, but a specific field (WHERE user_name =
> 'U12345' for example) take a matter of seconds to return.
>
> The index is on a timestamp field, and the only way I could find to get it
> to use this index was something like the following...
>
> AND (messages.client_id = '89' AND messages.originator like '%' AND
> messages.logtime >= timestamp '20/04/2002 00:00:00' AND messages.logtime <=
> timestamp '20/04/2002 23:59:59')
>
> Index is on logtime, client_id, originator
> logtime = timezone
> client id = int2
> originator = text
>

This won't be able to make much use of the compound index, because you
are not searching for exact match on the first column, and the other
columns are not in your criteria at all ... Thus your index is only as
good as being just on the timestamp alone.

10 minutes still sounds pretty excessive though, even for a seq. scan
on a 5000 rows table. Have you vacuum'ed it lately?
Try running 'vacuum full' on that table, it might help a lot, if you
were ding lots of updates to it.

> I'll only ever be referencing this data to a granularity of 1 day, so is
> there a way I can use a function to index this so that the date column in
> the index is text containing just DD/MM/YYYY?

Perhaps, you should use date instead of timestamp - that way you'd be
able to search for exact match, instead of a range...

> I did try to index something
> like..
>
> create index messages_200204_ix3 on messages_200204
> (to_char(logtime,'DD/MM/YYYY'), client_id, originator)
>
> ...but it didn't work. Gives me parse errors on '.
>

You can't do that - functional indexes can only be on a single column :-(
You could hack around that with something like this:

create function concat_fields (timestamp, int, text) returns text as
'select to_char ($1,'DD/MM/YYYY') || \':\' || $2 || \':\' || $3'
language 'sql' with (iscacahable);

And then,
create index ... on ... (concat_fields(logtime,client_id,originator));

BUT:

- There is a memory leak that will barf during the index creation if
your table is large (I have a patch for it, but Tom Lane thinks it is
not 'systemic' enough to put it in... I could send it to you if you
want to though)...

- You don't seem to be making use of the compound index anyway...
Perhaps, just having an index on timestamp alone will do better

- You'd have to modify your query to make it use the index like this:
select ... where
concat_fields (logtime,client_id,originator) like '20/04/2002:%:%'

Once again, this is only as good (actually, a little worse) as
just having the logtime as date, and scanning a single-column
index for an exact match.

And don't forget to run the vacuum - I think that this alone should
fix your problem (in general, you shouldn't need any indexes AT ALL
on a 5000 rows table).

I hope, it helps...

Dima

In response to

Date indexing at 2002-04-23 09:35:59 from Ian Cass

Responses

Re: Date indexing at 2002-04-23 14:07:49 from Ian Cass

Browse pgsql-general by date

	From	Date	Subject
Next Message	tony	2002-04-23 13:33:13	some pl/pgsql help please
Previous Message	Lincoln Yeoh	2002-04-23 11:13:50	Re: One particular large database application

Browse pgsql-sql by date

	From	Date	Subject
Next Message	Tom Lane	2002-04-23 13:56:38	Re: Date indexing
Previous Message	Ian Cass	2002-04-23 09:35:59	Date indexing