Re: Text Search zero padding

From: Oleg Bartunov <oleg(at)sai(dot)msu(dot)su>
To: Richard Greenwood <richard(dot)greenwood(at)gmail(dot)com>
Cc: pgsql-general(at)postgresql(dot)org
Subject: Re: Text Search zero padding
Date: 2008-02-29 08:06:40
Message-ID: Pine.LNX.4.64.0802291105130.31180@sn.sai.msu.ru
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-general

On Thu, 28 Feb 2008, Richard Greenwood wrote:

> I am using text search across multiple columns. Two of the columns
> have values that have zero padding - sort of. The values look like
> R0001234 (1 char followed by 7 digits, zero padded). Users are
> accustom to searching with and without the zero padding (entering
> R0001234 or R1234 should return identical results). This is easy to
> accommodate when parsing user input for a single column, but text
> searching across multiple columns it is harder determine if a
> char/digit group should be padded.
>
> So far my best idea is to create a tsvector column containing both
> padded and non-padded versions of the value. i.e. put both R1234 and
> R0001234 into the tsvector column. This seems pretty brute force, and
> I am pretty new to text search, so I'd welcome any suggestions.

create your dictionary, which index R0001234 as R0001234 and R1234
Seems, dict_regex is your friend.
http://vo.astronet.ru/arxiv/dict_regex.html

>
> Thanks,
> Rich
>
>

Regards,
Oleg
_____________________________________________________________
Oleg Bartunov, Research Scientist, Head of AstroNet (www.astronet.ru)
Sternberg Astronomical Institute, Moscow University, Russia
Internet: oleg(at)sai(dot)msu(dot)su, http://www.sai.msu.su/~megera/
phone: +007(495)939-16-83, +007(495)939-23-83

In response to

Responses

Browse pgsql-general by date

  From Date Subject
Next Message Gurjeet Singh 2008-02-29 09:23:05 Re: LIMIT Question
Previous Message Scott Marlowe 2008-02-29 07:25:13 Re: WAL Log Size