regexp_replace and search/replace values stored in table

From: Leif Biberg Kristensen <leif(at)solumslekt(dot)org>
To: pgsql-sql(at)postgresql(dot)org
Subject: regexp_replace and search/replace values stored in table
Date: 2010-04-27 11:43:48
Message-ID: 201004271343.48070.leif@solumslekt.org
Lists: pgsql-sql

I've got a system for entering and storing a lot of standard hyperlinks in a
compact form, which are then expanded at run time like this:

CREATE OR REPLACE FUNCTION _my_expand(TEXT) RETURNS TEXT AS $$
-- private func, expand various compact links
DECLARE
    str TEXT := $1;
BEGIN
    -- Scanned church books [kb=book reference|image reference|link text]
    str := REGEXP_REPLACE(str, E'\\[kb=(.+?)\\|(.+?)\\|(.+?)\\]',
        E'<a href="//www.arkivverket.no/URN:kb_read?idx_kildeid=\\1&amp;uid=ny&amp;idx_side=\\2" title="Lenke til bilde av kirkebokside">\\3</a>', 'g');
    -- Scanned probate registers [sk=protocol|image reference|link text]
    str := REGEXP_REPLACE(str, E'\\[sk=(.+?)\\|(.+?)\\|(.+?)\\]',
        E'<a href="//www.arkivverket.no/URN:sk_read/\\1/\\2" title="Lenke til bilde av skifteprotokollside">\\3</a>', 'g');
    -- Scanned deed ("pantebok") registers [tl=protocol|image reference|link text]
    str := REGEXP_REPLACE(str, E'\\[tl=(.+?)\\|(.+?)\\|(.+?)\\]',
        E'<a href="//www.arkivverket.no/URN:tl_read?idx_id=\\1&amp;uid=ny&amp;idx_side=\\2" title="Lenke til bilde av pantebokside">\\3</a>', 'g');
    RETURN str;
END
$$ LANGUAGE plpgsql STABLE;

In keeping with the slogan "minimize code, maximize data", I feel that these
strings should be stored in a table:

CREATE TABLE short_links (
link_type CHAR(2) PRIMARY KEY,
short_link TEXT,
long_link TEXT,
description TEXT
);
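
With such a table in place, the expansion function could in principle shrink to a loop over its rows. The following is only a sketch: the function name _my_expand_from_table is hypothetical, and it assumes that short_link and long_link are stored in exactly the form that regexp_replace expects as pattern and replacement.

```sql
-- Sketch only: _my_expand_from_table is a hypothetical name.
-- Assumes short_links holds ready-to-use patterns and replacements.
CREATE OR REPLACE FUNCTION _my_expand_from_table(TEXT) RETURNS TEXT AS $$
DECLARE
    str TEXT := $1;
    rec RECORD;
BEGIN
    -- Apply every stored search/replace pair in turn, in a fixed order.
    FOR rec IN
        SELECT short_link, long_link FROM short_links ORDER BY link_type
    LOOP
        str := REGEXP_REPLACE(str, rec.short_link, rec.long_link, 'g');
    END LOOP;
    RETURN str;
END
$$ LANGUAGE plpgsql STABLE;
```

Adding a new link type would then mean inserting a row rather than editing the function.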

It appears that I have to double the number of backslashes when I enter the
data:

INSERT INTO short_links (link_type, short_link, long_link, description) VALUES
('sk',
 E'\\\\[sk=(.+?)\\\\|(.+?)\\\\|(.+?)\\\\]',
 E'<a href="//www.arkivverket.no/URN:sk_read/\\\\1/\\\\2" title="Lenke til bilde av skifteprotokollside">\\\\3</a>',
 'Scanned probate registers [sk=protocol|image reference|link text]');

pgslekt=> select * from short_links;

 link_type |           short_link           |                                                 long_link                                                 |                            description
-----------+--------------------------------+-----------------------------------------------------------------------------------------------------------+-------------------------------------------------------------------
 sk        | \\[sk=(.+?)\\|(.+?)\\|(.+?)\\] | <a href="//www.arkivverket.no/URN:sk_read/\\1/\\2" title="Lenke til bilde av skifteprotokollside">\\3</a> | Scanned probate registers [sk=protocol|image reference|link text]
(1 row)

So far, so good. But when I try to do the actual expansion, I'm stumped.

pgslekt=> select regexp_replace(
    (select source_text from sources where source_id=23091),
    (select quote_literal(short_link) from short_links where link_type = 'sk'),
    (select quote_literal(long_link) from short_links where link_type = 'sk'),
    'g');
 regexp_replace
------------------------------------------------------------------------------------------------
 [sk=25658|67|side 66a]. Vabakken under Klyve vestre i Solum 07.07.1784: [p=6947|Isach Jonsen].
(1 row)
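
One way to narrow this down might be to look at what each step actually hands to regexp_replace. The queries below are a diagnostic sketch rather than a confirmed fix: the column aliases are made up for illustration, and the last query simply assumes it is worth comparing the call without quote_literal.

```sql
-- What an escape-string literal stores: each typed pair of backslashes
-- collapses to a single backslash.
SELECT E'\\[sk='   AS two_typed,    -- stored as \[sk=
       E'\\\\[sk=' AS four_typed;   -- stored as \\[sk=

-- What the query above passes as pattern and replacement arguments.
SELECT quote_literal(short_link) AS pattern_arg,
       quote_literal(long_link)  AS replacement_arg
FROM short_links
WHERE link_type = 'sk';

-- The same call with the column values passed exactly as they are stored.
SELECT regexp_replace(
    (SELECT source_text FROM sources WHERE source_id = 23091),
    (SELECT short_link FROM short_links WHERE link_type = 'sk'),
    (SELECT long_link FROM short_links WHERE link_type = 'sk'),
    'g');
```

Comparing the three results should show whether the extra backslashes, the added quoting, or both keep the pattern from matching.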

What am I missing?

regards,
--
Leif Biberg Kristensen
http://solumslekt.org/
