From: | "John Hansen" <john(at)geeknet(dot)com(dot)au> |
---|---|
To: | "Oleg Bartunov" <oleg(at)sai(dot)msu(dot)su> |
Cc: | "Greg Sabino Mullane" <greg(at)turnstep(dot)com>, <pgsql-www(at)postgresql(dot)org> |
Subject: | Re: Suggestion for improving Archives |
Date: | 2004-09-05 13:42:29 |
Message-ID: | 5066E5A966339E42AA04BA10BA706AE56192@rodrick.geeknet.com.au |
Views: | Raw Message | Whole Thread | Download mbox | Resend email |
Thread: | |
Lists: | pgsql-www |
>
> What code ? I've seen that last modified header and now it's gone.
> No stress on the archives, it's pure question of several lines of code
>
Yes, which is exactly what we wanted to avoid, more php code.
> it's not a portal page, it's just a message, why should it
> changed so often. I think I should teach our crawler to
> recognize if changes were cosmetic using fuzzy checksum.
>
No, but even something as simple as adding a new mailing list would then
cause you to recrawl the entire site.
I agree that the last-modified header is the best solution. (the value
of it being equal to the message date, that is)
... John
From | Date | Subject | |
---|---|---|---|
Next Message | Marc G. Fournier | 2004-09-05 14:16:14 | Re: Suggestion for improving Archives |
Previous Message | Oleg Bartunov | 2004-09-05 13:27:09 | Re: search.postgresql.org |