| From: | Andrew Dunstan <andrew(at)dunslane(dot)net> |
|---|---|
| To: | Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us> |
| Cc: | Robins Tharakan <tharakan(at)gmail(dot)com>, pgsql-hackers(at)postgresql(dot)org |
| Subject: | Re: buildfarm instance bichir stuck |
| Date: | 2021-04-07 20:27:33 |
| Message-ID: | 61048734-b2c7-4cd6-7731-29d6d688760f@dunslane.net |
| Views: | Whole Thread | Raw Message | Download mbox | Resend email |
| Thread: | |
| Lists: | pgsql-hackers |
On 4/7/21 4:02 PM, Tom Lane wrote:
> Andrew Dunstan <andrew(at)dunslane(dot)net> writes:
>> On 4/7/21 1:07 PM, Tom Lane wrote:
>>> I do use it on some of my flakier dinosaurs, and I've noticed that
>>> when it does kick in, the buildfarm run just stops dead and no report
>>> is sent to the BF server. That has advantages in not cluttering the
>>> BF status with run-failed-because-of-$weird_problem issues, but it
>>> doesn't help from the standpoint of noticing when your animal is stuck.
>>> Maybe it'd be better to change that behavior.
>> Yeah, I'll have a look. It's not simple for a bunch of reasons.
> On further thought, that doesn't seem like the place to fix it.
> I'd rather be able to ask the buildfarm server to send me nagmail
> if my animal hasn't sent a report in N days (where N had better
> be owner-configurable). This would catch not only animal-is-hung,
> but also other classes of problems like whole-machine-is-hung or
> you-broke-your-firewall-configuration-so-it-cant-contact-the-server.
> I've had issues of those sorts before ...
>
>
That already exists, and has for a long time. See the 'alerts' stanza of
your config file.
cheers
andrew
--
Andrew Dunstan
EDB: https://www.enterprisedb.com
| From | Date | Subject | |
|---|---|---|---|
| Next Message | Alvaro Herrera | 2021-04-07 20:28:50 | Re: multi-install PostgresNode fails with older postgres versions |
| Previous Message | Bruce Momjian | 2021-04-07 20:22:55 | Re: Feature improvement: can we add queryId for pg_catalog.pg_stat_activity view? |