From: | Raphael Salguero Aragón <raphael(dot)salguero(at)enterprisedb(dot)com> |
---|---|
To: | Mendbayar Alzakhgui <mendbayar(dot)alz(at)unitel(dot)mn> |
Cc: | "pgsql-admin(at)lists(dot)postgresql(dot)org" <pgsql-admin(at)lists(dot)postgresql(dot)org> |
Subject: | Re: Postgresql replication failed in Patroni |
Date: | 2025-02-07 07:21:50 |
Message-ID: | CAA2=wKb-XE+t7DpksUNwoiFYN8pxjQBEbS3ab72cffs4cLAfMg@mail.gmail.com |
Views: | Raw Message | Whole Thread | Download mbox | Resend email |
Thread: | |
Lists: | pgsql-admin |
Hi Mendbayar,
Am Fr., 7. Feb. 2025 um 07:04 Uhr schrieb Mendbayar Alzakhgui <
mendbayar(dot)alz(at)unitel(dot)mn>:
> Hello everybody,
> I need a urgent help on my Patroni managed postgres cluster,
>
> the main patroni managed leader postgres crushed and down, when we try to
> start the Postgresql it’s showing us this error log
>
> 2025-02-07 12:31:18 +08 [2354332]: [4-1] user=,db=,app=,client=LOG:
> listening on IPv4 address "ip_address", port 5432
>
> 2025-02-07 12:31:18 +08 [2354332]: [5-1] user=,db=,app=,client=LOG:
> listening on Unix socket "./.s.PGSQL.5432"
>
> 2025-02-07 12:31:18 +08 [2354337]: [1-1] user=,db=,app=,client=LOG:
> database system was shut down in recovery at 2025-02-07 11:56:50 +08
>
> 2025-02-07 12:31:18 +08 [2354337]: [2-1] user=,db=,app=,client=LOG:
> entering standby mode
>
> 2025-02-07 12:31:18 +08 [2354337]: [3-1] user=,db=,app=,client=FATAL:
> requested timeline 20 is not a child of this server's history
>
> 2025-02-07 12:31:18 +08 [2354337]: [4-1] user=,db=,app=,client=DETAIL:
> Latest checkpoint is at 71/4D8BB8C0 on timeline 19, but in the history of
> the requested timeline, the server forked off from that timeline at
> 71/4D793220.
>
> 2025-02-07 12:31:18 +08 [2354332]: [6-1] user=,db=,app=,client=LOG:
> startup process (PID 2354337) exited with exit code 1
>
> 2025-02-07 12:31:18 +08 [2354332]: [7-1] user=,db=,app=,client=LOG:
> aborting startup due to startup process failure
>
> 2025-02-07 12:31:18 +08 [2354332]: [8-1] user=,db=,app=,client=LOG:
> database system is shut down
>
>
> what should we check?, and is this because the leader node already deleted
> the wal it’s needed to start? And we were connected debezium to this node
> when we recover it will the debezium start automatically from the
> disconnected sessions? Please help me.
>
> You're right, the crashed DB is not able to recover due to a lag of
transactional information.
What is your DB size?
The easiest way is to stop Patroni on the crashed instance (systemctl stop
patroni), remove and recreate the data directory (also take care about
tablespace if they're in use).
Afterwards, you can restart the Patroni service on the crashed instance and
run a reinit from the current leader:
patronictl -c /etc/patroni.yml reinit your_cluster_name replica_node
That should do the trick :)
> Sincerely,
>
>
> * Mendbayar A. *| Database Administrator
>
> Information technology department
>
>
>
> +976 8611-2165
>
> mendbayar(dot)alz(at)unitel(dot)mn
>
> Central Tower, 11th floor
>
> www.unitel.mn
>
>
>
Best regards
Raphael
From | Date | Subject | |
---|---|---|---|
Next Message | daur exp | 2025-02-07 09:38:06 | etcd failed on master node |
Previous Message | Mendbayar Alzakhgui | 2025-02-07 06:04:10 | Postgresql replication failed in Patroni |