Re: BUG #17892: Bug primary key

From: Alvaro Herrera <alvherre(at)alvh(dot)no-ip(dot)org>
To: comanicibogdy(at)yahoo(dot)com, pgsql-bugs(at)lists(dot)postgresql(dot)org
Subject: Re: BUG #17892: Bug primary key
Date: 2023-04-13 15:18:08
Message-ID: 20230413151808.dlrbdvedxkmtflp2@alvherre.pgsql
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-bugs

On 2023-Apr-12, PG Bug reporting form wrote:

> The following bug has been logged on the website:
>
> Bug reference: 17892
> Logged by: Comanici Bogdan
> Email address: comanicibogdy(at)yahoo(dot)com
> PostgreSQL version: 13.9
> Operating system: Debian 11
> Description:
>
> We have a database with over 3 milions apps in it
> Something happend we just found duplicates with same id, but we have
> constraints : PRIMARY KEY on id
>
> How it's happened?
> ID(PRIMARY KEY) SLUG(UNIQUE KEY)
> 29181513 nottiled 0 Home-and-Hobby Recreation mirwanda-center false 0 false false false true false 2019-07-15
> 11:53:38.000 2023-04-10
> 21:44:34.000 0.0 0.0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 NotTiled 0 0.00 1.8.6 Freeware 0.0 2022-09-19 adsense true 0 2 false
> 29181513 nottiled 0 Home-and-Hobby Recreation mirwanda-center false 0 false false false true false 2019-07-15
> 11:53:38.000 2023-03-25
> 14:32:55.000 0.0 0.0 0 26 0 1 0 26 0 0 5 0 0 0 5 0 0 NotTiled 22020096 0.00 1.7.0a Freeware 0.0 2021-09-03 adsense true 20230410 2 false
>
> We tried manually to insert a new row with same value, get error for unique
> key slug

It sounds like both are different row versions of the same row, given
that they have the same ID and SLUG. To me it sounds like the second
row shown here existed, then an UPDATE came along and created the other
one (changing for example the second timestamp column from
'2023-03-25 14:32:55.000' to '2023-04-10 21:44:34.000', among other
things); and then the first one should have been deleted but wasn't.

This could have happened, for example, if VACUUM later processed the
table, removed a few dead entries at the end of the table, but while it
was waiting to truncate the last few pages empty of any live tuple, a
crash came along; then, after recovery, pages which should have been
truncated away, came back to life.

You say you have more cases of the same thing; maybe check the "ctid"
values for the duplicates; if you can see a bunch of them nearby, in a
page close to the physical end of the table, that bug may explain the
issue.

You have a column which in the second copy of the row has the value
20230410, which appears to match another timestamp column in the other
copy of the row. That's a curious coincidence; I wonder if that column
indicate "recheck this app by this time" or something like that. Not
that it matters ...

--
Álvaro Herrera 48°01'N 7°57'E — https://www.EnterpriseDB.com/
"This is what I like so much about PostgreSQL. Most of the surprises
are of the "oh wow! That's cool" Not the "oh shit!" kind. :)"
Scott Marlowe, http://archives.postgresql.org/pgsql-admin/2008-10/msg00152.php

In response to

Browse pgsql-bugs by date

  From Date Subject
Next Message fjz22 2023-04-13 16:21:19 1. PostgreSQL Server Subprocess Went down at function 'pg_detoast_datum_packed'
Previous Message Wetmore, Matthew (CTR) 2023-04-13 14:57:18 Error while installing Postgresql