Re: INSERT ... ON CONFLICT DO UPDATE

From: Geoff Winkless <pgsqladmin(at)geoff(dot)dj>
To: Postgres General <pgsql-general(at)postgresql(dot)org>
Subject: Re: INSERT ... ON CONFLICT DO UPDATE
Date: 2015-07-20 14:58:38
Message-ID: CAEzk6fddbXi6nykQ0X+y2Q8Ac7VZK0+0e5r59mOetiPZj_2hag@mail.gmail.com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-general

On 20 July 2015 at 15:07, Alvaro Herrera <alvherre(at)2ndquadrant(dot)com> wrote:

> Not sure what type of indexes would be affected by that problem, but I
> don't think Postgres' btrees would be.
>

I admit it's not really my area.​

​Take it up with Drew Blas, I guess :)

https://blog.starkandwayne.com/2015/05/23/uuid-primary-keys-in-postgresql/
(see the addendum near the bottom).

My first stab at comprehending what he means:

​Imagine if you have around a thousand entries in your index. In a
sequential set, the btree could be (with each leaf containing 100 values)
499
299 799
100 200 300 400 500 600 700 800 900 ​

​In a (16-bit, say) random set, the btree could be
33208
21728 45220
927 15927 ​21729 29661 33209 34917 40121 45221 49826

Inserting a new value in a sequential key is always going to go into the
900 node. When that fills, ​you can add a new node and just balance the
parent branches (assuming postgres doesn't try to keep a proper balance
across the nodes too?).

However in the random tree a new value could be inserted into _any_ of the
leaf nodes, which means you either have to create more leaf nodes than you
require (in order to leave plenty of blank space, and even if you do that
you will probably end up with uneven fill, which means some searches will
be slower than others) or you end up having to refactor all of your nodes
every time you insert a value.

Geoff​

In response to

Browse pgsql-general by date

  From Date Subject
Next Message Jeff Janes 2015-07-20 17:18:24 Re: INSERT ... ON CONFLICT DO UPDATE
Previous Message Igor Neyman 2015-07-20 14:56:08 Re: INSERT ... ON CONFLICT DO UPDATE