From: | Albe Laurenz <laurenz(dot)albe(at)wien(dot)gv(dot)at> |
---|---|
To: | "David G Johnston *EXTERN*" <david(dot)g(dot)johnston(at)gmail(dot)com>, "pgsql-general(at)postgresql(dot)org" <pgsql-general(at)postgresql(dot)org> |
Subject: | Re: Is "WITH () UPDATE" Thread Safe ? |
Date: | 2014-12-03 08:54:28 |
Message-ID: | A737B7A37273E048B164557ADEF4A58B17DA7437@ntex2010a.host.magwien.gv.at |
Views: | Raw Message | Whole Thread | Download mbox | Resend email |
Thread: | |
Lists: | pgsql-general |
David G Johnston wrote:
> Albe Laurenz *EXTERN* wrote
>> Paul GOERGLER wrote:
>>> I have a lot of tickets, i need to take a batch of tickets and process
>>> them.
>>> So the process is :
>>> SELECT ONLY 100 tickets
>>> PROCESS ticket
>>> MARK THEM AS « done »
>>>
>>> I’m selecting the tickets with :
>>>
>>> WITH t0 AS (
>>> SELECT t.id,
[...]
>>> FROM tickets AS t
[...]
>>> FOR UPDATE OF t
>>> )
>>> UPDATE tickets AS t1
[...]
>>> FROM t0
[...]
>>>
>>>
>>> I wonder if this query is thread safe, Can a ticket be updated between
>>> the SELECT part (t0) and the
>>> UPDATE part ?
>>> If this query is not « thread safe » how can i do this ?
>> There is no race condition in your query because you used SELECT ... FOR
>> UPDATE.
>>
>> That causes the rows found in the WITH clause to be locked against
>> concurrent modification.
>>
>> So you should be fine.
> I was under the impression that the presence of FOR UPDATE in this situation
> was unnecessary since the execution of the update occurs in the same
> statement as the select and thus the relevant data will be locked at
> execution.
>
> The FOR UPDATE is for situations where other code needs to intervene between
> the select and a subsequent update.
>
> The documentation is silent on this distinction, but...
>
> Note that the use of a CTE in this example is a convenience and that the top
> level command is still UPDATE, not SELECT.
>
> It may be worthwhile to update the UPDATE page's WITH commentary to note
> that (if correct) there is no need for a FOR UPDATE clause on the contained
> subquery (yes, that was quite a mouthful...)
Well, the SELECT and the UPDATE are part of the same statement, so they
run in the same transaction and with the same snapshot.
But it could happen that (for example) two of the above queries are running
concurrently. Then both could find the same row in the SELECT and modify
it (because of the exclusive row lock, one of them will have to wait until
the other is done). Then this ticket will be processed twice.
With FOR UPDATE, no two SELECT statements can find the same row, because
by the time the lock taken by the first SELECT is released, the status of the
ticket will have changed and the second SELECT will not find it.
Yours,
Laurenz Albe
From | Date | Subject | |
---|---|---|---|
Next Message | Guyren Howe | 2014-12-03 09:15:50 | I did some testing of GIST/GIN vs BTree indexing… |
Previous Message | Tim Schäfer | 2014-12-03 08:36:29 | Re: Auto vacuum not running -- Could not bind socket for statistics collector |