Re: Linux OOM killer

From: Ariel Tejera <artejera(at)gmail(dot)com>
To: Tomas Vondra <tomas(at)vondra(dot)me>
Cc: Laurenz Albe <laurenz(dot)albe(at)cybertec(dot)at>, pgsql-bugs(at)lists(dot)postgresql(dot)org
Subject: Re: Linux OOM killer
Date: 2024-10-02 15:55:15
Message-ID: CAEO-Fc45XMXb73cTHsxpg5CJoYvERzQXYXfG5D2G_dxNDBsosg@mail.gmail.com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-bugs

Hi,

Right .. I'll try to upgrade versions and then retry, as you recommend,
unfortunately we're short of hands at the moment.
For us the issue is in practice solved with memoizing=off
Yours,
Ariel Tejera

On Wed, Oct 2, 2024 at 2:22 AM Tomas Vondra <tomas(at)vondra(dot)me> wrote:

>
>
> On 10/2/24 06:16, Laurenz Albe wrote:
> > On Tue, 2024-10-01 at 12:17 -0600, Ariel Tejera wrote:
> >> Hi. I hope this message finds you well.
> >>
> >> The issue is that one of our Postgres servers hit a bug and was killed
> by linux OOM, as
> >> shown in the lines below, showing two events:
> >>
> >> We were able to fix this problem adjusting the server configuration
> with:
> >> enable_memoize = off
> >>
> >> Our Postgres version is 14.5
> >> Linux AWS linux2 (with diverse concurrent workloads)
> >> Ram 32GB
> >> Database size 200 GB
> >>
> >> This is the first reproducible bug I've found in 20 years using
> postgres, heavily (!)
> >>
> >> As this bug is associated with large databases, it is impractical to
> offer a reproducible example for it.
> >> We hope, however, that this report will be of some use for the Postgres
> project.
> >
> > First of all, update to 14.latest. I find at least one bug fixed in
> this area:
> > https://postgr.es/c/e4b95b9b02, discussed in
> https://postgr.es/m/83281eed63c74e4f940317186372abfd%40cft.ru
> >
> > Then, disable memory overcommit, so that you don't get killed by the OOM
> killer.
> > Then you will get an "out of memory" error and a memory context dump in
> the log.
> > We'd need to see that to figure out if it really is a bug.
> >
>
> FWIW I don't think anyone can investigate this without more information.
> In particular, we'd need the query plan triggering the issue, with info
> about the schema (which data types, ...) and data sizes. And the memory
> context information - either logged during OOM, or collected using gdb.
>
> But yeah, definitely update to newest 14.x first. Chances are this is
> already fixed.
>
>
> regards
>
> --
> Tomas Vondra
>
>

In response to

Responses

Browse pgsql-bugs by date

  From Date Subject
Next Message Cameron Vogt 2024-10-02 17:16:45 PostgreSQL 17 Segmentation Fault
Previous Message Andrei Lepikhov 2024-10-02 09:40:01 Re: BUG #18643: EXPLAIN estimated rows mismatch