Re: window function induces full table scan

From: Thomas Mayer <thomas(dot)mayer(at)student(dot)kit(dot)edu>
To: Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>, pgsql-performance <pgsql-performance(at)postgresql(dot)org>
Subject: Re: window function induces full table scan
Date: 2014-01-02 22:45:56
Message-ID: 52C5EC24.9060206@student.kit.edu
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-performance

You understood me correctly, Tom.

As you mention, the result would be correct in my case:
- The window function is performing a "PARTITION BY user_id".
- user_id is used for the WHERE condition.

I agree, that in general (PARTITION BY and WHERE don't use the same set
of attributes), incorrect results could occur when performing the WHERE
condition before performing the window function.

However, in this special case, PARTITION BY and WHERE use the same set
of attributes which safely allows some optimization.

In fact, this is getting complicated when multiple window functions with
different PARTITION BY's are used in one statement.

I think, performing the WHERE condition before performing the window
function would be safe if the WHERE condition attribute is element of
the PARTITION-BY-set of attributes of _every_ window function of the
statement.

To ensure correctness, WHERE condition attributes which are _not_
element of the PARTITION-BY-set of attributes of _every_ window function
of the statement need to be performed after performing the window function.

So, the optimizer could check if it's safe or not.

Regards,
Thomas

Am 02.01.2014 22:52, schrieb Tom Lane:
> Thomas Mayer <thomas(dot)mayer(at)student(dot)kit(dot)edu> writes:
>> When querying a view with a WHERE condition, postgresql normally is able
>> to perform an index scan which reduces time for evaluation dramatically.
>
>> However, if a window function is evaluated in the view, postgresql is
>> evaluating the window function before the WHERE condition is applied.
>> This induces a full table scan.
>
> You haven't exactly provided full details, but it looks like you are
> thinking that WHERE clauses applied above a window function should
> be pushed to below it. A moment's thought about the semantics should
> convince you that such an optimization would be incorrect: the window
> function would see fewer input rows than it should, and therefore would
> (in general) return the wrong values for the selected rows.
>
> It's possible that in the specific case you exhibit here, pushing down
> the clause wouldn't result in changes in the window function's output for
> the selected rows, but the optimizer doesn't have enough knowledge about
> window functions to determine that.
>
> regards, tom lane
> .
>

--
======================================
Thomas Mayer
Durlacher Allee 61
D-76131 Karlsruhe
Telefon: +49-721-2081661
Fax: +49-721-72380001
Mobil: +49-174-2152332
E-Mail: thomas(dot)mayer(at)student(dot)kit(dot)edu
=======================================

In response to

Browse pgsql-performance by date

  From Date Subject
Next Message Thomas Mayer 2014-01-02 23:12:55 Re: window function induces full table scan
Previous Message Tom Lane 2014-01-02 22:43:12 Re: window function induces full table scan