Home > mailing lists

Re: Indexes on expressions with multiple columns and operators - Mailing list pgsql-performance

From	Andrei Lepikhov
Subject	Re: Indexes on expressions with multiple columns and operators
Date	October 14, 2025 12:29:27
Msg-id	f129ab71-2684-49fb-ae43-3a82ff34728f@gmail.com Whole thread Raw
In response to	Re: Indexes on expressions with multiple columns and operators (Tom Lane <tgl@sss.pgh.pa.us>)
List	pgsql-performance

Tree view

On 13/10/2025 16:55, Tom Lane wrote:
> Andrei Lepikhov <lepihov@gmail.com> writes:
>> On 25/9/2025 12:41, Frédéric Yhuel wrote:
>>> So, on SQL Server, you can do this:
>>> CREATE STATISTICS FooStats ON foo (ackid, crit) WHERE crit = 'WARNING';
> 
>> Nice! Thanks for the report. I think the only reason why Postgres
>> doesn't have it yet is the computational cost.
> 
> I think it's more lack of round tuits.  If we had such an option for
> statistics objects, presumably we'd determine the applicability of a
> particular statistics object to a query the same way we do for partial
> indexes, namely try to prove the statistics' restriction condition
> from the query WHERE clauses.  I've not heard complaints about that
> being unduly expensive.
> 
> In the meantime, I believe the old-fashioned approach of creating
> a partial expression index and letting ANALYZE collect stats on that
> will serve, at least for simple statistics.I know at least two extensions (one of which is mine) that attempt to 
analyse query post-execution state, identify unsuccessful predictions on 
cardinality, number of groups, and work_mem, and fix these issues by 
creating MCV and distinct extended statistics.

Of course, without extended statistics on join clauses, their effect is 
highly limited, but we are preparing ;).

Many combinations of clauses may occur. Partial indexes can affect the 
whole system's performance in automatic mode. Additionally, I would 
personally like to play the same game as SQL Server already does - 
compute statistics in an efficient manner - during a Scan. The filter of 
such a scan may serve as a WHERE condition in the extended statistics.

-- 
regards, Andrei Lepikhov,
pgEdge

pgsql-performance by date:

From: Tom Lane
Date: 13 October 2025, 17:55:07
Subject: Re: Indexes on expressions with multiple columns and operators

From: Jonathan Reis
Date: 22 October 2025, 06:37:30
Subject: Performance implications of partitioning by UUIDv7 range in PostgreSQL v18

Re: Indexes on expressions with multiple columns and operators - Mailing list pgsql-performance

Previous

Next