Re: Should we optimize the `ORDER BY random() LIMIT x` case? - Mailing list pgsql-hackers

From: Vik Fearing
Subject: Re: Should we optimize the `ORDER BY random() LIMIT x` case?
Msg-id: c724e28b-3888-4e8a-8187-a5802d226f2d@postgresfriends.org
In response to: Re: Should we optimize the `ORDER BY random() LIMIT x` case? (Tom Lane <tgl@sss.pgh.pa.us>)
List: pgsql-hackers
On 16/05/2025 15:01, Tom Lane wrote:
> Aleksander Alekseev <aleksander@timescale.com> writes:
>> If I'm right about the limitations of aggregate functions and SRFs
>> this leaves us the following options:
>> 1. Changing the constraints of aggregate functions or SRFs. However I
>> don't think we want to do it for such a single niche scenario.
>> 2. Custom syntax and a custom node.
>> 3. To give up
> Seems to me the obvious answer is to extend TABLESAMPLE (or at least, some
> of the tablesample methods) to allow it to work on a subquery.


Isn't this a job for <fetch first clause>?


Example:

SELECT ...
FROM ... JOIN ...
FETCH SAMPLE FIRST 10 ROWS ONLY


Then the nodeLimit could do some sort of reservoir sampling.


There are several enhancements to <fetch first clause> coming down the
pipe; this could be one of them.

-- 
Vik Fearing



