Re: Getting a random row - Mailing list pgsql-performance

From Scott Marlowe
Subject Re: Getting a random row
Date
Msg-id dcc563d10910140030r3e02d327q62924b02ffb1ec4d@mail.gmail.com
In response to Re: Getting a random row  (Pavel Stehule <pavel.stehule@gmail.com>)
Responses Re: Getting a random row
List pgsql-performance
On Wed, Oct 14, 2009 at 1:20 AM, Pavel Stehule <pavel.stehule@gmail.com> wrote:
> 2009/10/14 Thom Brown <thombrown@gmail.com>:
>> 2009/10/14 Scott Marlowe <scott.marlowe@gmail.com>:
>>>
>>> If what you're trying to do is emulate a real world app which randomly
>>> grabs rows, then you want to set up something ahead of time that has a
>>> pseudo random order and not rely on using anything like order by
>>> random() limit 1 or anything like that.  Easiest way is to do
>>> something like:
>>>
>>> select id into randomizer from maintable order by random();
>>>
>>> then use a cursor to fetch from the table to get "random" rows from
>>> the real table.
>>>
>>>
>>
>> Why not just do something like:
>>
>> SELECT thisfield, thatfield
>> FROM my_table
>> WHERE thisfield IS NOT NULL
>> ORDER BY RANDOM()
>> LIMIT 1;
>>
>
> This works well on small tables. On large tables this query is extremely slow.

Exactly.  If you're running that query over and over, your "performance
test" is really measuring how well pgsql can run that very query. :)
Anything else you do is likely to be noise by comparison.
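
For reference, here is a minimal sketch of the precomputed-order approach quoted above. It assumes maintable has an integer id column that is unique and indexed. The names rand_cur and get_row are made up for illustration, and the id 42 is only a placeholder for whatever FETCH returns.

```sql
-- One-time setup: store the ids in a shuffled order.
-- SELECT ... INTO creates the table "randomizer".
SELECT id INTO randomizer FROM maintable ORDER BY random();

-- Hypothetical helper: look up one row of the real table by id.
-- Assumes maintable.id is an integer.
PREPARE get_row(int) AS
    SELECT * FROM maintable WHERE id = $1;

-- During the test: walk the shuffled ids with a cursor.
-- A plain scan of a freshly built table usually returns rows in the
-- order they were inserted, but that order is not guaranteed; for
-- picking "random" rows this does not matter.
BEGIN;
DECLARE rand_cur CURSOR FOR
    SELECT id FROM randomizer;

FETCH 1 FROM rand_cur;   -- returns the next pseudo-random id
EXECUTE get_row(42);     -- placeholder: pass the id that FETCH returned

CLOSE rand_cur;
COMMIT;
```

The sort by random() is paid once during setup, so each fetch in the test is a cheap cursor step plus a lookup by id rather than a full sort of the table.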
