Thread: How to implement oracle like rownum(function or seudocolumn) ?

How to implement oracle like rownum(function or seudocolumn) ?

From
Juan Manuel Diaz Lara
Date:
I need a rownum column, like Oracle. I have searched the mailing lists and I don't see a satisfactory solution, so I was wondering write a UDF to implement it, the requirements are:
 
1. Need to preserve state between calls to the function (the rowcount). Maybe using a MemoryContext with a lifespan equal to the executing query.
2. It should be called every time a new row is produced.
3. And more important, need to be called in the right place when called from subquerys:
 
Imagine one to man relationship between table A and tabla B. Then next query
 
SELECT s.id, s.rownum
FROM (SELECT a.id, rownum() AS rownum
            FROM a) s,
           b
WHERE a.id = b.id
 
In this case, the rownum function should behave as if it was an aggregated function, where the subquery is evaluated first and the results joined with table b. I think that a UDF could be evaluated last by the planner (when producing the final resultset) so it won't give the intended result.
 
I looking for a general solution, Any ideas? .
 
Thanks.
 
 
 


Atte.

Juan Manuel Díaz Lara


New Yahoo! Messenger with Voice. Call regular phones from your PC and save big.

Re: How to implement oracle like rownum(function or seudocolumn) ?

From
Tom Lane
Date:
Juan Manuel Diaz Lara <jmdiazlr@yahoo.com> writes:
> I need a rownum column, like Oracle. I have searched the mailing lists and I don't see a satisfactory solution, so I
waswondering write a UDF to implement it, the requirements are:
 

Try keeping a counter in fcinfo->flinfo->fn_extra.

>   3. And more important, need to be called in the right place when called from subquerys:

Don't expect miracles in this department.  The planner will evaluate the
function where it sees fit...
        regards, tom lane


Re: How to implement oracle like rownum(function or seudocolumn) ?

From
Michael Fuhr
Date:
On Sat, Apr 08, 2006 at 12:46:06PM -0400, Tom Lane wrote:
> Juan Manuel Diaz Lara <jmdiazlr@yahoo.com> writes:
> > I need a rownum column, like Oracle. I have searched the mailing lists
> > and I don't see a satisfactory solution, so I was wondering write a
> > UDF to implement it, the requirements are:
> 
> Try keeping a counter in fcinfo->flinfo->fn_extra.

Is this close to being correct?

Datum
rownum(PG_FUNCTION_ARGS)
{   int64  *row_counter;
   if (fcinfo->flinfo->fn_extra == NULL) {       row_counter = (int64 *)MemoryContextAlloc(fcinfo->flinfo->fn_mcxt,
                                           sizeof(int64));       *row_counter = 0;       fcinfo->flinfo->fn_extra =
row_counter;  }
 
   row_counter = fcinfo->flinfo->fn_extra;
   PG_RETURN_INT64(++(*row_counter));
}

> >   3. And more important, need to be called in the right place when
>  called from subquerys:
> 
> Don't expect miracles in this department.  The planner will evaluate the
> function where it sees fit...

Would OFFSET 0 be the workaround in this case?

SELECT rownum(), *
FROM (SELECT rownum() AS f_rownum, id AS f_id FROM foo) AS f,    (SELECT rownum() AS b_rownum, id AS b_id FROM bar) AS
b;rownum| f_rownum | f_id  | b_rownum | b_id  
 
--------+----------+-------+----------+-------     1 |        1 | foo-1 |        1 | bar-1     2 |        2 | foo-1 |
    2 | bar-2     3 |        3 | foo-2 |        3 | bar-1     4 |        4 | foo-2 |        4 | bar-2     5 |        5
|foo-3 |        5 | bar-1     6 |        6 | foo-3 |        6 | bar-2
 
(6 rows)

SELECT rownum(), *
FROM (SELECT rownum() AS f_rownum, id AS f_id FROM foo OFFSET 0) AS f,    (SELECT rownum() AS b_rownum, id AS b_id FROM
barOFFSET 0) AS b;rownum | f_rownum | f_id  | b_rownum | b_id  
 
--------+----------+-------+----------+-------     1 |        1 | foo-1 |        1 | bar-1     2 |        1 | foo-1 |
    2 | bar-2     3 |        2 | foo-2 |        1 | bar-1     4 |        2 | foo-2 |        2 | bar-2     5 |        3
|foo-3 |        1 | bar-1     6 |        3 | foo-3 |        2 | bar-2
 
(6 rows)

-- 
Michael Fuhr


Re: How to implement oracle like rownum(function or seudocolumn)

From
Jan Wieck
Date:
Someone correct me if I'm wrong, but I was allways under the impression 
that Oracle's ROWNUM is a thing attached to a row in the final result 
set, whatever (possibly random) order that happens to have. Now a) this 
is something that IMHO belongs into the client or stored procedure code, 
b) if I am right, the code below will break as soon as an ORDER BY is 
added to the query and most importantly c) if a) cannot do the job, it 
indicates that the database schema or business process definition lacks 
some key/referential definition and is in need of a fix.

My humble guess is that c) is also the reason why the ANSI didn't find a 
ROWNUM desirable.


Jan


On 4/8/2006 1:26 PM, Michael Fuhr wrote:
> On Sat, Apr 08, 2006 at 12:46:06PM -0400, Tom Lane wrote:
>> Juan Manuel Diaz Lara <jmdiazlr@yahoo.com> writes:
>> > I need a rownum column, like Oracle. I have searched the mailing lists
>> > and I don't see a satisfactory solution, so I was wondering write a
>> > UDF to implement it, the requirements are:
>> 
>> Try keeping a counter in fcinfo->flinfo->fn_extra.
> 
> Is this close to being correct?
> 
> Datum
> rownum(PG_FUNCTION_ARGS)
> {
>     int64  *row_counter;
> 
>     if (fcinfo->flinfo->fn_extra == NULL) {
>         row_counter = (int64 *)MemoryContextAlloc(fcinfo->flinfo->fn_mcxt,
>                                                   sizeof(int64));
>         *row_counter = 0;
>         fcinfo->flinfo->fn_extra = row_counter;
>     }
> 
>     row_counter = fcinfo->flinfo->fn_extra;
> 
>     PG_RETURN_INT64(++(*row_counter));
> }
> 
>> >   3. And more important, need to be called in the right place when
>>  called from subquerys:
>> 
>> Don't expect miracles in this department.  The planner will evaluate the
>> function where it sees fit...
> 
> Would OFFSET 0 be the workaround in this case?
> 
> SELECT rownum(), *
> FROM (SELECT rownum() AS f_rownum, id AS f_id FROM foo) AS f,
>      (SELECT rownum() AS b_rownum, id AS b_id FROM bar) AS b;
>  rownum | f_rownum | f_id  | b_rownum | b_id  
> --------+----------+-------+----------+-------
>       1 |        1 | foo-1 |        1 | bar-1
>       2 |        2 | foo-1 |        2 | bar-2
>       3 |        3 | foo-2 |        3 | bar-1
>       4 |        4 | foo-2 |        4 | bar-2
>       5 |        5 | foo-3 |        5 | bar-1
>       6 |        6 | foo-3 |        6 | bar-2
> (6 rows)
> 
> SELECT rownum(), *
> FROM (SELECT rownum() AS f_rownum, id AS f_id FROM foo OFFSET 0) AS f,
>      (SELECT rownum() AS b_rownum, id AS b_id FROM bar OFFSET 0) AS b;
>  rownum | f_rownum | f_id  | b_rownum | b_id  
> --------+----------+-------+----------+-------
>       1 |        1 | foo-1 |        1 | bar-1
>       2 |        1 | foo-1 |        2 | bar-2
>       3 |        2 | foo-2 |        1 | bar-1
>       4 |        2 | foo-2 |        2 | bar-2
>       5 |        3 | foo-3 |        1 | bar-1
>       6 |        3 | foo-3 |        2 | bar-2
> (6 rows)
> 


-- 
#======================================================================#
# It's easier to get forgiveness for being wrong than for being right. #
# Let's break this rule - forgive me.                                  #
#================================================== JanWieck@Yahoo.com #


Re: How to implement oracle like rownum(function or seudocolumn)

From
"Jonah H. Harris"
Date:
On 4/8/06, Jan Wieck <JanWieck@yahoo.com> wrote:
> Someone correct me if I'm wrong, but I was allways under the impression
> that Oracle's ROWNUM is a thing attached to a row in the final result
> set, whatever (possibly random) order that happens to have. Now a) this
> is something that IMHO belongs into the client or stored procedure code,
> b) if I am right, the code below will break as soon as an ORDER BY is
> added to the query and most importantly c) if a) cannot do the job, it
> indicates that the database schema or business process definition lacks
> some key/referential definition and is in need of a fix.

Yes, a rownum is assigned at fetch time.

An example is the following PostgreSQL query:

SELECT id FROM sometable ORDER BY id LIMIT 5;

In Oracle-land is written as:

SELECT id FROM (SELECT id FROM sometable ORDER BY id) WHERE rownum <= 5;

> My humble guess is that c) is also the reason why the ANSI didn't find a
> ROWNUM desirable.

I believe this is a good assumption.


--
Jonah H. Harris, Database Internals Architect
EnterpriseDB Corporation
732.331.1324


Re: How to implement oracle like rownum(function or seudocolumn)

From
Tom Lane
Date:
Jan Wieck <JanWieck@Yahoo.com> writes:
> My humble guess is that c) is also the reason why the ANSI didn't find a 
> ROWNUM desirable.

I've never understood what the conceptual model is for Oracle's rownum.
Where along the SQL operational pipeline (FROM / WHERE / GROUP BY /
aggregate / compute output columns / ORDER BY) is it supposed to be
computed?  To be useful for the often-requested purpose of nicely
labeling output with line numbers, it'd have to be assigned
post-ORDER-BY, but then it doesn't make any sense at all to use it in
WHERE, nor in sub-selects.

A function implemented as per Michael's example would not give the
results that I think people would expect for
SELECT rownum(), * FROM foo ORDER BY whatever;

unless the planner chances to do the ordering with an indexscan.
If it does it with a sort step then the rownums will be computed before
sorting :-(
        regards, tom lane


Re: How to implement oracle like rownum(function or seudocolumn)

From
"Jonah H. Harris"
Date:
On 4/8/06, Tom Lane <tgl@sss.pgh.pa.us> wrote:
> A function implemented as per Michael's example would not give the
> results that I think people would expect for
>
>         SELECT rownum(), * FROM foo ORDER BY whatever;
>

Yep, the query would have to be rewritten similar to Oracle's:

SELECT rownum(), * FROM (SELECT * FROM foo ORDER BY whatever);

IIRC, processing-wise, rownum and order-by processing is handled as follows:

SELECT id, whatever FROM foo WHERE rownum <= 10 ORDER BY id;

is the same as PostgreSQL's

SELECT id, whatever FROM (SELECT id, whatever FROM foo LIMIT 10) ORDER BY id;

--
Jonah H. Harris, Database Internals Architect
EnterpriseDB Corporation
732.331.1324


Re: How to implement oracle like rownum(function or seudocolumn)

From
David Fetter
Date:
On Sat, Apr 08, 2006 at 02:02:53PM -0400, Jan Wieck wrote:
> Someone correct me if I'm wrong, but I was allways under the impression 
> that Oracle's ROWNUM is a thing attached to a row in the final result 
> set, whatever (possibly random) order that happens to have. Now a) this 
> is something that IMHO belongs into the client or stored procedure code, 
> b) if I am right, the code below will break as soon as an ORDER BY is 
> added to the query and most importantly c) if a) cannot do the job, it 
> indicates that the database schema or business process definition lacks 
> some key/referential definition and is in need of a fix.
> 
> My humble guess is that c) is also the reason why the ANSI didn't find a 
> ROWNUM desirable.

Sadly, ANSI did just that.

http://troels.arvin.dk/db/rdbms/#select-limit
http://troels.arvin.dk/db/rdbms/#select-top-n
http://troels.arvin.dk/db/rdbms/#select-limit-offset

Cheers,
D
-- 
David Fetter <david@fetter.org> http://fetter.org/
phone: +1 415 235 3778        AIM: dfetter666                             Skype: davidfetter

Remember to vote!


Re: How to implement oracle like rownum(function or seudocolumn)

From
"Jonah H. Harris"
Date:
On 4/8/06, David Fetter <david@fetter.org> wrote:
> Sadly, ANSI did just that.

Thanks for pointing that out... I'd been using LIMIT/OFFSET for so
long, I totally forgot about the standard :(


--
Jonah H. Harris, Database Internals Architect
EnterpriseDB Corporation
732.331.1324


Re: How to implement oracle like rownum(function or seudocolumn)

From
Christopher Kings-Lynne
Date:
> I need a rownum column, like Oracle. I have searched the mailing lists 
> and I don't see a satisfactory solution, so I was wondering write a UDF 
> to implement it, the requirements are:

+1

I would _love_ to see rownums in PostgreSQL :)

Chris


Re: How to implement oracle like rownum(function or seudocolumn)

From
"Jim C. Nasby"
Date:
On Sat, Apr 08, 2006 at 03:04:40PM -0400, Tom Lane wrote:
> Jan Wieck <JanWieck@Yahoo.com> writes:
> > My humble guess is that c) is also the reason why the ANSI didn't find a 
> > ROWNUM desirable.
> 
> I've never understood what the conceptual model is for Oracle's rownum.
> Where along the SQL operational pipeline (FROM / WHERE / GROUP BY /
> aggregate / compute output columns / ORDER BY) is it supposed to be
> computed?  To be useful for the often-requested purpose of nicely
> labeling output with line numbers, it'd have to be assigned
> post-ORDER-BY, but then it doesn't make any sense at all to use it in
> WHERE, nor in sub-selects.

AFAIK rownum() is Oracle's solution to doing LIMIT ... OFFSET from
before those were ANSI. rownum() is applied as rows are leaving the
relevant node, which means you can't use rownum() in any part of a
SELECT statement other than the SELECT clause (you can't even use it in
a HAVING clause afaik, though I would think you should be able to). So,
if you want to actually do anything useful with rownum(), you have to
use it in a subquery and then operate at a higher level:

SELECT * FROM (SELECT rownum() AS row_number, * FROM table) z ORDER BY
row_number;

> A function implemented as per Michael's example would not give the
> results that I think people would expect for
> 
>     SELECT rownum(), * FROM foo ORDER BY whatever;
> 
> unless the planner chances to do the ordering with an indexscan.
> If it does it with a sort step then the rownums will be computed before
> sorting :-(

I think you're right.

If people are that hot-to-trot about having Oracle compatable rownum()
in PostgreSQL, perhaps EnterpriseDB has some code they could share.
Though I think it'd be better to understand what people actually want
this info for. Personally I think having a rank function (or a complete
suite of analytic functions) would be far more useful.
-- 
Jim C. Nasby, Sr. Engineering Consultant      jnasby@pervasive.com
Pervasive Software      http://pervasive.com    work: 512-231-6117
vcard: http://jim.nasby.net/pervasive.vcf       cell: 512-569-9461


Re: How to implement oracle like rownum(function or seudocolumn)

From
"Nicolas Barbier"
Date:
2006/4/8, Tom Lane <tgl@sss.pgh.pa.us>:

> I've never understood what the conceptual model is for Oracle's rownum.
> Where along the SQL operational pipeline (FROM / WHERE / GROUP BY /
> aggregate / compute output columns / ORDER BY) is it supposed to be
> computed?  To be useful for the often-requested purpose of nicely
> labeling output with line numbers, it'd have to be assigned
> post-ORDER-BY, but then it doesn't make any sense at all to use it in
> WHERE, nor in sub-selects.
>
> A function implemented as per Michael's example would not give the
> results that I think people would expect for
>
>         SELECT rownum(), * FROM foo ORDER BY whatever;
>
> unless the planner chances to do the ordering with an indexscan.
> If it does it with a sort step then the rownums will be computed before
> sorting :-(

I don't know about Oracle or ROW_NUM, but SQL apparently defines
ROW_NUMBER() OVER (..) (see
<url:http://en.wikipedia.org/wiki/Select_(SQL)#ROW_NUMBER.28.29_window_function>)

This gives a number for each output row, according to some ordering
(in SQL, one cannot do ORDER BY in a subquery AFAIK). If used in a
subquery, one can then of course use the resulting column in the WHERE
clause of the outer query:

SELECT * FROM ( SELECT   ROW_NUMBER() OVER (ORDER BY key ASC) AS rownumber,   columns FROM tablename
) AS foo
WHERE rownumber <= 10

(example stolen from the Wikipedia article linked above).

--
Nicolas Barbier
http://www.gnu.org/philosophy/no-word-attachments.html