Thread: Using an ALIAS in WHERE clause
I'm using a query with similar functionality to the following: SELECT id, sum(hours) AS totalhours FROM mytable WHERE totalhours > 50; I get the following error: Attribute 'totalhours' not found. Am I not allowed to use an alias here? If not, how can I get my desired output? Thanks Note: I'm using postgresql 7.2 -- Ron St.Pierre Syscor R&D tel: 250-361-1681 email: rstpierre@syscor.com
On Fri, 2002-11-29 at 01:17, Ron St.Pierre wrote: > I'm using a query with similar functionality to the following: > > SELECT id, > sum(hours) AS totalhours > FROM mytable > WHERE totalhours > 50; > > I get the following error: > Attribute 'totalhours' not found. > > Am I not allowed to use an alias here? If not, how can I get my desired > output? select id, sum(hours) as totalhours from mytable group by id having totalhours > 50 'where' is for tuple selection criteria 'having' is for group selection criteria i suppose you want to have the total number of hours per id (therefore we need to group by id. Does this help ? Tycho
"Ron St.Pierre" <rstpierre@syscor.com> writes: > I'm using a query with similar functionality to the following: > SELECT id, > sum(hours) AS totalhours > FROM mytable > WHERE totalhours > 50; > I get the following error: > Attribute 'totalhours' not found. > Am I not allowed to use an alias here? No. Evaluation of the WHERE clause logically precedes evaluation of the SELECT list, so it's really quite nonsensical to expect SELECT outputs to be available in WHERE. Furthermore, in this particular case you'd be introducing an aggregate function into WHERE, which is also nonsensical. Aggregate results have to be checked in HAVING, which acts after grouping/aggregation, whereas WHERE filters rows beforehand. You may find it helpful to read http://www.ca.postgresql.org/users-lounge/docs/7.2/postgres/tutorial-agg.html (BTW, I assume there's really a "GROUP BY id" in there? If not, you've got other problems.) > If not, how can I get my desired output? Like so: SELECT id, sum(hours) AS totalhours FROM mytable GROUP BY id HAVING sum(hours) > 50; If you really can't be bothered to write sum() twice, you could consider a two-level SELECT: SELECT * FROM (SELECT id, sum(hours) AS totalhours FROM mytable GROUP BY id) ss WHERE totalhours > 50; The sub-select has its own aggregation pipeline that acts before the outer select does anything, so the basic rule of "no aggregate references in WHERE" is not being violated here. regards, tom lane
Tom Lane <tgl@sss.pgh.pa.us> wrote: [snip] > > The sub-select has its own aggregation pipeline that acts before the > outer select does anything, so the basic rule of "no aggregate > references in WHERE" is not being violated here. > I was thinking of an related thing, how do we handle queries like these (actual used query): select o.id as order_id, o.cust_id, i.id as invoice_id, i.created::date as invoice_created, extract('days' from (now() - i.created)) as days_overdue, c.type, c.status from order o, invoice i, cust c where (o.ordersystem = 0) and (o.status = 3 and o.substatus = 3) and (i.order_id = o.id) and (c.id = o.cust_id) and (c.account_expires >= now()) and ((c.type & (1|4|8)::int8) = 0) and /* some int8 flags */ ((c.status & (2|4)::int8) = 0) and /* some other int8 flags */ ->extract('days' from (now() - i.created)) >= 20 order by dagar_overdue desc ; Is the days_overdue calculated twice, if it is, how can i get the effect of replacing the where condition with days_overdue? Like: select days_overdue ... where (extract('days' from (now() - i.created)) AS days_overdue) >= 20 Hmm. Well it's not that big of an hassle but it'd look nice! Not sure of how big of a performance win it would be, the extract thing shouldn't be that slow, right? Regards Magnus Naeslund
"Magnus Naeslund\(f\)" <mag@fbab.net> writes: > select > ... > extract('days' from (now() - i.created)) as days_overdue, > ... > where > ... > extract('days' from (now() - i.created)) >= 20 > Is the days_overdue calculated twice, Yes. > Not sure of how big of a performance win it would be, the extract thing > shouldn't be that slow, right? In general I think this is useless micro-optimization ;-). There are few functions in SQL that are expensive enough that it's worth worrying about calling them twice per row. If you have a case where it really does matter (super-expensive user-defined function, perhaps) you could probably do something with the multi-level-SELECT technique I illustrated. Years ago, someone at Berkeley did a thesis about planning in the presence of expensive functions, and the remnants of that thesis are still in the Postgres sources --- but it's dead code and would not be easy to resurrect. I personally doubt it could be worth the trouble. regards, tom lane