Thread: SQL question, TOP 5 and all OTHERS
Hi,
I’m trying to get up-to-speed with PostgreSQL and have a dumb question. I have a basic query to pull the top 5 vendors that have sent me the most bills. I would like to lump all the other vendors into a row named “Other” and get a count of all those bills excluding the top 5. Below is the basic query.
SELECT vendor_name AS vendor_name,
count(DISTINCT inv_id) AS "# of Invoices"
FROM SpendTable
GROUP BY vendor_name
ORDER BY "# of Invoices" DESC
LIMIT 5
Thanks,
Scott
what about using a Common Table Expression and the clause WITH ?
I am not at home now but you can write something like
WITH top5 AS (SELECT vendor_name AS vendor_name,
count(DISTINCT inv_id) AS "# of Invoices"
FROM SpendTable
GROUP BY vendor_name
ORDER BY "# of Invoices" DESC LIMIT 5)
SELECT * FROM top5
UNION
SELECT 'all other' AS vendor_name,
count(DISTINCT st.inv_id) AS "# of Invoices"
FROM SpendTable AS st
WHERE st.vendor_name NOT IN (SELECT vendor_name FROM top5)
ORDER BY "# of Invoices" DESC
Best regards,
--
Jean MAURICE
Grenoble - France - Europe
www.j-maurice.fr
www.atoutfox.org
www.aedtf.org
@font-face {font-family:"Cambria Math"; panose-1:2 4 5 3 5 4 6 3 2 4;}@font-face {font-family:Calibri; panose-1:2 15 5 2 2 2 4 3 2 4;}p.MsoNormal, li.MsoNormal, div.MsoNormal {margin:0in; font-size:11.0pt; font-family:"Calibri",sans-serif;}.MsoChpDefault {mso-style-type:export-only; font-size:10.0pt;}div.WordSection1 {page:WordSection1;} Hi,
I’m trying to get up-to-speed with PostgreSQL and have a dumb question. I have a basic query to pull the top 5 vendors that have sent me the most bills. I would like to lump all the other vendors into a row named “Other” and get a count of all those bills excluding the top 5. Below is the basic query.
SELECT vendor_name AS vendor_name,
count(DISTINCT inv_id) AS "# of Invoices"
FROM SpendTable
GROUP BY vendor_name
ORDER BY "# of Invoices" DESC
LIMIT 5
Thanks,
Scott
-- J. MAURICE
On Mon, Jun 06, 2022 at 09:46:12PM +0200, Jean MAURICE wrote: > Hi Scott, > what about using a Common Table Expression and the clause WITH ? > I am not at home now but you can write something like > > WITH top5 AS (SELECT vendor_name AS vendor_name, > > ?????? count(DISTINCT inv_id) AS "# of Invoices" > > FROM SpendTable > > GROUP BY vendor_name > > ORDER BY "# of Invoices" DESC > > LIMIT 5) > SELECT * FROM top5 > UNION > > SELECT 'all other' AS vendor_name, > > ?????? count(DISTINCT st.inv_id) AS "# of Invoices" > > FROM SpendTable AS st > > WHERE st.vendor_name NOT IN (SELECT vendor_name FROM top5) > > ORDER BY "# of Invoices" DESC There might be a challenge with ties, especially if you don't order by the vendor name since you could get different results even on the same data set, depending on how the query plan goes. It depends on what the OP is looking for, I guess. -- -- Skylar Thompson (skylar2@u.washington.edu) -- Genome Sciences Department (UW Medicine), System Administrator -- Foege Building S046, (206)-685-7354 -- Pronouns: He/Him/His