Re: Processor usage/tuning question - Mailing list pgsql-general

From Jim Nasby
Subject Re: Processor usage/tuning question
Date
Msg-id 543849EE.9060900@BlueTreble.com
Whole thread Raw
In response to Re: Processor usage/tuning question  (Israel Brewster <israel@ravnalaska.net>)
Responses Re: Processor usage/tuning question  (Jeff Janes <jeff.janes@gmail.com>)
Re: Processor usage/tuning question  (Israel Brewster <israel@ravnalaska.net>)
List pgsql-general
On 10/8/14, 3:17 PM, Israel Brewster wrote:
> Except that the last data point received is still valid as the aircraft's current location, even if it came in
severalhours ago - which we may well care about. That is, some users may need to see where a given aircraft (or the
entirefleet) is, even if an aircraft hasn't updated in a while. That said, I did discuss this with my higher-ups, and
gotthe ok to take it down to four hours. 

Note that in your explain output nothing is filtering by time at all; are you sure you posted the right explain?

>> I don't think PostgreSQL is going to be able to reason very effectively about a ROW_NUMBER() in a inner table and
thena row<=5 in the outer one being equivalent to a LIMIT query for which it could walk an index and then stopping once
itfinds 5 of them. 
>>
>> Does this need to issued as a single query?  Why not issue 55 different queries?  It seems like the client is likely
goingto need to pick the returned list back out by tail number anyway, so both the client and the server might be
happierwith separate queries. 
>
> Good point. Doing that brought the entire execution time down to around 60ms. Just ever so slightly better than the
~1200msI was getting before. :-) I just have an unreasonable aversion to making multiple queries. I guess this is a
primeexample of why I need to overcome that :-) 

Do you actually need the last 5 points? If you could get away with just the most recent point, SELECT DISTINCT ON might
doa better job of this in a single query. 

As for the concern about issuing multiple queries, if you code this into a database function it should still be quite
fastbecause there won't be any round-trip between your application and the database. 

Something else to consider is having a second table that only keeps the last X aircraft positions. I would do this by
duplicatingevery insert into that table via a trigger, and then have a separate process that ran once a minute to
deleteany records other than the newest X. Because that table would always be tiny queries against it should be blazing
fast.Do note that you'll want to vacuum that table frequently, like right after each delete. 
--
Jim Nasby, Data Architect, Blue Treble Consulting
Data in Trouble? Get it in Treble! http://BlueTreble.com


pgsql-general by date:

Previous
From: Jim Nasby
Date:
Subject: Re: table versioning approach (not auditing)
Next
From: Stephen Davies
Date:
Subject: Re: psql connection issue