Re: [EXTERNAL] Re: RDS IO Read time - Mailing list pgsql-general

From Eden Aharoni
Subject Re: [EXTERNAL] Re: RDS IO Read time
Date
Msg-id PA6PR03MB102418866F5C140FB0604E1EBBBAD2@PA6PR03MB10241.eurprd03.prod.outlook.com
Whole thread Raw
In response to Re: RDS IO Read time  (Christophe Pettus <xof@thebuild.com>)
List pgsql-general
First, thanks for the reply :)
So, I do know which part is taking a lot of IO time and it's to be honest any node that reads from the disk.. of course, we're running EXPLAIN on our queries (to be more specific we use auto_explain) but we can't seem to find what could cause an index scan that reads 34 MB to take more than a second (1.2 sec).. we do know that our dataset doesn't fit the memory and we were ok with that but with IO being so slow we just don't know what to do anymore. Any other suggestions other than contact AWS (which we did but it seems this path won't lead anywhere).

Thanks again 


From: Christophe Pettus <xof@thebuild.com>
Sent: Monday, March 31, 2025 6:15:39 PM
To: Eden Aharoni <edena@legitsecurity.com>
Cc: pgsql-general@lists.postgresql.org <pgsql-general@lists.postgresql.org>
Subject: [EXTERNAL] Re: RDS IO Read time
 
[You don't often get email from xof@thebuild.com. Learn why this is important at https://aka.ms/LearnAboutSenderIdentification ]

> On Mar 31, 2025, at 06:54, Eden Aharoni <edena@legitsecurity.com> wrote:
> Is this expected IO read rate? I can’t help but feel we’re missing something here..

Really, no particular I/O rate is "expected": if PostgreSQL needs that much data, it'll use that much I/O to get it.  From your description, it's likely that it's a case of the working set for the database just not fitting into the memory you have, so PostgreSQL needs to go out to secondary storage a lot to fetch the data.

The best first step is to use Performance Insights to see which queries are using I/O, and run sample ones with EXPLAIN (ANALYZE, BUFFERS) to see where the I/O is being used within the query.  Given that you allow users to assemble arbitrary queries, it's likely that PostgreSQL is having to use a wide variety of indexes (or sequential scans), so it can't successfully cache a particular set in memory.

pgsql-general by date:

Previous
From: Tom Lane
Date:
Subject: Re: Querying one partition in a function takes locks on all partitions
Next
From: Christophe Pettus
Date:
Subject: Re: [EXTERNAL] RDS IO Read time