Re: WIP: cross column correlation ... - Mailing list pgsql-hackers

From Rod Taylor
Subject Re: WIP: cross column correlation ...
Date
Msg-id AANLkTi=5F5nyLE+Ox9j2C1AvzG-1qboZzpuZfGSukdZO@mail.gmail.com
Whole thread Raw
In response to Re: WIP: cross column correlation ...  (Robert Haas <robertmhaas@gmail.com>)
Responses Re: WIP: cross column correlation ...  (Alvaro Herrera <alvherre@commandprompt.com>)
List pgsql-hackers

4. Even if we could accurately estimate the percentage of the table
that is cached, what then?  For example, suppose that a user issues a
query which retrieves 1% of a table, and we know that 1% of that table
is cached.  How much of the data that the user asked for is cache?
Hard to say, right?  It could be none of it or all of it.  The second
scenario is easy to imagine - just suppose the query's been executed
twice.  The first scenario isn't hard to imagine either.


I have a set of slow disks which can impact performance nearly as much as in cached in memory versus the fast disks.

How practical would it be for analyze to keep a record of response times for given sections of a table as it randomly accesses them and generate some kind of a map for expected response times for the pieces of data it is analysing?

It may well discover, on it's own, that recent data (1 month old or less) has a random read response time of N, older data (1 year old) in a different section of the relation tends to have a response time of 1000N, and really old data (5 year old) tends to have a response time of 3000N.

pgsql-hackers by date:

Previous
From: Cédric Villemain
Date:
Subject: Re: WIP: cross column correlation ...
Next
From: Tom Lane
Date:
Subject: Re: wCTE behaviour