Home > mailing lists

Global shared meta cache - Mailing list pgsql-hackers

From	Ideriha, Takeshi
Subject	Global shared meta cache
Date	June 26, 2018 09:48:28
Msg-id	4E72940DA2BF16479384A86D54D0988A567B9245@G01JPEXMBKW04 Whole thread
Responses	RE: Global shared meta cache Re: Global shared meta cache Re: Global shared meta cache RE: Global shared meta cache Re: Global shared meta cache
List	pgsql-hackers

Tree view

Hi, hackers!

My customer created hundreds of thousands of partition tables and tried to select data from hundreds of applications,
which resulted in enormous consumption of memory because it consumed # of backend multiplied by # of local memory (ex.
100backends X 1GB = 100GB).

Relation caches are loaded on each backend local memory. 

To address this issue I'm trying to move meta caches like catcache or relcache into shared memory.

This topic seems to have been discussed several times.
For instance this thread: 
https://www.postgresql.org/message-id/CA%2BTgmobjDw_SWsxyJwT9z-YOwWv0ietuQx5fb%3DWEYdDfvCbzGQ%40mail.gmail.com 

In my understanding, it discussed moving catcache and relcache to shared memory rather than current local backend
memory,
and is most concerned with performance overhead.

Robert Haas wrote:
> I think it would be interested for somebody to build a prototype here
> that ignores all the problems but the first and uses some
> straightforward, relatively unoptimized locking strategy for the first
> problem. Then benchmark it. If the results show that the idea has
> legs, then we can try to figure out what a real implementation would
> look like.
> (One possible approach: use Thomas Munro's DHT stuff to build the shared cache.)

I'm inspired by this comment and now developing a prototype (please see attached),
but I haven't yet put cache structure on shared memory.
Instead, I put dummy data on shared memory which is initialized at startup, 
and then acquire/release lock just before/after searching/creating catcache entry.

I haven't considered relcache and catcachelist either.
It is difficult for me to do everything at one time with right direction. 
So I'm trying to make small prototype and see what I'm walking on the proper way.

I tested pgbench to compare master branch with my patch. 

0) Environment
   - RHEL 7.4
   - 16 cores
   - 128 GB memory

1) Initialized with pgbench -i -s10

2) benchmarked 3 times for each conditions and got the average result of TPS.
                                     |master branch | prototype      | proto/master (%)
   ------------------------------------------------------------------------------------
   pgbench -c48 -T60 -Msimple -S   | 131297       |130541       |101%
   pgbench -c48 -T60 -Msimple      | 4956          |4965       |95%
   pgbench -c48 -T60 -Mprepared -S |129688       |132538       |97%
   pgbench -c48 -T60 -Mprepared    |5113       |4615       |84%

  This result seems to show except for prepared protocol with "not only SELECT" it didn't make much difference.

What do you think about it?
Before I dig deeper, I want to hear your thoughts.

Best regards,
Takeshi Ideriha

Attachment

001_global_meta_cache.patch

pgsql-hackers by date:

From: Prabhat Sahu
Date: 26 June 2018, 09:42:58
Subject: "Access privileges" is missing after pg_dumpall

From: Kyotaro HORIGUCHI
Date: 26 June 2018, 10:10:26
Subject: Let's remove DSM_IMPL_NONE.

Global shared meta cache - Mailing list pgsql-hackers

Attachment

Previous

Next