Home > mailing lists

RE: [Patch] Optimize dropping of relation buffers using dlist - Mailing list pgsql-hackers

From	tsunakawa.takay@fujitsu.com
Subject	RE: [Patch] Optimize dropping of relation buffers using dlist
Date	October 23, 2020 00:56:35
Msg-id	TYAPR01MB2990C1DBBC8985E27CC28ADBFE1A0@TYAPR01MB2990.jpnprd01.prod.outlook.com Whole thread Raw
In response to	Re: [Patch] Optimize dropping of relation buffers using dlist (Thomas Munro <thomas.munro@gmail.com>)
Responses	RE: [Patch] Optimize dropping of relation buffers using dlist
List	pgsql-hackers

Tree view

From: Thomas Munro <thomas.munro@gmail.com>
> > I'm probably being silly, but can't we avoid the problem by using fstat()
> instead of lseek(SEEK_END)?  Would they return the same value from the
> i-node?
> 
> Amazingly, st_size can disagree with SEEK_END when using the Linux NFS
> client, but its behaviour is worse.  Here's a sequence from a Linux
> NFS client talking to a Linux NFS server with no free space.  This
> time, I also replaced the fsync() with sleep(60), just to make it
> clear that SEEK_END offset can move at any time due to asynchronous
> activity in kernel threads:

Thank you for experimenting.  That's surely amazing.  So, it makes sense for commercial DBMSs and MySQL to preallocate
datafiles... (But IIRC, MySQL has provided an option to allocate a file per table like Postgres relatively recently.)
 

FWIW, it seems safe to use the nodelalloc mount option with ext4 to disable delayed allocation, while xfs doesn't have
suchan option.
 

> > Or, can't we just try to do BufTableLookup() one block after what
> smgrnblocks() returns?
> 
> Unfortunately the problem isn't limited to one block.

You're right.  The data file can be extended by multiple blocks between disk writes.


Regards
Takayuki Tsunakawa

pgsql-hackers by date:

From: Ian Lawrence Barwick
Date: 23 October 2020, 00:53:29
Subject: proposal: function pg_setting_value_split() to parse shared_preload_libraries etc.

From: Peter Geoghegan
Date: 23 October 2020, 01:05:28
Subject: heapam and bottom-up garbage collection, keeping version chains short (Was: Deleting older versions in unique indexes to avoid page splits)

RE: [Patch] Optimize dropping of relation buffers using dlist - Mailing list pgsql-hackers

Previous

Next