Re: Read/Write block sizes (Was: Caching by Postgres) - Mailing list pgsql-performance

From Jignesh Shah
Subject Re: Read/Write block sizes (Was: Caching by Postgres)
Date
Msg-id d4b67d7c2b.d7c2bd4b67@bur-mail1.east.sun.com
Whole thread Raw
Responses Re: Read/Write block sizes (Was: Caching by Postgres)
List pgsql-performance
> Does that include increasing the size of read/write blocks? I've
> noticedthat with a large enough table it takes a while to do a
> sequential scan,
> even if it's cached; I wonder if the fact that it takes a million
> read(2) calls to get through an 8G table is part of that.
>


Actually some of that readaheads,etc  the OS does  already if it does some sort of throttling/clubbing of reads/writes.
Butits not enough for such types of workloads. 

Here is what I think will help:

* Support for different Blocksize TABLESPACE without recompiling the code.. (Atlease support for a different Blocksize
forthe whole database without recompiling the code) 

* Support for bigger sizes of WAL files instead of 16MB files WITHOUT recompiling the code.. Should be a tuneable if
youask me (with checkpoint_segments at 256.. you have too many 16MB files in the log directory) (This will help OLTP
benchmarksmore since now they don't spend time rotating log files) 

* Introduce a multiblock or extent tunable variable where you can define a multiple of 8K (or BlockSize tuneable) to
reada bigger chunk and store it in the bufferpool.. (Maybe writes too) (Most devices now support upto 1MB chunks for
readsand writes) 

*There should be a way to preallocate files for TABLES in TABLESPACES otherwise with multiple table writes in the same
filesystemends with fragmented files which causes poor "READS" from the files.  

* With 64bit 1GB file chunks is also moot.. Maybe it should be tuneable too like 100GB without recompiling the code.


Why recompiling is bad? Most companies that will support Postgres will support their own binaries and they won't prefer
differentversions of binaries for different blocksizes, different WAL file sizes, etc... and hence more function using
thesame set of binaries is more desirable in enterprise environments 


Regards,
Jignesh



pgsql-performance by date:

Previous
From: Chris Browne
Date:
Subject: Re: Caching by Postgres
Next
From: William Yu
Date:
Subject: Re: Caching by Postgres