Home > mailing lists

Re: pgbench - allow to specify scale as a size - Mailing list pgsql-hackers

From	Fabien COELHO
Subject	Re: pgbench - allow to specify scale as a size
Date	March 4, 2018 12:09:11
Msg-id	alpine.DEB.2.20.1803040947500.12500@lancre Whole thread
In response to	Re: pgbench - allow to specify scale as a size (Peter Eisentraut <peter.eisentraut@2ndquadrant.com>)
Responses	Re: pgbench - allow to specify scale as a size
List	pgsql-hackers

Tree view

>>> Now the overhead is really 60-65%. Although the specification is unambiguous,
>>> but we still need some maths to know whether it fits in buffers or memory...
>>> The point of Karel regression is to take this into account.
>>>
>>> Also, whether this option would be more admissible to Tom is still an open
>>> question. Tom?
>>
>> Here is a version with this approach: the documentation talks about
>> "actual data size, without overheads", and points out that storage
>> overheads are typically an additional 65%.
>
> I think when deciding on a size for a test database for benchmarking,
> you want to size it relative to RAM or other storage layers.  So a
> feature that allows you to create a database of size N but it's actually
> not going to be anywhere near N seems pretty useless for that.

Hmmm.

At least the option say the size of the useful data, which should be what 
the user be really interested in:-) You have a developer point of view 
about the issue. From a performance point of view, ISTM that useful data 
storage size is an interesting measure, which allows to compare between 
(future) storage engines and show the impact of smaller overheads, for 
instance.

The other option can only be some kind of approximation, and would require 
some kind of maintenance (eg a future zheap overhead would be different 
that the heap overhead, the overhead depends on the size itself, and it 
could also depend on other options). This has been rejected, and I agree 
with the rejection (incredible:-).

So ISTM that the patch is dead because it is somehow necessarily 
imprecise. People will continue to do some wild guessing on how to 
translate scale to anything related to size.

> (Also, we have, for better or worse, settled on a convention for byte
> unit prefixes in guc.c.  Let's not introduce another one.)

Hmmm. Indeed for worse, as it is soooo much better to invent our own units 
than to reuse existing ones which were not confusing enough:-)

  - SI units: 1kB = 1000 bytes (*small* k)
  - IEC units: 1KiB = 1024 bytes
  - JEDEC units: 1KB = 1024 bytes (*capital* k)

But postgres documentation uses "kB" for 1024 bytes, too bad:-(

The gucs are about memory, which is measured in 1024, but the storage is 
usually measured in 1000, and this option was about storage, hence I felt 
it better to avoid confusion.

Conclusion: mark the patch as rejected?

-- 
Fabien.

pgsql-hackers by date:

From: David Rowley
Date: 04 March 2018, 11:43:23
Subject: Re: [HACKERS] Removing LEFT JOINs in more cases

From: Thomas Munro
Date: 04 March 2018, 12:27:01
Subject: Re: select_parallel test failure: gather sometimes losing tuples(maybe during rescans)?

Re: pgbench - allow to specify scale as a size - Mailing list pgsql-hackers

Previous

Next