Re: General purpose hashing func in pgbench - Mailing list pgsql-hackers

From Fabien COELHO
Subject Re: General purpose hashing func in pgbench
Date
Msg-id alpine.DEB.2.20.1712261822040.22976@lancre
In response to Re: General purpose hashing func in pgbench  ("Daniel Verite" <daniel@manitou-mail.org>)
List pgsql-hackers
Bonjour Daniel,

>> Most "permutation" functions are really cryptographic cyphers which are
>> quite expensive, and require powers of two, which is not what is needed.
>> ISTM that there are some constructs to deal with arbitrary sizes based on
>> cryptographic functions, but that would make it too expensive for the
>> purpose.
>
> FWIW, you might have a look at the permuteseq extension:
>  https://pgxn.org/dist/permuteseq
> It permutes an arbitrary range of 64-bit integers into itself,
> with a user-supplied key as the seed.
> Outputs are coerced into the desired range by using the
> smallest possible power of two for the Feistel cypher's
> block size, and then cycle-walking over the results.

Thanks for the pointer.

I must admit that I am not fond of the iterative "cycle walking"
approach: some values need several passes through the cipher before they
land in the target range, so it can be much more expensive for those
values, and the cost is not uniform. Even setting that point aside, the
overall encryption looks quite costly.
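
To make the cost concern concrete, here is a minimal sketch of cycle walking over a Feistel network. It is not the permuteseq code: the round function, the round count, and the names `_round`, `feistel` and `permute` are assumptions made for illustration, and the block is rounded up to an even number of bits so the two halves stay balanced. The second return value counts cipher calls, which is the per-value cost that varies.

```python
import hashlib
from typing import Tuple


def _round(value: int, key: int, rnd: int, half_bits: int) -> int:
    """Hypothetical round function: a keyed hash truncated to half_bits bits."""
    data = f"{key}:{rnd}:{value}".encode()
    digest = hashlib.sha256(data).digest()
    return int.from_bytes(digest[:8], "big") & ((1 << half_bits) - 1)


def feistel(x: int, key: int, half_bits: int, rounds: int = 4) -> int:
    """Balanced Feistel network: a bijection on [0, 2**(2 * half_bits))."""
    mask = (1 << half_bits) - 1
    left, right = x >> half_bits, x & mask
    for rnd in range(rounds):
        # Each round is invertible whatever _round computes.
        left, right = right, left ^ _round(right, key, rnd, half_bits)
    return (left << half_bits) | right


def permute(x: int, size: int, key: int) -> Tuple[int, int]:
    """Permute x within [0, size); also return the number of cipher calls."""
    if not 0 <= x < size:
        raise ValueError("x must be in [0, size)")
    # Smallest even bit width whose block covers [0, size).
    bits = max(2, (size - 1).bit_length())
    bits += bits % 2
    half_bits = bits // 2

    y = feistel(x, key, half_bits)
    calls = 1
    # Cycle walking: re-encrypt until the result falls back inside the range.
    # This terminates because x itself lies on the cycle and is in range.
    while y >= size:
        y = feistel(y, key, half_bits)
        calls += 1
    return y, calls
```

Most inputs need a single call, but an input whose image falls outside `[0, size)` pays for every extra step of the walk, and the worst case grows as `size` sits further below the block size.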

For testing purposes, I'm looking for "pretty cheap" and "good enough", 
and for that I'm ready to forsake "cryptographic":-)
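
As an illustration of the "cheap but not cryptographic" end of the trade-off, here is a sketch of an affine permutation on an arbitrary range. It is an assumption chosen for the example, not the function proposed in this thread; the name `cheap_permute` and the way the multiplier and offset are derived from the seed are hypothetical. The cost is constant per value, but the scattering is weak, since consecutive inputs land a fixed stride apart.

```python
from math import gcd


def cheap_permute(x: int, size: int, seed: int) -> int:
    """Affine map (a * x + b) mod size, a bijection when gcd(a, size) == 1."""
    if size <= 0 or not 0 <= x < size:
        raise ValueError("need size > 0 and x in [0, size)")
    # Derive a multiplier from the seed, then bump it until it is coprime
    # with size; the loop ends at the latest when a == 1 (mod size).
    a = seed % size or 1
    while gcd(a, size) != 1:
        a += 1
    # Arbitrary seed-derived offset (illustrative choice only).
    b = (seed >> 16) % size
    return (a * x + b) % size
```

This works for any `size`, not only powers of two, and needs no retry loop.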

Thanks again, it made an interesting read!

-- 
Fabien.

