Home > mailing lists

Re: reducing the footprint of ScanKeyword (was Re: Large writable variables) - Mailing list pgsql-hackers

From	Tom Lane
Subject	Re: reducing the footprint of ScanKeyword (was Re: Large writable variables)
Date	March 20, 2019 20:24:25
Msg-id	18021.1553113465@sss.pgh.pa.us Whole thread Raw
In response to	Re: reducing the footprint of ScanKeyword (was Re: Large writable variables) (Joel Jacobson <joel@trustly.com>)
Responses	Re: reducing the footprint of ScanKeyword (was Re: Large writable variables)
List	pgsql-hackers

Tree view

Joel Jacobson <joel@trustly.com> writes:
> I've seen a performance trick in other hash functions [1]
> to instead read multiple bytes in each iteration,
> and then handle the remaining bytes after the loop.
> [1] https://github.com/wangyi-fudan/wyhash/blob/master/wyhash.h#L29

I can't get very excited about this, seeing that we're only going to
be hashing short strings.  I don't really believe your 30% number
for short strings; and even if I did, there's no evidence that the
hash functions are worth any further optimization in terms of our
overall performance.

Also, as best I can tell, the approach you propose would result
in an endianness dependence, meaning we'd have to have separate
lookup tables for BE and LE machines.  That's not a dealbreaker
perhaps, but it is certainly another point on the "it's not worth it"
side of the argument.

            regards, tom lane

pgsql-hackers by date:

From: legrand legrand
Date: 20 March 2019, 20:05:06
Subject: Re: [survey] New "Stable" QueryId based on normalized query text

From: Julien Rouhaud
Date: 20 March 2019, 20:30:01
Subject: Re: [survey] New "Stable" QueryId based on normalized query text

Re: reducing the footprint of ScanKeyword (was Re: Large writable variables) - Mailing list pgsql-hackers

Previous

Next