Home > mailing lists

Re: [PATCH] backend: compare word-at-a-time in bcTruelen - Mailing list pgsql-hackers

From	Jeremy Kerr
Subject	Re: [PATCH] backend: compare word-at-a-time in bcTruelen
Date	June 25, 2009 23:22:16
Msg-id	200906261018.58495.jk@ozlabs.org Whole thread Raw
In response to	Re: [PATCH] backend: compare word-at-a-time in bcTruelen (Stephen Frost <sfrost@snowman.net>)
Responses	Re: [PATCH] backend: compare word-at-a-time in bcTruelen
List	pgsql-hackers

Tree view

Hi Stephen,

> What would be really useful would be "best case" and "worst case"
> scenarios.

I've put together some data from a microbenchmark of the bcTrulen 
function, patched and unpatched.

As for best-case, if you have a long string of trailing spaces, we can 
go through them at theoretically one quarter of cost (a test benchmark 
on x86 shows an actual reduction of 11 to 3 sec with a string of 100 
spaces).

Worst-case behaviour is with smaller numbers of spaces. Here are the 
transition points (ie, where doing the word-wise comparison is faster 
than byte-wise) that I see from my benchmark:
align    spaces    3    7    2    6    1    5    0    4
- where 'align' is the alignment of the first byte to compare (ie, at 
the end of the string). This is pretty much as-expected, as these 
transition points are the first opportunity that the new function has to 
do a word compare.

In the worst cases, I see a 53% cost increase on x86 (with the string 
'aaa ') and a 97% increase on PowerPC ('a  ').

So, it all depends on the number of padding spaces we'd expect to see on 
workload data. Fortunately, we see the larger reductions on the more 
expensive operations (ie, longer strings).

Cheers,


Jeremy

pgsql-hackers by date:

From: Tom Lane
Date: 25 June 2009, 21:50:06
Subject: Re: 8.4 RC1 union/nested select cast bug?

From: Itagaki Takahiro
Date: 25 June 2009, 23:40:17
Subject: query cancel issues in contrib/dblink

Re: [PATCH] backend: compare word-at-a-time in bcTruelen - Mailing list pgsql-hackers

Previous

Next