Ühel kenal päeval, T, 2006-08-01 kell 10:54, kirjutas Andrew Dunstan:
> Gregory Stark wrote:
> >
> > I looked a while back and was suspicious about the actual hash functions too.
> > It seemed like a lot of them were vastly suboptimal. That would mean we're
> > often dealing with mostly empty and mostly full buckets instead of well
> > distributed hash tables.
> >
> >
> >
>
> This is now sounding like a lot of low hanging fruit ... highly
> performant hash indexed tables could possibly be a very big win.
>
Are you sure about the badness of our hash functions ?
I just tested and hashtext(text) has about 1.4% of collisions on about
120M distinct texts, which is not bad considering thet total space for
hashes is 4G, meaning that 120M covers itself already 3% of possible
hash space.
--
----------------
Hannu Krosing
Database Architect
Skype Technologies OÜ
Akadeemia tee 21 F, Tallinn, 12618, Estonia
Skype me: callto:hkrosing
Get Skype for free: http://www.skype.com