Re: Proposal: q-gram GIN and GiST indexes - Mailing list pgsql-hackers

From Alexander Korotkov
Subject Re: Proposal: q-gram GIN and GiST indexes
Date
Msg-id BANLkTik7VZc=2mVQMPwukti3R_EveD_5=g@mail.gmail.com
In response to Re: Proposal: q-gram GIN and GiST indexes  (Alexander Korotkov <aekorotkov@gmail.com>)
Responses Re: Proposal: q-gram GIN and GiST indexes  (Robert Haas <robertmhaas@gmail.com>)
List pgsql-hackers
For example, here is the distribution of q-gram counts in 120 MB of dblp paper titles (a fairly large dataset).
q   count
2    7218
3  115107
4  589428
5 1648453
6 3336685
The number of 5-grams is about 14x the number of 3-grams (1648453 / 115107 ≈ 14.3). But most of the index space will be occupied by links to the rows (about 120 million links), while the size of the q-grams themselves will be almost negligible in comparison.
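
To make the distinction between q-gram keys and row links concrete, here is a minimal Python sketch that counts both for a handful of strings. It is not how the numbers above were produced: the function names, the lower-casing, the lack of padding at string boundaries, and the rule of one link per distinct q-gram per row are all assumptions made for illustration.

```python
def qgrams(text, q):
    """Return all overlapping substrings of length q in text."""
    return [text[i:i + q] for i in range(len(text) - q + 1)]


def qgram_stats(titles, q):
    """Return (number of distinct q-grams, number of q-gram -> row links).

    Assumption of this sketch: an inverted index stores one link per
    distinct q-gram per row, so repeats within a row are counted once.
    """
    distinct = set()
    links = 0
    for title in titles:
        grams = set(qgrams(title.lower(), q))
        distinct |= grams      # keys the index would hold
        links += len(grams)    # row references the index would hold
    return len(distinct), links


# Hypothetical sample titles, not taken from the dblp dataset.
titles = ["Indexing q-grams", "q-gram GIN indexes", "GiST indexes"]
for q in (2, 3, 4, 5):
    keys, links = qgram_stats(titles, q)
    print(q, keys, links)
```

On a real dataset the number of keys grows with q as in the table above, while the number of links is driven mainly by the total text length, which is why the links dominate the index size.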

----
With best regards,
Alexander Korotkov.
