That information is very helpful, thanks. I've tried verifying directly with the to_tsvector() function and could see
thatyou are right about no more than 255 locations being saved in the vector. So it makes sense adding that to the
documentation,otherwise people with large documents will obtain misleading or wrong numbers.
> On 21. Jan 2019, at 18:31, Tom Lane <tgl@sss.pgh.pa.us> wrote:
>
> =?utf-8?q?PG_Bug_reporting_form?= <noreply@postgresql.org> writes:
>> Unexpected behaviour:
>> netry for 'hello' results in 255 despite 'hello' occurs 539 times in the
>> attached test.
>
> I think this is a consequence of the MAXNUMPOS limitation in the source
> code, ie an individual tsvector won't store more than 255 locations for
> the same word. That's intentional to keep common words from bloating
> tsvectors too much. But if it's documented anywhere, I didn't see it.
>
> regards, tom lane