Re: FTS parser - missing UUID token type - Mailing list pgsql-hackers

From	Tom Lane
Subject	Re: FTS parser - missing UUID token type
Date	September 14, 2022 17:10:39
Msg-id	2673581.1663164639@sss.pgh.pa.us
In response to	FTS parser - missing UUID token type (Przemysław Sztoch <przemyslaw@sztoch.pl>)
List	pgsql-hackers

Przemysław Sztoch <przemyslaw@sztoch.pl> writes:
> I miss UUID, which indexes very strangely, is more and more popular and 
> people want to search for it.

Really?  UUIDs in running text seem like an extremely uncommon
use-case to me.  URLs in running text are common nowadays, which is
why the text search parser has special code for that, but UUIDs?

Adding such a thing isn't cost-free either.  Aside from the
probably-substantial development effort, we know from experience
with the URL support that it sometimes misfires and identifies
something as a URL or URL fragment when it really isn't one.
That leads to poorer indexing of the affected text.  It seems
likely that adding a UUID token type would be a net negative
for most people, since they'd be subject to that hazard even if
their text contains no true UUIDs.

It's a shame that the text search parser isn't more extensible.
If it were you could imagine having such a feature while making
it optional.  I'm not volunteering to fix that though :-(

            regards, tom lane
