Re: FTS parser - missing UUID token type - Mailing list pgsql-hackers

From Tom Lane
Subject Re: FTS parser - missing UUID token type
Date
Msg-id 2673581.1663164639@sss.pgh.pa.us
Whole thread Raw
In response to FTS parser - missing UUID token type  (Przemysław Sztoch <przemyslaw@sztoch.pl>)
List pgsql-hackers
=?UTF-8?Q?Przemys=c5=82aw_Sztoch?= <przemyslaw@sztoch.pl> writes:
> I miss UUID, which indexes very strangely, is more and more popular and 
> people want to search for it.

Really?  UUIDs in running text seem like an extremely uncommon
use-case to me.  URLs in running text are common nowadays, which is
why the text search parser has special code for that, but UUIDs?

Adding such a thing isn't cost-free either.  Aside from the
probably-substantial development effort, we know from experience
with the URL support that it sometimes misfires and identifies
something as a URL or URL fragment when it really isn't one.
That leads to poorer indexing of the affected text.  It seems
likely that adding a UUID token type would be a net negative
for most people, since they'd be subject to that hazard even if
their text contains no true UUIDs.

It's a shame that the text search parser isn't more extensible.
If it were you could imagine having such a feature while making
it optional.  I'm not volunteering to fix that though :-(

            regards, tom lane



pgsql-hackers by date:

Previous
From: Alvaro Herrera
Date:
Subject: Re: Avoid redudant initialization and possible memory leak (src/backend/parser/parse_relation.c)
Next
From: Marina Polyakova
Date:
Subject: Re: ICU for global collation