pgsql: Improve procost estimates for some text search functions. - Mailing list pgsql-committers

From Tom Lane
Subject pgsql: Improve procost estimates for some text search functions.
Date
Msg-id E1YpMCl-0000Gs-7O@gemulon.postgresql.org
Whole thread Raw
List pgsql-committers
Improve procost estimates for some text search functions.

The text search functions that involve parsing raw text into lexemes are
remarkably CPU-intensive, so estimating them at the same cost as most other
built-in functions seems like a mistake; moreover, doing so turns out to
discourage the optimizer from using functional indexes on these functions.
After some debate, we've agreed to raise procost from 1 to 100 for
to_tsvector(), plainto_tsvector(), to_tsquery(), ts_headline(),
ts_match_tt(), and ts_match_tq(), which are all the text search functions
that parse raw text.

Also increase procost for the 2-argument form of ts_rewrite()
(tsquery_rewrite_query); while this function doesn't do text parsing,
it does execute a user-supplied SQL query, so its previous procost of 1 is
clearly a drastic underestimate.  It seems reasonable to assign it the same
cost we assign to PL functions by default, so 100 is the number here too.

I did not bother bumping catversion for this change, since it does not
break catalog compatibility with the server executable nor result in
any regression test changes.

Per complaint from Andrew Gierth and subsequent discussion.

Branch
------
master

Details
-------
http://git.postgresql.org/pg/commitdiff/2503982be4ca48f48d2bb6e1d46160b23e4bb268

Modified Files
--------------
src/include/catalog/pg_proc.h |   26 +++++++++++++-------------
1 file changed, 13 insertions(+), 13 deletions(-)


pgsql-committers by date:

Previous
From: Robert Haas
Date:
Subject: pgsql: Recursively fsync() the data directory after a crash.
Next
From: Robert Haas
Date:
Subject: pgsql: Use outerPlanState macro instead of referring to leffttree.