Re: integrated tsearch doesn't work with non utf8 database - Mailing list pgsql-hackers

From Heikki Linnakangas
Subject Re: integrated tsearch doesn't work with non utf8 database
Date
Msg-id 46E55B1F.3090207@enterprisedb.com
In response to Re: integrated tsearch doesn't work with non utf8 database  (Tom Lane <tgl@sss.pgh.pa.us>)
Responses Re: integrated tsearch doesn't work with non utf8 database
List pgsql-hackers
Tom Lane wrote:
> Teodor Sigaev <teodor@sigaev.ru> writes:
>>> Note the Seq Scan on pg_ts_config_map, with filter on ts_lexize(mapdict,
>>> $1). That means that it will call ts_lexize on every dictionary, which
>>> will try to load every dictionary. And loading danish_stem dictionary
>>> fails in latin2 encoding, because of the problem with the stopword file.
> 
>> Attached patch should fix it, I hope.
> 
> Uh, how will that help?  AFAICS it still has to call ts_lexize with
> every dictionary.

No, ts_lexize is no longer in the seq scan filter, but in the sort key,
which is calculated only for the rows that match the filter 'mapcfg=?
AND maptokentype=?'. It is pretty kludgey, though: with different
statistics, the planner could choose another plan that fails.
Rewriting the function in C would be a more robust fix.
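For illustration, the pattern being relied on is roughly the following
(a hypothetical sketch, not the actual query from the patch; the real
column list and sort expression may differ):

```sql
-- Sketch: the scan filter on mapcfg/maptokentype restricts the rows
-- first, so the sort key expression calling ts_lexize is evaluated
-- only for the matching dictionaries, not for every row in the table.
SELECT mapdict
FROM pg_ts_config_map
WHERE mapcfg = $1 AND maptokentype = $2       -- seq scan filter
ORDER BY ts_lexize(mapdict, $3) IS NULL;      -- computed post-filter
```

The fragility is that nothing forces the planner to evaluate the sort
key after the filter in every possible plan, which is why a C rewrite
that controls the evaluation order explicitly would be more robust.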

--
Heikki Linnakangas
EnterpriseDB   http://www.enterprisedb.com

