Re: tsearch2 vs core? - Mailing list pgsql-hackers

From Junji TERAMOTO
Subject Re: tsearch2 vs core?
Date
Msg-id 42C4DBEE.8010603@lab.ntt.co.jp
Whole thread Raw
In response to Re: tsearch2 vs core?  ("Hiroshi Saito" <saito@inetrt.skcapi.co.jp>)
List pgsql-hackers
Hello all,

>>tsearch2 now doesn't support multibyte encoding and has problems with UTF :(.

I read this url;
http://www.sai.msu.su/~megera/postgres/gist/tsearch/V2/donate.shtml
---
- UTF-8 support
Currently, tsearch2's parser doesn't supports utf-8. This is very
important if you want to search multilingual documents.
---
But, there is another document;
"Tsearch2 and Unicode/UTF-8"
http://www.sai.msu.su/~megera/postgres/gist/tsearch/V2/docs/tsearch2_german_utf8.html

Which is true? The tsearch2 suppoort UTF or not?
If we use "simple" parser, we can store UTF-8 text into tsearch2?
Or, do we have to make own parser which can parse UTF-8 text?

> Japanese Mr. Junji TERAMOTO(NTT) is supporting it.
> http://www.oss.ecl.ntt.co.jp/tsearch2j/
> However, It is EUC_JP. He will probably understand the problem.
> Though I don't understand the problem....

Sorry, I don't understand this problem exactly...

-- 
Junji Teramoto


pgsql-hackers by date:

Previous
From: Michael Fuhr
Date:
Subject: Re: 3des functions?
Next
From: "Qingqing Zhou"
Date:
Subject: Re: Checkpoint cost, looks like it is WAL/CRC