Re: BUG #10589: hungarian.stop file spelling error - Mailing list pgsql-bugs

From Tom Lane
Subject Re: BUG #10589: hungarian.stop file spelling error
Date
Msg-id 5337.1402434505@sss.pgh.pa.us
Whole thread Raw
In response to Re: BUG #10589: hungarian.stop file spelling error  (Tom Lane <tgl@sss.pgh.pa.us>)
Responses Re: BUG #10589: hungarian.stop file spelling error  (Tom Lane <tgl@sss.pgh.pa.us>)
List pgsql-bugs
I wrote:
> [ we seem to have gotten a misencoded version of hungarian.stop ]

Actually, it looks like things are even worse than that: the Hungarian
stemmer code seems to be confused about this too.  In the first place,
we've got a LATIN1 version of that stemmer, which I would imagine is
entirely useless; and in the second place, the UTF8 version has no
reference to any non-LATIN1 characters.

Again, I'm suspecting this problem goes further than Hungarian,
because the set of stem_ISO_8859_1_foo.c files in
src/backend/snowball/libstemmer/ covers a lot more languages than
I think LATIN1 is meant to cope with.  I'm not sure how much of this
is broken in the original Snowball code and how much is our error
while importing the code.

            regards, tom lane

pgsql-bugs by date:

Previous
From: David G Johnston
Date:
Subject: Re: BUG #10591: setting newly added columns to null is slow
Next
From: Tom Lane
Date:
Subject: Re: BUG #10589: hungarian.stop file spelling error