unaccent extension missing some accents - Mailing list pgsql-hackers

From J Smith
Subject unaccent extension missing some accents
Date
Msg-id CADFUPgcM_W5-oeUHFF5rZePzYWFUZbsfpN9gAJBAw+4XAnHy4w@mail.gmail.com
Whole thread Raw
Responses Re: unaccent extension missing some accents
List pgsql-hackers
G'day list.

I've been messing around with the unaccent extension and I've noticed
that some of the characters listed in the unaccent.rules file aren't
actually being unaccented on my system.

Here are the system details and whatnot.

- OSX 10.7.2

- the server is compiled via macports. Tried using both gcc and llvm
4.2.1 compilers that come with the latest version of XCode.

- the same symptoms show up in both 9.0.5 and 9.1.1. I've also tried
building manually from the latest REL9_1_STABLE branch from git to
make sure macports wasn't the problem, but I'm getting the same
results with both compilers.

When I first do a CREATE EXTENSION for unaccent, I'm seeing the
following warnings in the log file:

===
WARNING:  duplicate TO argument, use first one
CONTEXT:  line 8 of configuration file
"/usr/local/postgresql91-local/share/tsearch_data/unaccent.rules":
"à    a"
WARNING:  duplicate TO argument, use first one
CONTEXT:  line 57 of configuration file
"/usr/local/postgresql91-local/share/tsearch_data/unaccent.rules":
"Ġ    G"
WARNING:  duplicate TO argument, use first one
CONTEXT:  line 144 of configuration file
"/usr/local/postgresql91-local/share/tsearch_data/unaccent.rules":
"Š    S"
===

I've dug around through the unaccent code a bit and I've noticed that
the sscanf it does when reading the file is producing some odd output.


pgsql-hackers by date:

Previous
From: Tom Lane
Date:
Subject: Re: [GENERAL] Strange problem with create table as select * from table;
Next
From: J Smith
Date:
Subject: Re: unaccent extension missing some accents