Re: BUG #18216: Unaccent function is unable to remove accents (diacritic signs) from Japanese character 'ド' - Mailing list pgsql-bugs

From Tom Lane
Subject Re: BUG #18216: Unaccent function is unable to remove accents (diacritic signs) from Japanese character 'ド'
Date
Msg-id 4143551.1701183515@sss.pgh.pa.us
In response to BUG #18216: Unaccent function is unable to remove accents (diacritic signs) from Japanese character 'ド'  (PG Bug reporting form <noreply@postgresql.org>)
Responses Re: BUG #18216: Unaccent function is unable to remove accents (diacritic signs) from Japanese character 'ド'  (Michael Paquier <michael@paquier.xyz>)
List pgsql-bugs
PG Bug reporting form <noreply@postgresql.org> writes:
> PostgreSQL's unaccent module does not use Unicode normalisation, but only a
> simple search-and-replace dictionary. The dictionary, unaccent.rules
> (https://github.com/postgres/postgres/blob/master/contrib/unaccent/unaccent.rules),
> does not contain these Japanese characters, so it is unable to remove
> the diacritic signs. Can someone please advise when we can expect these
> Japanese characters to be added?

unaccent.rules, as distributed, is just an example.  It is not meant
to be exhaustive or authoritative.  Feel free to add your own entries
to your copy.

            regards, tom lane
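
As an illustration of "add your own entries", here is a minimal sketch of one way to generate candidate rule lines for kana with voicing marks. It is not part of the original message or of PostgreSQL. It assumes that each rules line holds the source character, whitespace, and the replacement, and that canonical decomposition (NFD) separates the dakuten or handakuten from the base kana. The function names and the code point range are hypothetical choices made for this example.

```python
import unicodedata


def strip_marks(ch: str) -> str:
    """Return ch with combining marks removed after canonical decomposition."""
    decomposed = unicodedata.normalize("NFD", ch)
    # combining() is non-zero for combining marks such as U+3099 (dakuten)
    # and U+309A (handakuten).
    return "".join(c for c in decomposed if unicodedata.combining(c) == 0)


def kana_rules(start: int = 0x3041, end: int = 0x30FF) -> list[tuple[str, str]]:
    """Collect (accented, base) pairs for the hiragana and katakana blocks.

    The default range is an assumption for this sketch; adjust as needed.
    """
    rules = []
    for cp in range(start, end + 1):
        ch = chr(cp)
        base = strip_marks(ch)
        # Skip characters that are unchanged, and bare combining marks
        # (which strip down to an empty string).
        if base and base != ch:
            rules.append((ch, base))
    return rules


def format_rules(rules: list[tuple[str, str]]) -> str:
    """Render pairs as lines of: source, a tab, replacement."""
    return "".join(f"{src}\t{dst}\n" for src, dst in rules)
```

The output of format_rules(kana_rules()) could be appended to a UTF-8 encoded copy of unaccent.rules. Whether stripping voicing marks is linguistically appropriate is a separate question, since 'ド' and 'ト' are distinct sounds in Japanese, so review the generated lines before using them.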


