Re: BUG #13440: unaccent does not remove all diacritics - Mailing list pgsql-bugs

From Alvaro Herrera
Subject Re: BUG #13440: unaccent does not remove all diacritics
Date
Msg-id 20150615045056.GM133018@postgresql.org
Whole thread Raw
In response to Re: BUG #13440: unaccent does not remove all diacritics  (Michael Gradek <mike@busbud.com>)
Responses Re: BUG #13440: unaccent does not remove all diacritics  (Tom Lane <tgl@sss.pgh.pa.us>)
List pgsql-bugs
Michael Gradek wrote:

> grand-central=# select 'ț' as input, unaccent('ț') as observed, 't' as
> expected;
>
>  input | observed | expected
>
> -------+----------+----------
>
>  ț     | ț        | t

> > Hm, I do see
> >
> > ţ       t

My terminal shows these characters to be different.  One is
http://graphemica.com/%C8%9B
    latin small letter t with comma below (U+021B)

The other is
http://graphemica.com/%C5%A3
    latin small letter t with cedilla (U+0163)

--
Álvaro Herrera                http://www.2ndQuadrant.com/
PostgreSQL Development, 24x7 Support, Remote DBA, Training & Services

pgsql-bugs by date:

Previous
From: Thomas Munro
Date:
Subject: Re: BUG #13440: unaccent does not remove all diacritics
Next
From: Michael Paquier
Date:
Subject: Re: Lack of Sanity Checking in file 'misc.c' for PostgreSQL 9.4.x