Tatsuo Ishii <ishii@postgresql.org> writes:
> BTW, same characters are assigned different code points are pretty
> common in many character sets (Unicode, for example).
This is widely considered a security bug; read section 10 in RFC 3629 (the
definition of UTF8), and search the CVE database a bit if you still doubt
it's a threat. I'm going to push back very hard on any suggestion that
Postgres should build itself around a text representation with that kind
of weakness designed in.
regards, tom lane
[1] http://tools.ietf.org/html/rfc3629#section-10