Re: BUG #13440: unaccent does not remove all diacritics - Mailing list pgsql-bugs

From Léonard Benedetti
Subject Re: BUG #13440: unaccent does not remove all diacritics
Date
Msg-id 56E3909B.5060702@mlpo.fr
In response to Re: BUG #13440: unaccent does not remove all diacritics  (Peter Eisentraut <peter_e@gmx.net>)
Responses Re: BUG #13440: unaccent does not remove all diacritics
List pgsql-bugs
12/03/2016 04:02, Peter Eisentraut wrote:
> On 3/11/16 1:16 PM, Tom Lane wrote:
>> Léonard Benedetti <benedetti@mlpo.fr> writes:
>>> Despite all that, I think this transition to Python 3 is wise, it is
>>> available since 2008. Python 2 is legacy and its last version (2.7) is a
“end-of-life release”.
>> Doesn't matter.  We support both Python 2 and 3, and this script must
>> do so as well, else it's not getting committed.  Any desupport for
>> Python 2 in PG is very far away; no one has even suggested we consider
>> it yet.
> This script is only run occasionally when the unaccent data needs to be
> updated from Unicode data, so it's not really that important what
> language and version it's written in.  That said, the mentioned reason
> for changing this to Python 3 is so that one can include Unicode
> characters into the source text, which I find undesirable in general
> (for PostgreSQL source code) and not very useful in this particular
> case.  I think the script can be kept in Python 2 style.  Making it
> upward compatible with Python 3 can be a separate (small) project.
>
I completely agree. This script does not need to be run regularly (as
mentioned, only when the Unicode standard or the transliterator's
character set changes). Moreover, even when it does need to be run, users
can wait for the next version of PostgreSQL, in which the rules file will
already have been updated. So it is indeed a one-off task, and the
language of this script is not so important.

However, concerning support for Unicode characters in the source code,
the version of Python does not change much (both versions support it).
The change to Python 3 was made rather to anticipate the end of life of
Python 2. But as Tom Lane pointed out, that is not going to happen soon
(according to PEP 0373: “The current plan is to support [Python 2] for
at least 10 years from the initial 2.7 release. This means there will be
bugfix releases until 2020.”). Furthermore, as I stated above, the
adaptation to Python 3 was quite trivial and could easily be made in due
course.
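
To illustrate the point about source encoding and version compatibility,
here is a minimal sketch (not the script from the patch) of code that is
intended to run unchanged under both Python 2.7 and Python 3 while keeping
the source file pure ASCII. The function names strip_diacritics and
rule_line are hypothetical, and the tab-separated "source, target" line
layout is only my assumption about what a rule entry looks like. It also
relies on plain NFD decomposition, which is a simplification and does not
cover characters without a canonical decomposition.

```python
# Hypothetical sketch only; names and output layout are assumptions.
from __future__ import print_function, unicode_literals

import unicodedata


def strip_diacritics(char):
    """Return char with combining marks removed, using NFD decomposition."""
    decomposed = unicodedata.normalize("NFD", char)
    return "".join(c for c in decomposed if not unicodedata.combining(c))


def rule_line(char):
    """Format one 'source<TAB>target' entry, or None if nothing changes."""
    plain = strip_diacritics(char)
    if plain == char:
        return None
    return "%s\t%s" % (char, plain)


# Non-ASCII characters are written as \u escapes, so no literal Unicode
# character appears in the source text.
samples = ["\u00e9", "\u00c5", "a"]
rules = [rule_line(c) for c in samples]
```

Writing the characters as escapes sidesteps the question of literal
Unicode in the source entirely, and the __future__ imports are what keep
the string handling the same under both interpreter versions.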

So I think we can keep just a Python 2 version for now. If everyone
agrees, I'll update the files and the patch.

Léonard Benedetti


