pgsql: Rewrite the perl scripts to produce our Unicode conversion table - Mailing list pgsql-committers

From Heikki Linnakangas
Subject pgsql: Rewrite the perl scripts to produce our Unicode conversion table
Date
Msg-id E1cC4QA-0003GT-RH@gemulon.postgresql.org
Whole thread Raw
List pgsql-committers
Rewrite the perl scripts to produce our Unicode conversion tables.

Generate EUC_CN mappings from gb-18030-2000.xml, because GB2312.TXT is no
longer available.

Get UHC from windows-949-2000.xml, it's more up-to-date.

Plus tons more small changes. With these changes, the perl scripts
faithfully produce the *.map files we have in the repository, from the
external source files.

In the passing, fix the Makefile to also download CP932.TXT and CP950.TXT.

Based on patches by Kyotaro Horiguchi, reviewed by Daniel Gustafsson.

Discussion: https://postgr.es/m/08e7892a-d55c-eefe-76e6-7910bc8dd1f3@iki.fi

Branch
------
master

Details
-------
http://git.postgresql.org/pg/commitdiff/1de9cc0dcca649d1900720924f4ea5c430d1a51e

Modified Files
--------------
src/backend/utils/mb/Unicode/Makefile              |  22 +-
src/backend/utils/mb/Unicode/UCS_to_BIG5.pl        | 184 ++--------
src/backend/utils/mb/Unicode/UCS_to_EUC_CN.pl      | 154 +++-----
.../utils/mb/Unicode/UCS_to_EUC_JIS_2004.pl        | 272 +-------------
src/backend/utils/mb/Unicode/UCS_to_EUC_JP.pl      | 398 +++++++++------------
src/backend/utils/mb/Unicode/UCS_to_EUC_KR.pl      | 115 +-----
src/backend/utils/mb/Unicode/UCS_to_EUC_TW.pl      | 142 ++------
src/backend/utils/mb/Unicode/UCS_to_GB18030.pl     |  80 +----
src/backend/utils/mb/Unicode/UCS_to_JOHAB.pl       |  31 ++
.../utils/mb/Unicode/UCS_to_SHIFT_JIS_2004.pl      | 205 ++---------
src/backend/utils/mb/Unicode/UCS_to_SJIS.pl        | 157 ++------
src/backend/utils/mb/Unicode/UCS_to_UHC.pl         |  51 +++
src/backend/utils/mb/Unicode/UCS_to_most.pl        | 116 +-----
src/backend/utils/mb/Unicode/convutils.pm          | 282 +++++++++++++++
.../utils/mb/Unicode/euc_jis_2004_to_utf8.map      |   9 +-
.../mb/Unicode/euc_jis_2004_to_utf8_combined.map   |   7 +-
src/backend/utils/mb/Unicode/euc_jp_to_utf8.map    |   4 +-
src/backend/utils/mb/Unicode/euc_kr_to_utf8.map    |   2 +
src/backend/utils/mb/Unicode/johab_to_utf8.map     |   2 +
.../utils/mb/Unicode/shift_jis_2004_to_utf8.map    |   9 +-
.../mb/Unicode/shift_jis_2004_to_utf8_combined.map |   7 +-
src/backend/utils/mb/Unicode/ucs2utf.pl            |  35 --
src/backend/utils/mb/Unicode/uhc_to_utf8.map       |   2 +
src/backend/utils/mb/Unicode/utf8_to_euc_cn.map    |   2 +
.../utils/mb/Unicode/utf8_to_euc_jis_2004.map      |   9 +-
.../mb/Unicode/utf8_to_euc_jis_2004_combined.map   |   7 +-
src/backend/utils/mb/Unicode/utf8_to_euc_jp.map    |   2 +
src/backend/utils/mb/Unicode/utf8_to_euc_kr.map    |   2 +
src/backend/utils/mb/Unicode/utf8_to_johab.map     |   2 +
.../utils/mb/Unicode/utf8_to_shift_jis_2004.map    |   9 +-
.../mb/Unicode/utf8_to_shift_jis_2004_combined.map |   7 +-
src/backend/utils/mb/Unicode/utf8_to_sjis.map      |   4 +-
src/backend/utils/mb/Unicode/utf8_to_uhc.map       |   2 +
33 files changed, 791 insertions(+), 1541 deletions(-)


pgsql-committers by date:

Previous
From: Heikki Linnakangas
Date:
Subject: pgsql: Remove leading zeros, for consistency with other map files.
Next
From: Peter Eisentraut
Date:
Subject: pgsql: doc: Fix typo