Thread: pgsql: Rewrite the perl scripts to produce our Unicode conversion table

From: Heikki Linnakangas
Date:
Rewrite the perl scripts to produce our Unicode conversion tables.

Generate EUC_CN mappings from gb-18030-2000.xml, because GB2312.TXT is no
longer available.

Get UHC from windows-949-2000.xml, which is more up-to-date.

Plus tons more small changes. With these changes, the perl scripts
faithfully produce the *.map files we have in the repository, from the
external source files.
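
To illustrate the kind of transformation these scripts perform, here is a
minimal sketch in Python (not the Perl code from this commit, and not the
convutils.pm API). It assumes an input in the plain-text style of files such
as CP932.TXT, where each line holds a native code and a Unicode code point in
hex followed by a "#" comment. The function names read_mapping and
ucs_to_utf8_int are hypothetical, and the sample lines are an illustrative
excerpt rather than a copy of any source file.

```python
def ucs_to_utf8_int(ucs: int) -> int:
    """Return the UTF-8 encoding of a code point, packed into one integer."""
    return int.from_bytes(chr(ucs).encode("utf-8"), "big")


def read_mapping(lines):
    """Parse 'native-code unicode-code # comment' lines into two dicts.

    Returns (to_utf8, from_utf8), keyed by native code and by packed
    UTF-8 value respectively. Lines without both fields are skipped.
    """
    to_utf8 = {}
    from_utf8 = {}
    for line in lines:
        # Drop the trailing comment and surrounding whitespace.
        line = line.split("#", 1)[0].strip()
        if not line:
            continue
        fields = line.split()
        if len(fields) < 2:
            continue  # native code with no Unicode assignment
        code = int(fields[0], 16)
        ucs = int(fields[1], 16)
        utf8 = ucs_to_utf8_int(ucs)
        to_utf8[code] = utf8
        from_utf8[utf8] = code
    return to_utf8, from_utf8


# Illustrative input only; assumed format, not an actual source file.
sample = [
    "# hypothetical excerpt",
    "0x8140\t0x3000\t# IDEOGRAPHIC SPACE",
    "0x8141\t0x3001\t# IDEOGRAPHIC COMMA",
]

to_utf8, from_utf8 = read_mapping(sample)
for code in sorted(to_utf8):
    print(f"0x{code:04x} -> 0x{to_utf8[code]:06x}")
```

The sketch builds both directions in one pass over the source, which mirrors
the fact that each encoding has a paired *_to_utf8.map and utf8_to_*.map
file in the list below. The real scripts also handle XML sources and
combined characters, which this sketch leaves out.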

In passing, fix the Makefile to also download CP932.TXT and CP950.TXT.

Based on patches by Kyotaro Horiguchi, reviewed by Daniel Gustafsson.

Discussion: https://postgr.es/m/08e7892a-d55c-eefe-76e6-7910bc8dd1f3@iki.fi

Branch
------
master

Details
-------
http://git.postgresql.org/pg/commitdiff/1de9cc0dcca649d1900720924f4ea5c430d1a51e

Modified Files
--------------
src/backend/utils/mb/Unicode/Makefile              |  22 +-
src/backend/utils/mb/Unicode/UCS_to_BIG5.pl        | 184 ++--------
src/backend/utils/mb/Unicode/UCS_to_EUC_CN.pl      | 154 +++-----
.../utils/mb/Unicode/UCS_to_EUC_JIS_2004.pl        | 272 +-------------
src/backend/utils/mb/Unicode/UCS_to_EUC_JP.pl      | 398 +++++++++------------
src/backend/utils/mb/Unicode/UCS_to_EUC_KR.pl      | 115 +-----
src/backend/utils/mb/Unicode/UCS_to_EUC_TW.pl      | 142 ++------
src/backend/utils/mb/Unicode/UCS_to_GB18030.pl     |  80 +----
src/backend/utils/mb/Unicode/UCS_to_JOHAB.pl       |  31 ++
.../utils/mb/Unicode/UCS_to_SHIFT_JIS_2004.pl      | 205 ++---------
src/backend/utils/mb/Unicode/UCS_to_SJIS.pl        | 157 ++------
src/backend/utils/mb/Unicode/UCS_to_UHC.pl         |  51 +++
src/backend/utils/mb/Unicode/UCS_to_most.pl        | 116 +-----
src/backend/utils/mb/Unicode/convutils.pm          | 282 +++++++++++++++
.../utils/mb/Unicode/euc_jis_2004_to_utf8.map      |   9 +-
.../mb/Unicode/euc_jis_2004_to_utf8_combined.map   |   7 +-
src/backend/utils/mb/Unicode/euc_jp_to_utf8.map    |   4 +-
src/backend/utils/mb/Unicode/euc_kr_to_utf8.map    |   2 +
src/backend/utils/mb/Unicode/johab_to_utf8.map     |   2 +
.../utils/mb/Unicode/shift_jis_2004_to_utf8.map    |   9 +-
.../mb/Unicode/shift_jis_2004_to_utf8_combined.map |   7 +-
src/backend/utils/mb/Unicode/ucs2utf.pl            |  35 --
src/backend/utils/mb/Unicode/uhc_to_utf8.map       |   2 +
src/backend/utils/mb/Unicode/utf8_to_euc_cn.map    |   2 +
.../utils/mb/Unicode/utf8_to_euc_jis_2004.map      |   9 +-
.../mb/Unicode/utf8_to_euc_jis_2004_combined.map   |   7 +-
src/backend/utils/mb/Unicode/utf8_to_euc_jp.map    |   2 +
src/backend/utils/mb/Unicode/utf8_to_euc_kr.map    |   2 +
src/backend/utils/mb/Unicode/utf8_to_johab.map     |   2 +
.../utils/mb/Unicode/utf8_to_shift_jis_2004.map    |   9 +-
.../mb/Unicode/utf8_to_shift_jis_2004_combined.map |   7 +-
src/backend/utils/mb/Unicode/utf8_to_sjis.map      |   4 +-
src/backend/utils/mb/Unicode/utf8_to_uhc.map       |   2 +
33 files changed, 791 insertions(+), 1541 deletions(-)