Thread: pgsql: Rewrite the perl scripts to produce our Unicode conversion table
pgsql: Rewrite the perl scripts to produce our Unicode conversion table
From
Heikki Linnakangas
Date:
Rewrite the perl scripts to produce our Unicode conversion tables. Generate EUC_CN mappings from gb-18030-2000.xml, because GB2312.TXT is no longer available. Get UHC from windows-949-2000.xml, it's more up-to-date. Plus tons more small changes. With these changes, the perl scripts faithfully produce the *.map files we have in the repository, from the external source files. In the passing, fix the Makefile to also download CP932.TXT and CP950.TXT. Based on patches by Kyotaro Horiguchi, reviewed by Daniel Gustafsson. Discussion: https://postgr.es/m/08e7892a-d55c-eefe-76e6-7910bc8dd1f3@iki.fi Branch ------ master Details ------- http://git.postgresql.org/pg/commitdiff/1de9cc0dcca649d1900720924f4ea5c430d1a51e Modified Files -------------- src/backend/utils/mb/Unicode/Makefile | 22 +- src/backend/utils/mb/Unicode/UCS_to_BIG5.pl | 184 ++-------- src/backend/utils/mb/Unicode/UCS_to_EUC_CN.pl | 154 +++----- .../utils/mb/Unicode/UCS_to_EUC_JIS_2004.pl | 272 +------------- src/backend/utils/mb/Unicode/UCS_to_EUC_JP.pl | 398 +++++++++------------ src/backend/utils/mb/Unicode/UCS_to_EUC_KR.pl | 115 +----- src/backend/utils/mb/Unicode/UCS_to_EUC_TW.pl | 142 ++------ src/backend/utils/mb/Unicode/UCS_to_GB18030.pl | 80 +---- src/backend/utils/mb/Unicode/UCS_to_JOHAB.pl | 31 ++ .../utils/mb/Unicode/UCS_to_SHIFT_JIS_2004.pl | 205 ++--------- src/backend/utils/mb/Unicode/UCS_to_SJIS.pl | 157 ++------ src/backend/utils/mb/Unicode/UCS_to_UHC.pl | 51 +++ src/backend/utils/mb/Unicode/UCS_to_most.pl | 116 +----- src/backend/utils/mb/Unicode/convutils.pm | 282 +++++++++++++++ .../utils/mb/Unicode/euc_jis_2004_to_utf8.map | 9 +- .../mb/Unicode/euc_jis_2004_to_utf8_combined.map | 7 +- src/backend/utils/mb/Unicode/euc_jp_to_utf8.map | 4 +- src/backend/utils/mb/Unicode/euc_kr_to_utf8.map | 2 + src/backend/utils/mb/Unicode/johab_to_utf8.map | 2 + .../utils/mb/Unicode/shift_jis_2004_to_utf8.map | 9 +- .../mb/Unicode/shift_jis_2004_to_utf8_combined.map | 7 +- src/backend/utils/mb/Unicode/ucs2utf.pl | 35 -- src/backend/utils/mb/Unicode/uhc_to_utf8.map | 2 + src/backend/utils/mb/Unicode/utf8_to_euc_cn.map | 2 + .../utils/mb/Unicode/utf8_to_euc_jis_2004.map | 9 +- .../mb/Unicode/utf8_to_euc_jis_2004_combined.map | 7 +- src/backend/utils/mb/Unicode/utf8_to_euc_jp.map | 2 + src/backend/utils/mb/Unicode/utf8_to_euc_kr.map | 2 + src/backend/utils/mb/Unicode/utf8_to_johab.map | 2 + .../utils/mb/Unicode/utf8_to_shift_jis_2004.map | 9 +- .../mb/Unicode/utf8_to_shift_jis_2004_combined.map | 7 +- src/backend/utils/mb/Unicode/utf8_to_sjis.map | 4 +- src/backend/utils/mb/Unicode/utf8_to_uhc.map | 2 + 33 files changed, 791 insertions(+), 1541 deletions(-)