pgsql: Add Unicode property tables. - Mailing list pgsql-committers

From Jeff Davis
Subject pgsql: Add Unicode property tables.
Msg-id E1rhyEz-002S6h-Qe@gemulon.postgresql.org
List pgsql-committers
Add Unicode property tables.

Provide functions to test for Unicode properties, such as Alphabetic
or Cased. These functions use tables derived from Unicode data files,
similar to the tables for Unicode normalization or general category,
and those tables can be updated with the 'update-unicode' build
target.
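
As an illustration of the table-driven approach described above, here is a
minimal sketch of a property test backed by a sorted table of code point
ranges. All names here (demo_range, demo_alphabetic, demo_is_alphabetic)
are hypothetical and are not the identifiers used in this commit. The table
is a tiny hand-written excerpt, not generated output, so code points outside
it simply report false.

```c
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical inclusive code point range; not the struct from the commit. */
typedef struct
{
	uint32_t	first;
	uint32_t	last;
} demo_range;

/*
 * Hand-written excerpt standing in for a generated table. Entries must be
 * sorted and non-overlapping so that binary search works.
 */
static const demo_range demo_alphabetic[] = {
	{0x0041, 0x005A},			/* A-Z */
	{0x0061, 0x007A},			/* a-z */
	{0x00AA, 0x00AA},			/* feminine ordinal indicator */
	{0x00B5, 0x00B5},			/* micro sign */
	{0x00BA, 0x00BA},			/* masculine ordinal indicator */
	{0x00C0, 0x00D6},			/* Latin-1 letters before the multiplication sign */
	{0x00D8, 0x00F6},			/* Latin-1 letters before the division sign */
	{0x00F8, 0x00FF}			/* remaining Latin-1 letters */
};

/* Binary search: true if cp falls inside any range of the table. */
bool
demo_range_search(const demo_range *tbl, size_t size, uint32_t cp)
{
	size_t		lo = 0;
	size_t		hi = size;

	while (lo < hi)
	{
		size_t		mid = lo + (hi - lo) / 2;

		if (cp < tbl[mid].first)
			hi = mid;
		else if (cp > tbl[mid].last)
			lo = mid + 1;
		else
			return true;
	}
	return false;
}

/* Property test for the excerpted (illustrative) Alphabetic table. */
bool
demo_is_alphabetic(uint32_t cp)
{
	return demo_range_search(demo_alphabetic,
							 sizeof(demo_alphabetic) / sizeof(demo_alphabetic[0]),
							 cp);
}
```

With this excerpt, demo_is_alphabetic(0x00B5) is true and
demo_is_alphabetic(0x00D7) (the multiplication sign) is false. A generated
table would cover the full code point space in the same way.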

Use Unicode properties to provide functions to test for regex
character classes, like 'punct' or 'alnum'.
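
One way such character classes can be composed from property tests is
sketched below. The definitions loosely follow the compatibility properties
in Unicode Technical Standard #18, Annex C; whether the commit uses exactly
these definitions is an assumption. Every name is hypothetical, and the
helper predicates are ASCII-only stand-ins for real table lookups.

```c
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* ASCII-only stand-in for an Alphabetic property lookup. */
bool
demo_prop_alphabetic(uint32_t cp)
{
	return (cp >= 'A' && cp <= 'Z') || (cp >= 'a' && cp <= 'z');
}

/* ASCII-only stand-in for general category Nd (decimal digit). */
bool
demo_gc_decimal_digit(uint32_t cp)
{
	return cp >= '0' && cp <= '9';
}

/* True if cp is a non-NUL ASCII character contained in set. */
static bool
demo_ascii_member(uint32_t cp, const char *set)
{
	return cp > 0 && cp < 128 && strchr(set, (int) cp) != NULL;
}

/* ASCII characters whose general category is punctuation (P*). */
bool
demo_gc_punctuation(uint32_t cp)
{
	return demo_ascii_member(cp, "!\"#%&'()*,-./:;?@[\\]_{}");
}

/* ASCII characters whose general category is symbol (S*). */
bool
demo_gc_symbol(uint32_t cp)
{
	return demo_ascii_member(cp, "$+<=>^`|~");
}

/* 'alnum': Alphabetic property or decimal digit. */
bool
demo_isalnum(uint32_t cp)
{
	return demo_prop_alphabetic(cp) || demo_gc_decimal_digit(cp);
}

/*
 * 'punct': punctuation categories; in the POSIX-compatible variant,
 * non-alphabetic symbols count as well.
 */
bool
demo_ispunct(uint32_t cp, bool posix)
{
	if (demo_gc_punctuation(cp))
		return true;
	return posix && demo_gc_symbol(cp) && !demo_prop_alphabetic(cp);
}
```

In this sketch, '+' is a symbol rather than punctuation, so
demo_ispunct('+', false) is false while demo_ispunct('+', true) is true.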

This is infrastructure in preparation for a builtin collation provider,
and it may also be useful for other callers.

Discussion: https://postgr.es/m/ff4c2f2f9c8fc7ca27c1c24ae37ecaeaeaff6b53.camel%40j-davis.com
Reviewed-by: Daniel Verite, Peter Eisentraut, Jeremy Schneider

Branch
------
master

Details
-------
https://git.postgresql.org/pg/commitdiff/ad49994538c57270c244fab4e1de040568066313

Modified Files
--------------
src/common/unicode/Makefile                        |    6 +-
src/common/unicode/README                          |   45 +-
src/common/unicode/category_test.c                 |  222 +-
.../unicode/generate-unicode_category_table.pl     |  390 ++-
src/common/unicode/meson.build                     |    4 +-
src/common/unicode_category.c                      |  318 +-
src/include/common/unicode_category.h              |   27 +-
src/include/common/unicode_category_table.h        | 3694 +++++++++++++++++++-
8 files changed, 4604 insertions(+), 102 deletions(-)

