Home > mailing lists

Re: Character expansion with ICU collations - Mailing list pgsql-hackers

From	Peter Eisentraut
Subject	Re: Character expansion with ICU collations
Date	June 9, 2021 17:54:54
Msg-id	f7a2284c-9208-665e-d830-34e55e8d6f4d@enterprisedb.com Whole thread Raw
In response to	Character expansion with ICU collations ("Finnerty, Jim" <jfinnert@amazon.com>)
Responses	Re: Character expansion with ICU collations
List	pgsql-hackers

Tree view

On 09.06.21 17:31, Finnerty, Jim wrote:
> CREATE COLLATION CI_AS (provider = icu, 
> locale=’utf8@colStrength=secondary’, deterministic = false);
> 
> CREATE TABLE MyTable3
> (
> 
>      ID INT IDENTITY(1, 1),
>      Comments VARCHAR(100)
> 
> )
> 
> INSERT INTO MyTable3 (Comments) VALUES ('strasse')
> INSERT INTO MyTable3 (Comments) VALUES ('straße')
> SELECT * FROM MyTable3 WHERE Comments COLLATE CI_AS = 'strasse'
> SELECT * FROM MyTable3 WHERE Comments COLLATE CI_AS = 'straße'
> 
> We would like to control whether each SELECT statement finds both 
> records (because the sort key of ‘ß’ equals the sort key of ‘ss’), or 
> whether each SELECT statement finds just one record.

You can have these queries return both rows if you use an 
accent-ignoring collation, like this example in the documentation:

CREATE COLLATION ignore_accents (provider = icu, locale = 
'und-u-ks-level1-kc-true', deterministic = false);

pgsql-hackers by date:

From: Peter Eisentraut
Date: 09 June 2021, 17:37:01
Subject: Re: Adjust pg_regress output for new long test names

From: Tom Lane
Date: 09 June 2021, 17:58:08
Subject: Re: Character expansion with ICU collations

Re: Character expansion with ICU collations - Mailing list pgsql-hackers

Previous

Next