Re: Slow performance of collate "en_US.utf8" - Mailing list pgsql-performance

From Joe Conway
Subject Re: Slow performance of collate "en_US.utf8"
Date
Msg-id eb319bfd-fbfb-42c1-a01c-406988e177b5@joeconway.com
Whole thread Raw
In response to Re: Slow performance of collate "en_US.utf8"  (Laurenz Albe <laurenz.albe@cybertec.at>)
Responses Re: Slow performance of collate "en_US.utf8"
List pgsql-performance
On 2/28/25 09:16, Laurenz Albe wrote:
> On Thu, 2025-02-27 at 16:54 +0300, Alexey Borschev wrote:
>> I see poor performance of text sorting of collate "en_US.utf8" in PG 17.4.
> 
> I'd say that you would have to complain to the authors of the
> GNU C library, which provides this collation.

Yep -- glibc starting with version 2.21 has a massive performance 
regression for certain cases and the glibc folks have basically said 
they will not fix it. If you try the same thing on RHEL 7.x with glibc 
2.17 it will perform about the same as ICU.

If you are using pg17 you should consider using the new builtin 
collation provider -- it will perform almost as well as the 'C' locale. 
Something like:
--------
CREATE DATABASE builtincoll LOCALE_PROVIDER builtin
BUILTIN_LOCALE 'C.UTF-8' TEMPLATE template0;
--------

-- 
Joe Conway
PostgreSQL Contributors Team
RDS Open Source Databases
Amazon Web Services: https://aws.amazon.com



pgsql-performance by date:

Previous
From: Laurenz Albe
Date:
Subject: Re: Slow performance of collate "en_US.utf8"
Next
From: Thomas Munro
Date:
Subject: Re: Slow performance of collate "en_US.utf8"