Home > mailing lists

Re: Improve the performance of Unicode Normalization Forms. - Mailing list pgsql-hackers

From	Jeff Davis
Subject	Re: Improve the performance of Unicode Normalization Forms.
Date	June 20, 2025 20:20:08
Msg-id	96c54fc3dde2ebaccafbd1ffa29b8be1005c8b8c.camel@j-davis.com Whole thread Raw
In response to	Re: Improve the performance of Unicode Normalization Forms. (Alexander Borisov <lex.borisov@gmail.com>)
Responses	Re: Improve the performance of Unicode Normalization Forms.
List	pgsql-hackers

Tree view

On Fri, 2025-06-20 at 17:51 +0300, Alexander Borisov wrote:
> I don't quite see how this compares to the implementation on Rust. In
> the link provided, they use perfect hash, which I get rid of and get
> a x2 boost.
> If you take ICU implementations in C++, I have always considered them
> slow, at least when used in C code.
> I may well run benchmarks and compare the performance of the approach
> in Postgres and ICU. But this is beyond the scope of the patches
> under
> discussion.

Are you saying that, with these patches, Postgres will offer the
fastest open-source Unicode normalization? If so, that would be very
cool.

The reason I'm asking is because, if there are multiple open source
implementations, we should either have the best one, or just borrow
another one as long as it has a suitable license (perhaps translating
to C as necessary).

Regards,
    Jeff Davis

pgsql-hackers by date:

From: Jeff Davis
Date: 20 June 2025, 20:15:47
Subject: Re: Improve the performance of Unicode Normalization Forms.

From: Nico Williams
Date: 20 June 2025, 21:45:22
Subject: Re: Improve the performance of Unicode Normalization Forms.

Re: Improve the performance of Unicode Normalization Forms. - Mailing list pgsql-hackers

Previous

Next