Re: Inexplicable duplicate rows with unique constraint - Mailing list pgsql-general

From Daniel Verite
Subject Re: Inexplicable duplicate rows with unique constraint
Date
Msg-id f2f4c44f-5673-496b-b42b-cf63ba483a5e@manitou-mail.org
Whole thread Raw
In response to Inexplicable duplicate rows with unique constraint  (Richard van der Hoff <richard@matrix.org>)
Responses Re: Inexplicable duplicate rows with unique constraint
List pgsql-general
    Richard van der Hoff wrote:

> synapse=> select count(*), room_id, type, state_key from
> current_state_events group by 2,3,4 order by count(*) DESC LIMIT 2;
>  count |              room_id              |     type      |   state_key
> -------+-----------------------------------+---------------+-------------------------------------
>      3 | !ueLfVrSWYGMnFnoCbT:darkfasel.net | m.room.member |
> @irc_ebi_:darkfasel.net
>      3 | !HwocBmCtBcHQhILtYQ:matrix.org    | m.room.member |
> @freenode_AlmarShenwan_:matrix.org

Looking at these columns which are of type text but do not
contain words of any particular language, there's probably
no point in using a linguistic-aware collation for them.

If you maintain the database schema, what you could do to avoid
the dependency on the OS collation and stay clear of the particular
upgrade difficulty of collations is to use COLLATE "C" for this kind of
field, as opposed to the default collation of the database.
As a bonus, operations with the "C" collations tend to be faster,
sometimes even much faster.


Best regards,
--
Daniel Vérité
PostgreSQL-powered mailer: http://www.manitou-mail.org
Twitter: @DanielVerite



pgsql-general by date:

Previous
From: Richard van der Hoff
Date:
Subject: Re: Inexplicable duplicate rows with unique constraint
Next
From: Adrian Klaver
Date:
Subject: Re: Inexplicable duplicate rows with unique constraint