Home > mailing lists

is there a way to automate deduplication of strings? - Mailing list pgsql-novice

From	Chris Papademetrious
Subject	is there a way to automate deduplication of strings?
Date	December 27, 2025 15:36:20
Msg-id	DM4PR12MB603953767048EE1B8A39283ADDB1A@DM4PR12MB6039.namprd12.prod.outlook.com Whole thread Raw
Responses	Re: is there a way to automate deduplication of strings?
List	pgsql-novice

Tree view

Hello everyone! First time poster here.

I have a question about deduplicating text strings stored in a database. I am aware of the pattern of creating a separate table for unique values, then referencing those values by key. But this requires some transactional complexity for storage and retrieval, along with cleanup of no-longer-referenced values over time. And, this complexity grows with the number of primary-table columns that use this indirection.

I would only use this for (1) seldom-referenced columns that (2) have a high rate of duplication and (3) have an average string length that makes deduplication worthwhile.

Are there any native or extension-based methods to simplify this in Postgres? I searched and came up empty, but maybe I’m not searching with the right terms.

Thanks!

Chris

pgsql-novice by date:

From: Laurenz Albe
Date: 28 November 2025, 10:15:31
Subject: Re: AW: how long should Archive logs be retained

From: Greg Sabino Mullane
Date: 31 December 2025, 18:11:52
Subject: Re: is there a way to automate deduplication of strings?

is there a way to automate deduplication of strings? - Mailing list pgsql-novice

Previous

Next