Home > mailing lists

suggestion about time based partitioning and hibernate - Mailing list pgsql-general

From	Luca Ferrari
Subject	suggestion about time based partitioning and hibernate
Date	July 18, 2023 09:18:01
Msg-id	CAKoxK+6xJeTK4e8nQzu62preuxT9zsPmsbw0ESyFLOWfpoWifw@mail.gmail.com Whole thread Raw
Responses	Re: suggestion about time based partitioning and hibernate Re: suggestion about time based partitioning and hibernate Re: suggestion about time based partitioning and hibernate
List	pgsql-general

Tree view

Dear all,
I'm looking for ideas here, and it could be someone already stepped
into declarative partitioning of an existing database where Hibernate
(a Java ORM) handles the tables.
The situation is as follows:

create table foo( id primary key, a_date date, ... );

Now, the trivial way to partition this would be on a range based on
a_date, so that the primary key of the tables shifts from id to (id,
a_date). One thing that frightens me is that Hibernate does a lot of
per-row lookups by means of the id, so while the partitioning is
probably going to make things more manageable and even faster in some
scenarios, could lead to drawbacks when Hibernate queries by id.
Moreover, hibernate will think id is unique while it is not anymore.
Last but not least, referencing foreign keys are made by Hibernate
thru the id column, and it means that incoming foreign keys to foo
will not be in place anymore.

Now, I know that I can define a composite key in hibernate, in order
to match the effective new data structure, but this requires a huge
rewrite of the application code.
And after all, we are talking about a non-PostgreSQL related piece, so
the problem is not on the PostgreSQL side.

Anyone has already done a partitioning in such scenario?

I am thinking that partitioning on an hash of id could be the only way
to go without having to touch the hibernate side, even if this would
bring up a less balanced partitioned structure. In such case, I mean
partitioning by hash, having a table with 60 millions rows per 50 GB
in size, what would be the rule of thumb to select the number of
partitions (i.e., a suggested modulus)?

I will appreciate any suggestion.

Thanks,
Luca

pgsql-general by date:

From: Gert Cuykens
Date: 18 July 2023, 02:53:23
Subject: backup_manifest rename to backup_manifest.old after successful postgres start up

From: Ron
Date: 18 July 2023, 09:47:58
Subject: Re: suggestion about time based partitioning and hibernate

suggestion about time based partitioning and hibernate - Mailing list pgsql-general

Previous

Next