2.3. Rebalancing the Data #
2.3.1. Automatically Rebalancing the Data #
Automatic rebalancing is the default mode. A rebalance process starts automatically after nodes are added (unless the --no-rebalance option is set) or before a node is deleted; it can also be started manually. The goal of rebalancing is to distribute the partitions of each sharded table evenly across the replication groups.
For each sharded table, the rebalancing process iteratively finds the replication groups with the maximum and the minimum number of partitions and creates a task to move one partition from the former to the latter. This is repeated while max - min > 1. Partitions are moved using logical replication. Partitions of colocated tables are moved together with the partitions of the sharded tables that they reference.
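The selection of moves can be pictured with a short sketch. The Go snippet below is only an illustrative model of the greedy strategy described above, not Shardman's internal code; the function and variable names are invented for the example. It repeatedly finds the replication groups holding the most and the fewest partitions of one sharded table and plans a single-partition move until the counts differ by at most one.

package main

import "fmt"

// planMoves models the greedy planning loop described above: rgParts maps a
// replication group ID to the number of partitions of one sharded table it
// currently holds. Each planned task is a (source, destination) pair.
func planMoves(rgParts map[int]int) [][2]int {
	var tasks [][2]int
	for {
		maxRg, minRg := -1, -1
		for rg, n := range rgParts {
			if maxRg == -1 || n > rgParts[maxRg] {
				maxRg = rg
			}
			if minRg == -1 || n < rgParts[minRg] {
				minRg = rg
			}
		}
		// Stop once the groups are balanced: max - min <= 1.
		if maxRg == -1 || rgParts[maxRg]-rgParts[minRg] <= 1 {
			return tasks
		}
		// Plan moving one partition from the fullest to the emptiest group.
		rgParts[maxRg]--
		rgParts[minRg]++
		tasks = append(tasks, [2]int{maxRg, minRg})
	}
}

func main() {
	// Example: an empty replication group 3 joins groups 1 and 2, which hold
	// 12 partitions each; the plan ends with an 8/8/8 distribution.
	fmt.Println(planMoves(map[int]int{1: 12, 2: 12, 3: 0}))
}

In this model, partitions of colocated tables need no separate planning: they simply follow the partition of the sharded table they reference.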
It is important to remember that max_logical_replication_workers should be rather high, since the rebalance process uses up to max(max_replication_slots, max_logical_replication_workers, max_worker_processes, max_wal_senders)/3 concurrent threads. In practice, you can use max_logical_replication_workers = Repfactor + 3 * task_num, where task_num is the number of parallel rebalance tasks.
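As a worked example of this formula (the numbers are illustrative only): with Repfactor = 2 and task_num = 3 parallel rebalance tasks, max_logical_replication_workers should be at least 2 + 3 * 3 = 11, and max_replication_slots, max_worker_processes, and max_wal_senders must be high enough that max(...)/3 still covers that level of concurrency.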
To manually rebalance sharded tables in the cluster0 cluster, run the following command, where etcd1, etcd2, and etcd3 are etcd cluster nodes:
$
shardmanctl --store-endpoints http://etcd1:2379,http://etcd2:2379,http://etcd3:2379 rebalance
If the process ends with an error, call the shardmanctl cleanup command with the --after-rebalance option.
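For example, the cleanup call might look like this (using the same --store-endpoints value as in the rebalance command above; adjust the endpoints to your cluster):
$
shardmanctl --store-endpoints http://etcd1:2379,http://etcd2:2379,http://etcd3:2379 cleanup --after-rebalance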
2.3.2. Manually Rebalancing the Data #
There are times when you need to place partitions of sharded tables in a specific way across the cluster nodes. To solve this problem, Shardman supports the manual data rebalancing mode.
How it works:
Get a list of sharded tables using the shardmanctl tables sharded list command. The output is similar to the following:
$
shardmanctl tables sharded list
Sharded tables:
    public.doc
    public.resolution
    public.users
Request information about the selected sharded tables. Example:
$
shardmanctl tables sharded info -t public.users
Table public.users
Partitions:
Partition   RgID   Shard            Master
0           1      clover-1-shrn1   shrn1:5432
1           2      clover-2-shrn2   shrn2:5432
2           3      clover-3-shrn3   shrn3:5432
3           1      clover-1-shrn1   shrn1:5432
4           2      clover-2-shrn2   shrn2:5432
5           3      clover-3-shrn3   shrn3:5432
6           1      clover-1-shrn1   shrn1:5432
7           2      clover-2-shrn2   shrn2:5432
8           3      clover-3-shrn3   shrn3:5432
9           1      clover-1-shrn1   shrn1:5432
10          2      clover-2-shrn2   shrn2:5432
11          3      clover-3-shrn3   shrn3:5432
12          1      clover-1-shrn1   shrn1:5432
13          2      clover-2-shrn2   shrn2:5432
14          3      clover-3-shrn3   shrn3:5432
15          1      clover-1-shrn1   shrn1:5432
16          2      clover-2-shrn2   shrn2:5432
17          3      clover-3-shrn3   shrn3:5432
18          1      clover-1-shrn1   shrn1:5432
19          2      clover-2-shrn2   shrn2:5432
20          3      clover-3-shrn3   shrn3:5432
21          1      clover-1-shrn1   shrn1:5432
22          2      clover-2-shrn2   shrn2:5432
23          3      clover-3-shrn3   shrn3:5432
Move a partition to a new shard, as shown below:
$
shardmanctl --log-level debug tables sharded partmove -t public.users --partnum 1 --shard clover-1-shrn1
2023-07-26T06:00:36.900Z DEBUG cmd/common.go:105 Waiting for metadata lock...
2023-07-26T06:00:36.936Z DEBUG rebalance/service.go:256 take extension lock
2023-07-26T06:00:36.938Z DEBUG broadcaster/worker.go:33 start broadcaster worker for repgroup id=3
2023-07-26T06:00:36.938Z DEBUG broadcaster/worker.go:33 start broadcaster worker for repgroup id=2
2023-07-26T06:00:36.938Z DEBUG broadcaster/worker.go:33 start broadcaster worker for repgroup id=1
2023-07-26T06:00:36.951Z DEBUG broadcaster/worker.go:51 repgroup 3 connect established
2023-07-26T06:00:36.951Z DEBUG broadcaster/worker.go:51 repgroup 2 connect established
2023-07-26T06:00:36.952Z DEBUG broadcaster/worker.go:51 repgroup 1 connect established
2023-07-26T06:00:36.952Z DEBUG extension/lock.go:35 Waiting for extension lock...
2023-07-26T06:00:36.976Z INFO rebalance/service.go:276 Performing move partition...
2023-07-26T06:00:36.977Z DEBUG broadcaster/worker.go:33 start broadcaster worker for repgroup id=3
2023-07-26T06:00:36.978Z DEBUG broadcaster/worker.go:33 start broadcaster worker for repgroup id=2
2023-07-26T06:00:36.978Z DEBUG broadcaster/worker.go:33 start broadcaster worker for repgroup id=1
2023-07-26T06:00:36.987Z DEBUG broadcaster/worker.go:51 repgroup 1 connect established
2023-07-26T06:00:36.989Z DEBUG broadcaster/worker.go:51 repgroup 2 connect established
2023-07-26T06:00:36.992Z DEBUG broadcaster/worker.go:51 repgroup 3 connect established
2023-07-26T06:00:36.992Z DEBUG rebalance/service.go:71 Performing cleanup after possible rebalance operation failure
2023-07-26T06:00:37.077Z DEBUG broadcaster/worker.go:75 finish broadcaster worker for repgroup id=3
2023-07-26T06:00:37.077Z DEBUG broadcaster/worker.go:75 finish broadcaster worker for repgroup id=1
2023-07-26T06:00:37.077Z DEBUG broadcaster/worker.go:75 finish broadcaster worker for repgroup id=2
2023-07-26T06:00:37.082Z DEBUG rebalance/service.go:422 Rebalance will run 1 tasks
2023-07-26T06:00:37.095Z DEBUG rebalance/service.go:452 Guessing that rebalance() can use 3 workers
2023-07-26T06:00:37.096Z DEBUG rebalance/job.go:352 state: Idle {"worker_id": 1, "table": "users", "partition num": 1, "source rgid": 2, "dest rgid": 1, "kind": "move"}
2023-07-26T06:00:37.111Z DEBUG rebalance/job.go:352 state: ConnsEstablished {"worker_id": 1, "table": "users", "partition num": 1, "source rgid": 2, "dest rgid": 1, "kind": "move"}
2023-07-26T06:00:37.171Z DEBUG rebalance/job.go:352 state: WaitInitCopy {"worker_id": 1, "table": "users", "partition num": 1, "source rgid": 2, "dest rgid": 1, "kind": "move"}
2023-07-26T06:00:38.073Z DEBUG rebalance/job.go:347 current state {"worker_id": 1, "table": "users", "partition num": 1, "source rgid": 2, "dest rgid": 1, "kind": "move", "state": "WaitInitialCatchup"}
2023-07-26T06:00:38.073Z DEBUG rebalance/job.go:352 state: WaitInitialCatchup {"worker_id": 1, "table": "users", "partition num": 1, "source rgid": 2, "dest rgid": 1, "kind": "move"}
2023-07-26T06:00:38.084Z DEBUG rebalance/job.go:347 current state {"worker_id": 1, "table": "users", "partition num": 1, "source rgid": 2, "dest rgid": 1, "kind": "move", "state": "WaitFullSync"}
2023-07-26T06:00:38.084Z DEBUG rebalance/job.go:352 state: WaitFullSync {"worker_id": 1, "table": "users", "partition num": 1, "source rgid": 2, "dest rgid": 1, "kind": "move"}
2023-07-26T06:00:38.108Z DEBUG rebalance/job.go:347 current state {"worker_id": 1, "table": "users", "partition num": 1, "source rgid": 2, "dest rgid": 1, "kind": "move", "state": "Committing"}
2023-07-26T06:00:38.108Z DEBUG rebalance/job.go:352 state: Committing {"worker_id": 1, "table": "users", "partition num": 1, "source rgid": 2, "dest rgid": 1, "kind": "move"}
2023-07-26T06:00:38.254Z DEBUG rebalance/job.go:352 state: Complete {"worker_id": 1, "table": "users", "partition num": 1, "source rgid": 2, "dest rgid": 1, "kind": "move"}
2023-07-26T06:00:38.258Z DEBUG rebalance/service.go:583 Produce and process tasks on destination replication groups...
2023-07-26T06:00:38.258Z DEBUG rebalance/service.go:594 Produce and process tasks on source replication groups...
2023-07-26T06:00:38.258Z DEBUG rebalance/service.go:606 wait all tasks finish
2023-07-26T06:00:38.258Z DEBUG rebalance/service.go:531 Analyzing table public.users in rg 1 {"table": "public.users", "rgid": 1, "action": "analyze"}
2023-07-26T06:00:38.573Z DEBUG rebalance/service.go:531 Analyzing table public.users in rg 2 {"table": "public.users", "rgid": 2, "action": "analyze"}
2023-07-26T06:00:38.833Z DEBUG broadcaster/worker.go:75 finish broadcaster worker for repgroup id=1
2023-07-26T06:00:38.833Z DEBUG broadcaster/worker.go:75 finish broadcaster worker for repgroup id=2
2023-07-26T06:00:38.833Z DEBUG broadcaster/worker.go:75 finish broadcaster worker for repgroup id=3
In this example, partition number 1 of the public.users table is moved to the clover-1-shrn1 shard.
After a partition of a sharded table is moved manually, automatic data rebalancing is disabled for this table and for all tables colocated with it.
To get the list of tables with disabled automatic rebalancing, call the shardmanctl tables sharded norebalance
command. Example:
$
shardmanctl tables sharded norebalance
public.users
To enable automatic data rebalancing for a selected sharded table, call the shardmanctl tables sharded rebalance
command, as shown in the example below:
$
shardmanctl tables sharded rebalance -t public.users
2023-07-26T07:07:00.657Z DEBUG cmd/common.go:105 Waiting for metadata lock...
2023-07-26T07:07:00.687Z DEBUG broadcaster/worker.go:33 start broadcaster worker for repgroup id=1
2023-07-26T07:07:00.687Z DEBUG broadcaster/worker.go:33 start broadcaster worker for repgroup id=2
2023-07-26T07:07:00.687Z DEBUG broadcaster/worker.go:33 start broadcaster worker for repgroup id=3
2023-07-26T07:07:00.697Z DEBUG broadcaster/worker.go:51 repgroup 1 connect established
2023-07-26T07:07:00.698Z DEBUG broadcaster/worker.go:51 repgroup 2 connect established
2023-07-26T07:07:00.698Z DEBUG broadcaster/worker.go:51 repgroup 3 connect established
2023-07-26T07:07:00.698Z DEBUG extension/lock.go:35 Waiting for extension lock...
2023-07-26T07:07:00.719Z DEBUG rebalance/service.go:381 Planned moving pnum 21 for table users from rg 1 to rg 2
2023-07-26T07:07:00.719Z INFO rebalance/service.go:244 Performing rebalance...
2023-07-26T07:07:00.720Z DEBUG broadcaster/worker.go:33 start broadcaster worker for repgroup id=1
2023-07-26T07:07:00.720Z DEBUG broadcaster/worker.go:33 start broadcaster worker for repgroup id=2
2023-07-26T07:07:00.720Z DEBUG broadcaster/worker.go:33 start broadcaster worker for repgroup id=3
2023-07-26T07:07:00.732Z DEBUG broadcaster/worker.go:51 repgroup 3 connect established
2023-07-26T07:07:00.732Z DEBUG broadcaster/worker.go:51 repgroup 1 connect established
2023-07-26T07:07:00.734Z DEBUG broadcaster/worker.go:51 repgroup 2 connect established
2023-07-26T07:07:00.734Z DEBUG rebalance/service.go:71 Performing cleanup after possible rebalance operation failure
2023-07-26T07:07:00.791Z DEBUG broadcaster/worker.go:75 finish broadcaster worker for repgroup id=1
2023-07-26T07:07:00.791Z DEBUG broadcaster/worker.go:75 finish broadcaster worker for repgroup id=2
2023-07-26T07:07:00.791Z DEBUG broadcaster/worker.go:75 finish broadcaster worker for repgroup id=3
2023-07-26T07:07:00.795Z DEBUG rebalance/service.go:422 Rebalance will run 1 tasks
2023-07-26T07:07:00.809Z DEBUG rebalance/service.go:452 Guessing that rebalance() can use 3 workers
2023-07-26T07:07:00.809Z DEBUG rebalance/job.go:352 state: Idle {"worker_id": 1, "table": "users", "partition num": 21, "source rgid": 1, "dest rgid": 2, "kind": "move"}
2023-07-26T07:07:00.823Z DEBUG rebalance/job.go:352 state: ConnsEstablished {"worker_id": 1, "table": "users", "partition num": 21, "source rgid": 1, "dest rgid": 2, "kind": "move"}
2023-07-26T07:07:00.880Z DEBUG rebalance/job.go:352 state: WaitInitCopy {"worker_id": 1, "table": "users", "partition num": 21, "source rgid": 1, "dest rgid": 2, "kind": "move"}
2023-07-26T07:07:01.886Z DEBUG rebalance/job.go:347 current state {"worker_id": 1, "table": "users", "partition num": 21, "source rgid": 1, "dest rgid": 2, "kind": "move", "state": "WaitInitialCatchup"}
2023-07-26T07:07:01.886Z DEBUG rebalance/job.go:352 state: WaitInitialCatchup {"worker_id": 1, "table": "users", "partition num": 21, "source rgid": 1, "dest rgid": 2, "kind": "move"}
2023-07-26T07:07:01.904Z DEBUG rebalance/job.go:347 current state {"worker_id": 1, "table": "users", "partition num": 21, "source rgid": 1, "dest rgid": 2, "kind": "move", "state": "WaitFullSync"}
2023-07-26T07:07:01.905Z DEBUG rebalance/job.go:352 state: WaitFullSync {"worker_id": 1, "table": "users", "partition num": 21, "source rgid": 1, "dest rgid": 2, "kind": "move"}
2023-07-26T07:07:01.932Z DEBUG rebalance/job.go:347 current state {"worker_id": 1, "table": "users", "partition num": 21, "source rgid": 1, "dest rgid": 2, "kind": "move", "state": "Committing"}
2023-07-26T07:07:01.932Z DEBUG rebalance/job.go:352 state: Committing {"worker_id": 1, "table": "users", "partition num": 21, "source rgid": 1, "dest rgid": 2, "kind": "move"}
2023-07-26T07:07:02.057Z DEBUG rebalance/job.go:352 state: Complete {"worker_id": 1, "table": "users", "partition num": 21, "source rgid": 1, "dest rgid": 2, "kind": "move"}
2023-07-26T07:07:02.060Z DEBUG rebalance/service.go:583 Produce and process tasks on destination replication groups...
2023-07-26T07:07:02.060Z DEBUG rebalance/service.go:594 Produce and process tasks on source replication groups...
2023-07-26T07:07:02.060Z DEBUG rebalance/service.go:531 Analyzing table public.users in rg 2 {"table": "public.users", "rgid": 2, "action": "analyze"}
2023-07-26T07:07:02.060Z DEBUG rebalance/service.go:606 wait all tasks finish
2023-07-26T07:07:02.321Z DEBUG rebalance/service.go:531 Analyzing table public.users in rg 1 {"table": "public.users", "rgid": 1, "action": "analyze"}
2023-07-26T07:07:02.587Z DEBUG broadcaster/worker.go:75 finish broadcaster worker for repgroup id=3
2023-07-26T07:07:02.587Z DEBUG broadcaster/worker.go:75 finish broadcaster worker for repgroup id=2
2023-07-26T07:07:02.587Z DEBUG broadcaster/worker.go:75 finish broadcaster worker for repgroup id=1
To enable automatic data rebalancing for all sharded tables, run the shardmanctl rebalance
command with the --force
option.
$
shardmanctl rebalance --force