Re: Checkpoint gets stuck in mdsync - Mailing list pgsql-hackers

From Heikki Linnakangas
Subject Re: Checkpoint gets stuck in mdsync
Date
Msg-id 4614D660.5080503@enterprisedb.com
Whole thread Raw
In response to Re: Checkpoint gets stuck in mdsync  (ITAGAKI Takahiro <itagaki.takahiro@oss.ntt.co.jp>)
List pgsql-hackers
ITAGAKI Takahiro wrote:
> Heikki Linnakangas <heikki@enterprisedb.com> wrote:
> 
>> Now that the CheckpointStartLock starvation has been taken care of, I'm 
>> seeing another problem with checkpoints in my test run: mdsync never 
>> finishes.
>>
>> My proposed fix is to make a copy of pendingOpsTable before entering the 
>> loop. AbsorbFsyncRequest will put new requests to a fresh new 
>> pendingOpsTable, while the mdsync loop will drain the copy. I'll write a 
>> patch along those lines if there's no better ideas.
> 
> Yeah, I'm also anxious about the stuck. I wrote a fix to use a copy of
> pendingOpsTable as you said, when I implemented Load distributed checkpoint
> patch. (http://momjian.us/mhonarc/patches/msg00025.html) It would make me
> very happy if you review my patch and check whether my fix is proper.

I just posted a patch to pgsql-patches that fixes the issue along the 
lines of your Load distributed checkpoint patch. Load distributed 
checkpoint patch now just needs to add the "calculate total file length" 
and the nap delay to mdsync.

Thanks for the patch!

--   Heikki Linnakangas  EnterpriseDB   http://www.enterprisedb.com


pgsql-hackers by date:

Previous
From: "Albe Laurenz"
Date:
Subject: Re: Bug in UTF8-Validation Code?
Next
From: NikhilS
Date:
Subject: Re: Auto Partitioning