Home > mailing lists

Re: URGENT: Out of disk space pg_xlog - Mailing list pgsql-performance

From	Simon Riggs
Subject	Re: URGENT: Out of disk space pg_xlog
Date	December 29, 2006 14:35:16
Msg-id	1167416298.3903.230.camel@silverbirch.site Whole thread Raw
In response to	Re: URGENT: Out of disk space pg_xlog (Bruce Momjian <bruce@momjian.us>)
List	pgsql-performance

Tree view

On Sat, 2006-12-23 at 13:13 -0500, Bruce Momjian wrote:

> The bottom line is that we know of now cases where a long-running
> transaction would delay recycling of the WAL files, so there is
> certainly something not understood here.

We can see from all of this that a checkpoint definitely didn't occur.
Tom's causal chain was just one way that could have happened, there
could well be others.

I've noticed previously that a checkpoint can be starved out when trying
to acquire the CheckpointStartLock. I've witnessed a two minute delay
plus in obtaining the lock in the face of heavy transactions.

If wal_buffers is small enough, WAL write rate high enough and the
transaction rate high enough, a long queue can form for the
WALWriteLock, which ensures that the CheckpointStartLock would queue
indefinitely.

I've tried implementing a queueable shared lock for the
CheckpointStartLock. That helps the checkpoint, but it harms performance
of other transactions waiting to commit, so I let that idea go.

--
  Simon Riggs
  EnterpriseDB   http://www.enterprisedb.com

pgsql-performance by date:

From: Sebastián Baioni
Date: 29 December 2006, 14:04:05
Subject: Re: Backup/Restore too slow

From: Russell Smith
Date: 29 December 2006, 17:25:28
Subject: Re: Backup/Restore too slow

Re: URGENT: Out of disk space pg_xlog - Mailing list pgsql-performance

Previous

Next