Re: [External] Re: pgadmin--pgagent---the process hang by unknow reasons - Mailing list pgadmin-support

From Dave Page
Subject Re: [External] Re: pgadmin--pgagent---the process hang by unknow reasons
Date
Msg-id CA+OCxox9D3VZLNZZBCrCDg5n+Te80Th7oF2k_mYGrK7rqQ1PKQ@mail.gmail.com
Whole thread Raw
In response to 答复: [External] Re: pgadmin--pgagent---the process hang by unknow reasons  (Zhiyu ZY13 Xu <xuzy13@lenovo.com>)
Responses 答复: [External] Re: pgadmin--pgagent---the process hang by unknow reasons  (Zhiyu ZY13 Xu <xuzy13@lenovo.com>)
List pgadmin-support
Hi

On Thu, Nov 26, 2020 at 4:45 PM Zhiyu ZY13 Xu <xuzy13@lenovo.com> wrote:

Hi Dave

 

   Thanks for your quick response.

This env was deploy on Jan 2019 by my team mate. Currently the pgagent have 30 jobs running. The version is  pgagent_10-3.4.0

I don’t know how to upgrade the pgagent. I try to find upgrade document. But failed.

Only find that edb ppas could upgrade pgagent.

https://www.enterprisedb.com/edb-docs/d/edb-postgres-advanced-server/installation-getting-started/upgrade-guide/11/EDB_Postgres_Advanced_Server_Upgrade_Guide.1.13.html

 


The PGAgent that comes with EDB Advanced Server is quite different from the Open Source version. Assuming you're using the RPM packages on RHEL/CentOS 6, you should just be able to use "yum upgrade ..." to upgrade to the latest version. Looking at the postgresql-common repository on yum.postgresql.org, I see that v4.0.0 is available (https://ftp.postgresql.org/pub/repos/yum/common/redhat/rhel-6-x86_64/)
 

If I re-install the pgagent with latest version. Whether the old pgagent jobs will drop with old version pgagent ?

Would you like to guide me to make pgagent using new Boost package and no impact currently working pgagent jobs ?

I don’t want to rebuild all pgagent jobs. Thanks in advance.


Upgrading pgAgent will not affect the jobs you have defined already. 
 

 

 

 

徐志宇(Jack

Database Engineer

    

DB Team,ITS. Lenovo China

Phone: 86-18910860709

Email:xuzy13@lenovo.com

No.6 Shangdi West Road, Haidian District Beijing, China, 100085

 

发件人: Dave Page <dpage@pgadmin.org>
发送时间: 20201126 19:39
收件人: Zhiyu ZY13 Xu <xuzy13@lenovo.com>
抄送: pgadmin-support@postgresql.org
主题: [External] Re: pgadmin--pgagent---the process hang by unknow reasons

 

Hi

 

Given the libwx* references in your stacktrace, you appear to be using an old version of pgagent - we removed the dependency on wxWidgets nearly 2.5 years ago and replaced it with Boost.

 

Please upgrade and try again.

 

Thanks.

 

On Thu, Nov 26, 2020 at 8:05 AM Zhiyu ZY13 Xu <xuzy13@lenovo.com> wrote:

Hi Support

 

   I using pgagent over 2 years. There are over 30 jobs running by pgagent.  Recently. I found a problem that sometime the pgagent hang by unknow reasons.

   From the stack information. Look like the pagent experience  dead-lock issue in code.

   The stack display many thread stop on this function in __lll_lock_wait

   If you need more information. Please let me know. I suspect this is a bug.

 

I collect to pgagent trace log and stack information on the attachment.

 

pgagent trace log

pg_agent_11_24.log

pg_agent_11_26.log

pgagent process stack

others information.

 

 

 

version:

pgagent_10-3.4.0-10.rhel6.x86_64

PG 10.5

 

The typical stack information.

 

[postgres@sltfjfrauxq pgagent_pd]$ cat 23389.stark.1

Thread 7 (Thread 0x7ff745f5c700 (LWP 906)):

#0  0x00007ff74b003334 in __lll_lock_wait () from /lib64/libpthread.so.0

#1  0x00007ff74affe5d8 in _L_lock_854 () from /lib64/libpthread.so.0

#2  0x00007ff74affe4a7 in pthread_mutex_lock () from /lib64/libpthread.so.0

#3  0x00007ff74ba979c9 in wxMutexInternal::Lock() () from /usr/lib64/libwx_baseu-2.8.so.0

#4  0x00007ff74c15b819 in DBconn::Return() ()

#5  0x00007ff74c161217 in Job::Execute() ()

#6  0x00007ff74c162899 in JobThread::Entry() ()

#7  0x00007ff74ba99021 in wxThreadInternal::PthreadStart(wxThread*) () from /usr/lib64/libwx_baseu-2.8.so.0

#8  0x00007ff74affcaa1 in start_thread () from /lib64/libpthread.so.0

#9  0x00007ff74ad49c4d in clone () from /lib64/libc.so.6

Thread 6 (Thread 0x7ff72ffff700 (LWP 908)):

#0  0x00007ff74b003334 in __lll_lock_wait () from /lib64/libpthread.so.0

#1  0x00007ff74affe5d8 in _L_lock_854 () from /lib64/libpthread.so.0

#2  0x00007ff74affe4a7 in pthread_mutex_lock () from /lib64/libpthread.so.0

#3  0x00007ff74ba979c9 in wxMutexInternal::Lock() () from /usr/lib64/libwx_baseu-2.8.so.0

#4  0x00007ff74c15b819 in DBconn::Return() ()

#5  0x00007ff74c161217 in Job::Execute() ()

#6  0x00007ff74c162899 in JobThread::Entry() ()

#7  0x00007ff74ba99021 in wxThreadInternal::PthreadStart(wxThread*) () from /usr/lib64/libwx_baseu-2.8.so.0

#8  0x00007ff74affcaa1 in start_thread () from /lib64/libpthread.so.0

#9  0x00007ff74ad49c4d in clone () from /lib64/libc.so.6

Thread 5 (Thread 0x7ff74695d700 (LWP 910)):

#0  0x00007ff74b003334 in __lll_lock_wait () from /lib64/libpthread.so.0

#1  0x00007ff74affe5d8 in _L_lock_854 () from /lib64/libpthread.so.0

#2  0x00007ff74affe4a7 in pthread_mutex_lock () from /lib64/libpthread.so.0

#3  0x00007ff74ba979c9 in wxMutexInternal::Lock() () from /usr/lib64/libwx_baseu-2.8.so.0

#4  0x00007ff74c15b819 in DBconn::Return() ()

#5  0x00007ff74c161217 in Job::Execute() ()

#6  0x00007ff74c162899 in JobThread::Entry() ()

#7  0x00007ff74ba99021 in wxThreadInternal::PthreadStart(wxThread*) () from /usr/lib64/libwx_baseu-2.8.so.0

#8  0x00007ff74affcaa1 in start_thread () from /lib64/libpthread.so.0

#9  0x00007ff74ad49c4d in clone () from /lib64/libc.so.6

Thread 4 (Thread 0x7ff74735e700 (LWP 1565)):

#0  0x00007ff74b003334 in __lll_lock_wait () from /lib64/libpthread.so.0

#1  0x00007ff74affe5d8 in _L_lock_854 () from /lib64/libpthread.so.0

#2  0x00007ff74affe4a7 in pthread_mutex_lock () from /lib64/libpthread.so.0

#3  0x00007ff74ba979c9 in wxMutexInternal::Lock() () from /usr/lib64/libwx_baseu-2.8.so.0

#4  0x00007ff74c15b819 in DBconn::Return() ()

#5  0x00007ff74c161217 in Job::Execute() ()

#6  0x00007ff74c162899 in JobThread::Entry() ()

#7  0x00007ff74ba99021 in wxThreadInternal::PthreadStart(wxThread*) () from /usr/lib64/libwx_baseu-2.8.so.0

#8  0x00007ff74affcaa1 in start_thread () from /lib64/libpthread.so.0

#9  0x00007ff74ad49c4d in clone () from /lib64/libc.so.6

Thread 3 (Thread 0x7ff74555b700 (LWP 1567)):

#0  0x00007ff74ad40403 in poll () from /lib64/libc.so.6

#1  0x00007ff74bd1c28f in ?? () from /usr/lib64/libpq.so.5

#2  0x00007ff74bd1c310 in ?? () from /usr/lib64/libpq.so.5

#3  0x00007ff74bd178e2 in ?? () from /usr/lib64/libpq.so.5

#4  0x00007ff74bd1865f in PQconnectdb () from /usr/lib64/libpq.so.5

#5  0x00007ff74c15ad71 in DBconn::Connect(wxString const&) ()

#6  0x00007ff74c15af73 in DBconn::DBconn(wxString const&, wxString const&) ()

#7  0x00007ff74c15bfe8 in DBconn::Get(wxString const&, wxString const&) ()

#8  0x00007ff74c16108f in Job::Execute() ()

#9  0x00007ff74c162899 in JobThread::Entry() ()

#10 0x00007ff74ba99021 in wxThreadInternal::PthreadStart(wxThread*) () from /usr/lib64/libwx_baseu-2.8.so.0

#11 0x00007ff74affcaa1 in start_thread () from /lib64/libpthread.so.0

#12 0x00007ff74ad49c4d in clone () from /lib64/libc.so.6

Thread 2 (Thread 0x7ff744b5a700 (LWP 1569)):

#0  0x00007ff74b003334 in __lll_lock_wait () from /lib64/libpthread.so.0

#1  0x00007ff74affe5d8 in _L_lock_854 () from /lib64/libpthread.so.0

#2  0x00007ff74affe4a7 in pthread_mutex_lock () from /lib64/libpthread.so.0

#3  0x00007ff74ba979c9 in wxMutexInternal::Lock() () from /usr/lib64/libwx_baseu-2.8.so.0

#4  0x00007ff74c15bf6b in DBconn::Get(wxString const&, wxString const&) ()

#5  0x00007ff74c16108f in Job::Execute() ()

#6  0x00007ff74c162899 in JobThread::Entry() ()

#7  0x00007ff74ba99021 in wxThreadInternal::PthreadStart(wxThread*) () from /usr/lib64/libwx_baseu-2.8.so.0

#8  0x00007ff74affcaa1 in start_thread () from /lib64/libpthread.so.0

#9  0x00007ff74ad49c4d in clone () from /lib64/libc.so.6

Thread 1 (Thread 0x7ff74c3507e0 (LWP 23389)):

#0  0x00007ff74b003334 in __lll_lock_wait () from /lib64/libpthread.so.0

#1  0x00007ff74affe5d8 in _L_lock_854 () from /lib64/libpthread.so.0

#2  0x00007ff74affe4a7 in pthread_mutex_lock () from /lib64/libpthread.so.0

#3  0x00007ff74ba979c9 in wxMutexInternal::Lock() () from /usr/lib64/libwx_baseu-2.8.so.0

#4  0x00007ff74c15a99d in DBconn::ClearConnections(bool) ()

#5  0x00007ff74c15e908 in MainRestartLoop(DBconn*) ()

#6  0x00007ff74c15f2a3 in MainLoop() ()

#7  0x00007ff74c15e016 in main ()

 

 

徐志宇(Jack

Database Engineer

    

DB Team,ITS. Lenovo China

Phone: 86-18910860709

Email:xuzy13@lenovo.com

No.6 Shangdi West Road, Haidian District Beijing, China, 100085

 


 

--

Dave Page
Blog: http://pgsnake.blogspot.com
Twitter: @pgsnake

EDB: http://www.enterprisedb.com



--
Dave Page
Blog: http://pgsnake.blogspot.com
Twitter: @pgsnake

EDB: http://www.enterprisedb.com

pgadmin-support by date:

Previous
From: Dave Page
Date:
Subject: Re: Export data from a query ran by pgagent
Next
From: Zhiyu ZY13 Xu
Date:
Subject: 答复: [External] Re: pgadmin--pgagent---the process hang by unknow reasons