Thread: archives.postgresql.org change

archives.postgresql.org change

From
Alvaro Herrera
Date:
Hi,

We've just changed the archives site so that we only pull the mboxes
from hub.org, and mhonarc is run locally on archives.pg.org.  This is
quicker -- we're now regenerating the archives every 15 minutes.

Let me know (via this list, I guess) if anything weird crops up.

-- 
Alvaro Herrera                                http://www.CommandPrompt.com/
The PostgreSQL Company - Command Prompt, Inc.


Re: archives.postgresql.org change

From
Magnus Hagander
Date:
On Fri, 2008-02-22 at 18:12 -0300, Alvaro Herrera wrote:
> Hi,
> 
> We've just changed the archives site so that we only pull the mboxes
> from hub.org, and mhonarc is run locally on archives.pg.org.  This is
> quicker -- we're now regenerating the archives every 15 minutes.
> 
> Let me know (via this list, I guess) if anything weird crops up.

Why was this done without any warning to the -www or sysadmins team?

Aside from that, I assume you have verified that the search system still
pulls things in properly? That's where I'd see a potential issue, but I
don't have time to look into it right now (need to fix broken slides for
tomorrows FOSDEM talks)

Oh, and since you've been fiddling it, please get some documentation up
on pmt. We didn't have any for the old one that I can find, but we'd
really like some for the new one. (for one, we've been talking here at
FOSDEM about auto-syncing some things about the lists between that one
and the search stuff so we stop missing lists when they are added - and
it'd be nice to have a documented system to work off for that one)

//Magnus



Re: archives.postgresql.org change

From
"Joshua D. Drake"
Date:
Magnus Hagander wrote:
> On Fri, 2008-02-22 at 18:12 -0300, Alvaro Herrera wrote:
>> Hi,
>>
>> We've just changed the archives site so that we only pull the mboxes
>> from hub.org, and mhonarc is run locally on archives.pg.org.  This is
>> quicker -- we're now regenerating the archives every 15 minutes.
>>
>> Let me know (via this list, I guess) if anything weird crops up.
> 
> Why was this done without any warning to the -www or sysadmins team?

Uhh... this has been being talking about for *months*.

> 
> Aside from that, I assume you have verified that the search system still
> pulls things in properly? That's where I'd see a potential issue, but I
> don't have time to look into it right now (need to fix broken slides for
> tomorrows FOSDEM talks)

I think you are missing what we did :)

It used to be that we rsynced all of archives.postgresql.org (website, 
php files, mbox files etc..).

Now we just rsync mbox. Basically we are saving bandwidth and allowing 
marc to deccomission and old fbsd4 box. Operationally for all dependent 
stuff nothing should change. The locations, urls, ips etc.. didn't change.

> 
> Oh, and since you've been fiddling it, please get some documentation up
> on pmt. We didn't have any for the old one that I can find, but we'd
> really like some for the new one. (for one, we've been talking here at
> FOSDEM about auto-syncing some things about the lists between that one
> and the search stuff so we stop missing lists when they are added - and
> it'd be nice to have a documented system to work off for that one)

Sure.

Joshua D. Drake




Re: archives.postgresql.org change

From
Magnus Hagander
Date:
On Sat, 2008-02-23 at 11:25 -0800, Joshua D. Drake wrote:
> Magnus Hagander wrote:
> > On Fri, 2008-02-22 at 18:12 -0300, Alvaro Herrera wrote:
> >> Hi,
> >>
> >> We've just changed the archives site so that we only pull the mboxes
> >> from hub.org, and mhonarc is run locally on archives.pg.org.  This is
> >> quicker -- we're now regenerating the archives every 15 minutes.
> >>
> >> Let me know (via this list, I guess) if anything weird crops up.
> > 
> > Why was this done without any warning to the -www or sysadmins team?
> 
> Uhh... this has been being talking about for *months*.

Really? I've missed that completely.


> > Aside from that, I assume you have verified that the search system still
> > pulls things in properly? That's where I'd see a potential issue, but I
> > don't have time to look into it right now (need to fix broken slides for
> > tomorrows FOSDEM talks)
> 
> I think you are missing what we did :)
> 
> It used to be that we rsynced all of archives.postgresql.org (website, 
> php files, mbox files etc..).
> 
> Now we just rsync mbox. Basically we are saving bandwidth and allowing 
> marc to deccomission and old fbsd4 box. Operationally for all dependent 
> stuff nothing should change. The locations, urls, ips etc.. didn't change.

I understand perfectly well what you did. But we've seen *a lot* of
cases before when moving one of these platform independent programs
(like mhonarc, which is written in perl IIRC) between Linux and FreeBSD
and get significantly different behaviour. Either in that it crashes, or
more often in that it gives different results without crashing. What you
did was move one of our core services from FreeBSD to Linux, which
potentially has a lot of effects. I'm not saying that something broke,
I'm just asking how much you actually verified.

AFAIK, we also had a bunch of customizations in mhonarc - did those get
migrated over, or are we now running a stock mhonarc?


> > Oh, and since you've been fiddling it, please get some documentation up
> > on pmt. We didn't have any for the old one that I can find, but we'd
> > really like some for the new one. (for one, we've been talking here at
> > FOSDEM about auto-syncing some things about the lists between that one
> > and the search stuff so we stop missing lists when they are added - and
> > it'd be nice to have a documented system to work off for that one)
> 
> Sure.

Good, thanks!

Oh, and I also notice there has been zero commits about this to the
autobackup repository, so obviously none of this is included in said
backups - that also needs to be fixed.

//Magnus


Re: archives.postgresql.org change

From
"Dave Page"
Date:
On Sun, Feb 24, 2008 at 9:53 AM, Magnus Hagander <magnus@hagander.net> wrote:
>
>  > Uhh... this has been being talking about for *months*.
>
>  Really? I've missed that completely.

Yeah, me too. I know we were aiming to upgrade svr5, but don't recall
any chatter about moving the archives generation. Marc normally just
moves VMs to a FreeBSD 6 host and upgrades them.

>  I understand perfectly well what you did. But we've seen *a lot* of
>  cases before when moving one of these platform independent programs
>  (like mhonarc, which is written in perl IIRC) between Linux and FreeBSD
>  and get significantly different behaviour.

Yeah - case in point - the new archives system I've been working on.
Works fine on Linux, on FreeBSD PHP core dumps and occasionally throws
mysterious out of memory errors and ignores the remaining messages in
an mbox.

>  AFAIK, we also had a bunch of customizations in mhonarc - did those get
>  migrated over, or are we now running a stock mhonarc?

Yeah, I'm not sure what they all were, but I do remember Marc spending
a bunch of time hacking stuff around on Oleg & John's advice to
optimise the way Google & other search engine index the archives. I
don't know if all that customisation is stuff that can be rsync'd.

-- 
Dave Page
EnterpriseDB UK: http://www.enterprisedb.com
The Oracle-compatible database company


Re: archives.postgresql.org change

From
Andrew Sullivan
Date:
On Sun, Feb 24, 2008 at 09:53:10AM +0100, Magnus Hagander wrote:
> > 
> > Uhh... this has been being talking about for *months*.
> 
> Really? I've missed that completely.

I also do not remember any posting of an announcement like, "We're moving
service $X from $machineA to $machineB at $timestamptz."  

There seems to be a predilection for moving things around without a lot of
clear planning.  If we're going to have a group of administrators
maintaining machines, surely we need to have clear announcements of the
above sort before and after such moves, so that nobody is surprised, no?

A


Re: archives.postgresql.org change

From
"Joshua D. Drake"
Date:
Magnus Hagander wrote:
> On Sat, 2008-02-23 at 11:25 -0800, Joshua D. Drake wrote:
>> Magnus Hagander wrote:
>>> On Fri, 2008-02-22 at 18:12 -0300, Alvaro Herrera wrote:
>>>> Hi,

> AFAIK, we also had a bunch of customizations in mhonarc - did those get
> migrated over, or are we now running a stock mhonarc?
>

We only did what Marc told us we had to do. We had a couple of scripts 
that he provided that we had to port from tcsh (yes) to sh, which Alvaro 
kindly did. As I understand it all the scripts did was format the 
messages from the mbox files but to be honest, I didn't read them. I had 
Alvaro and Marc work it out (I refuse to touch tcsh).

> 
>>> Oh, and since you've been fiddling it, please get some documentation up
>>> on pmt. We didn't have any for the old one that I can find, but we'd
>>> really like some for the new one. (for one, we've been talking here at
>>> FOSDEM about auto-syncing some things about the lists between that one
>>> and the search stuff so we stop missing lists when they are added - and
>>> it'd be nice to have a documented system to work off for that one)
>> Sure.
> 
> Good, thanks!
> 
> Oh, and I also notice there has been zero commits about this to the
> autobackup repository, so obviously none of this is included in said
> backups - that also needs to be fixed.

Its on the list :)

Joshua D. Drake




Re: archives.postgresql.org change

From
Magnus Hagander
Date:
On Sun, 2008-02-24 at 07:53 -0800, Joshua D. Drake wrote:
> Magnus Hagander wrote:
> > On Sat, 2008-02-23 at 11:25 -0800, Joshua D. Drake wrote:
> >> Magnus Hagander wrote:
> >>> On Fri, 2008-02-22 at 18:12 -0300, Alvaro Herrera wrote:
> >>>> Hi,
> 
> > AFAIK, we also had a bunch of customizations in mhonarc - did those get
> > migrated over, or are we now running a stock mhonarc?
> >
> 
> We only did what Marc told us we had to do. We had a couple of scripts 
> that he provided that we had to port from tcsh (yes) to sh, which Alvaro 
> kindly did. As I understand it all the scripts did was format the 
> messages from the mbox files but to be honest, I didn't read them. I had 
> Alvaro and Marc work it out (I refuse to touch tcsh).

As long as these things are covered, that's fine. But it'd be good if
someone could verify it who actually knew about it. I think that means
you, Marc :-) But if that's already done, then drop the point - it just
wasn't clear from the original message.


> >>> Oh, and since you've been fiddling it, please get some documentation up
> >>> on pmt. We didn't have any for the old one that I can find, but we'd
> >>> really like some for the new one. (for one, we've been talking here at
> >>> FOSDEM about auto-syncing some things about the lists between that one
> >>> and the search stuff so we stop missing lists when they are added - and
> >>> it'd be nice to have a documented system to work off for that one)
> >> Sure.
> > 
> > Good, thanks!
> > 
> > Oh, and I also notice there has been zero commits about this to the
> > autobackup repository, so obviously none of this is included in said
> > backups - that also needs to be fixed.
> 
> Its on the list :)

Yay :-)

//Magnus


Re: archives.postgresql.org change

From
"Joshua D. Drake"
Date:
Magnus Hagander wrote:

> As long as these things are covered, that's fine. But it'd be good if
> someone could verify it who actually knew about it. I think that means
> you, Marc :-) But if that's already done, then drop the point - it just
> wasn't clear from the original message.

And Marc has been in the loop the whole time (just fyi).

>>>
>>> Oh, and I also notice there has been zero commits about this to the
>>> autobackup repository, so obviously none of this is included in said
>>> backups - that also needs to be fixed.

Yeah... that has just been done.

Joshua D. Drake





Re: archives.postgresql.org change

From
"Marc G. Fournier"
Date:
-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1



- --On Sunday, February 24, 2008 10:42:45 +0100 Dave Page <dpage@pgadmin.org> 
wrote:


> Yeah, me too. I know we were aiming to upgrade svr5, but don't recall
> any chatter about moving the archives generation. Marc normally just
> moves VMs to a FreeBSD 6 host and upgrades them.

Actually, I had asked Joshua to do this months ago ... figured it made more 
sense to pull justhte mbox files to the archives server and regen, then pull 
them to an intermediary box, and then have archives pull  the generated files 
... the middle step didn't make much sense ...

> Yeah, I'm not sure what they all were, but I do remember Marc spending
> a bunch of time hacking stuff around on Oleg & John's advice to
> optimise the way Google & other search engine index the archives. I
> don't know if all that customisation is stuff that can be rsync'd.

The only thing that that should have affected would have been the .resource 
file ... Joshua, we did copy that over, right?

- ----
Marc G. Fournier           Hub.Org Networking Services (http://www.hub.org)
Email . scrappy@hub.org                              MSN . scrappy@hub.org
Yahoo . yscrappy               Skype: hub.org        ICQ . 7615664
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v2.0.4 (FreeBSD)

iD8DBQFHwbZk4QvfyHIvDvMRAiAJAKCrGsBnk5TRtBPw4hcRdtbowQQdbQCgyu5i
9cPAoBYqWSB/WZUZk7T3Y+Q=
=0tij
-----END PGP SIGNATURE-----



Re: archives.postgresql.org change

From
"Joshua D. Drake"
Date:
On Sun, 24 Feb 2008 14:24:36 -0400
"Marc G. Fournier" <scrappy@hub.org> wrote:

> -----BEGIN PGP SIGNED MESSAGE-----
> Hash: SHA1
>
>
>
> - --On Sunday, February 24, 2008 10:42:45 +0100 Dave Page
> <dpage@pgadmin.org> wrote:
>
>
> > Yeah, me too. I know we were aiming to upgrade svr5, but don't
> > recall any chatter about moving the archives generation. Marc
> > normally just moves VMs to a FreeBSD 6 host and upgrades them.
>
> Actually, I had asked Joshua to do this months ago ... figured it
> made more sense to pull justhte mbox files to the archives server and
> regen, then pull them to an intermediary box, and then have archives
> pull  the generated files ... the middle step didn't make much
> sense ...
>
> > Yeah, I'm not sure what they all were, but I do remember Marc
> > spending a bunch of time hacking stuff around on Oleg & John's
> > advice to optimise the way Google & other search engine index the
> > archives. I don't know if all that customisation is stuff that can
> > be rsync'd.
>
> The only thing that that should have affected would have been
> the .resource file ... Joshua, we did copy that over, right?

I have the following:

-rw-r--r--  1 archives archives 8041 Jul 13  2006 .resource-new
-rw-r--r--  1 archives archives 7856 Aug  9  2006 .resource-old

Which contain a bunch of mhonarc macros (at least that is what it looks
like). It is called from mk-mhonarc like so:
 if [ `echo $month | awk -F. '{print $2}'` -le 200607 ]; then     resourcefile=.resource-old   else
resourcefile=.resource-new  fi 

Sincerely,

Joshua D. Drake

--
The PostgreSQL Company since 1997: http://www.commandprompt.com/
PostgreSQL Community Conference: http://www.postgresqlconference.org/
Donate to the PostgreSQL Project: http://www.postgresql.org/about/donate
PostgreSQL SPI Liaison | SPI Director |  PostgreSQL political pundit


Re: archives.postgresql.org change

From
"Marc G. Fournier"
Date:
-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1


Perfect, just wanted to make sure ...

- --On Sunday, February 24, 2008 10:35:54 -0800 "Joshua D. Drake" 
<jd@commandprompt.com> wrote:

> On Sun, 24 Feb 2008 14:24:36 -0400
> "Marc G. Fournier" <scrappy@hub.org> wrote:
>
>> -----BEGIN PGP SIGNED MESSAGE-----
>> Hash: SHA1
>>
>>
>>
>> - --On Sunday, February 24, 2008 10:42:45 +0100 Dave Page
>> <dpage@pgadmin.org> wrote:
>>
>>
>> > Yeah, me too. I know we were aiming to upgrade svr5, but don't
>> > recall any chatter about moving the archives generation. Marc
>> > normally just moves VMs to a FreeBSD 6 host and upgrades them.
>>
>> Actually, I had asked Joshua to do this months ago ... figured it
>> made more sense to pull justhte mbox files to the archives server and
>> regen, then pull them to an intermediary box, and then have archives
>> pull  the generated files ... the middle step didn't make much
>> sense ...
>>
>> > Yeah, I'm not sure what they all were, but I do remember Marc
>> > spending a bunch of time hacking stuff around on Oleg & John's
>> > advice to optimise the way Google & other search engine index the
>> > archives. I don't know if all that customisation is stuff that can
>> > be rsync'd.
>>
>> The only thing that that should have affected would have been
>> the .resource file ... Joshua, we did copy that over, right?
>
> I have the following:
>
> -rw-r--r--  1 archives archives 8041 Jul 13  2006 .resource-new
> -rw-r--r--  1 archives archives 7856 Aug  9  2006 .resource-old
>
> Which contain a bunch of mhonarc macros (at least that is what it looks
> like). It is called from mk-mhonarc like so:
>
>   if [ `echo $month | awk -F. '{print $2}'` -le 200607 ]; then
>       resourcefile=.resource-old
>     else
>       resourcefile=.resource-new
>     fi
>
> Sincerely,
>
> Joshua D. Drake
>
> --
> The PostgreSQL Company since 1997: http://www.commandprompt.com/
> PostgreSQL Community Conference: http://www.postgresqlconference.org/
> Donate to the PostgreSQL Project: http://www.postgresql.org/about/donate
> PostgreSQL SPI Liaison | SPI Director |  PostgreSQL political pundit
>



- ----
Marc G. Fournier           Hub.Org Networking Services (http://www.hub.org)
Email . scrappy@hub.org                              MSN . scrappy@hub.org
Yahoo . yscrappy               Skype: hub.org        ICQ . 7615664
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v2.0.4 (FreeBSD)

iD8DBQFHwbpS4QvfyHIvDvMRAhFYAJ0UrYH3jF3MdE4AgfgsYvSRsI03uQCcC4wa
MyN85lOf3QWhVfXcfCOMJWs=
=8BqQ
-----END PGP SIGNATURE-----



Re: archives.postgresql.org change

From
Alvaro Herrera
Date:
Magnus Hagander wrote:
> 
> On Fri, 2008-02-22 at 18:12 -0300, Alvaro Herrera wrote:
> > Hi,
> > 
> > We've just changed the archives site so that we only pull the mboxes
> > from hub.org, and mhonarc is run locally on archives.pg.org.  This is
> > quicker -- we're now regenerating the archives every 15 minutes.
> > 
> > Let me know (via this list, I guess) if anything weird crops up.
> 
> Why was this done without any warning to the -www or sysadmins team?

Sorry, I don't know the answer to that.  I kept looking for you, Dave or
Stefan on Jabber, but saw none.

> Aside from that, I assume you have verified that the search system still
> pulls things in properly? That's where I'd see a potential issue, but I
> don't have time to look into it right now (need to fix broken slides for
> tomorrows FOSDEM talks)

Good question -- I didn't.

> Oh, and since you've been fiddling it, please get some documentation up
> on pmt.

If you get me a password for accessing pmt, I'll gladly document the
whole shebang (which really needs some documenting -- the whole setup is
not entirely trivial).  Also, I'd like to put the scripts somewhere in a
community SCM (I put them in a CMD SVN as a temporary measure.)

> We didn't have any for the old one that I can find, but we'd
> really like some for the new one. (for one, we've been talking here at
> FOSDEM about auto-syncing some things about the lists between that one
> and the search stuff so we stop missing lists when they are added - and
> it'd be nice to have a documented system to work off for that one)

That would be nice -- also the current scripts to generate the list of
lists contain some duplicate info; it's no wonder that the lists has
been out of date continuously.  I can lend a hand with that as needed.

-- 
Alvaro Herrera                                http://www.CommandPrompt.com/
PostgreSQL Replication, Consulting, Custom Development, 24x7 support


Re: archives.postgresql.org change

From
Alvaro Herrera
Date:
Alvaro Herrera wrote:
> Magnus Hagander wrote:

> > Aside from that, I assume you have verified that the search system still
> > pulls things in properly? That's where I'd see a potential issue, but I
> > don't have time to look into it right now (need to fix broken slides for
> > tomorrows FOSDEM talks)
> 
> Good question -- I didn't.

Ok, it seems to work.  I searched for "archives.postgresql.org change"
and this thread shows up.

-- 
Alvaro Herrera                                http://www.CommandPrompt.com/
The PostgreSQL Company - Command Prompt, Inc.


Re: archives.postgresql.org change

From
"Joshua D. Drake"
Date:
Alvaro Herrera wrote:
> Alvaro Herrera wrote:
>> Magnus Hagander wrote:
> 
>>> Aside from that, I assume you have verified that the search system still
>>> pulls things in properly? That's where I'd see a potential issue, but I
>>> don't have time to look into it right now (need to fix broken slides for
>>> tomorrows FOSDEM talks)
>> Good question -- I didn't.
> 
> Ok, it seems to work.  I searched for "archives.postgresql.org change"
> and this thread shows up.
> 

A note on the svn, the files are now in the autobackup system so they 
are now in svn for community as well.

Joshua D. Drake


Re: archives.postgresql.org change

From
"Dave Page"
Date:
On Mon, Feb 25, 2008 at 2:19 AM, Joshua D. Drake <jd@commandprompt.com> wrote:
> Alvaro Herrera wrote:
>  > Alvaro Herrera wrote:
>  >> Magnus Hagander wrote:
>  >
>  >>> Aside from that, I assume you have verified that the search system still
>  >>> pulls things in properly? That's where I'd see a potential issue, but I
>  >>> don't have time to look into it right now (need to fix broken slides for
>  >>> tomorrows FOSDEM talks)
>  >> Good question -- I didn't.
>  >
>  > Ok, it seems to work.  I searched for "archives.postgresql.org change"
>  > and this thread shows up.
>  >
>
>  A note on the svn, the files are now in the autobackup system so they
>  are now in svn for community as well.

Config files you mean? There shouldn't be any content type stuff in
auto-backup, as that should work the other way round as people test
changes in sandboxes and then commit them to SVN, from where the
production site can check them out (something we'll need to sort out
for archives).

-- 
Dave Page
EnterpriseDB UK: http://www.enterprisedb.com
The Oracle-compatible database company


Re: archives.postgresql.org change

From
"Dave Page"
Date:
On Sun, Feb 24, 2008 at 4:39 PM, Joshua D. Drake <jd@commandprompt.com> wrote:
> Magnus Hagander wrote:
>
>  > As long as these things are covered, that's fine. But it'd be good if
>  > someone could verify it who actually knew about it. I think that means
>  > you, Marc :-) But if that's already done, then drop the point - it just
>  > wasn't clear from the original message.
>
>  And Marc has been in the loop the whole time (just fyi).

Then he also deserves a proverbial slap for not sharing with the rest of us.

We are managing services used by thousands of people here - we need to
coordinate all major changes to ensure everyone has a chance to think
of any potential problems, and so we know whats happening. Consider;
Stefan and I are chatting and notice that Nagios has started alerting
on a service. We then spend an hour or two trying to diagnose the
problem between us, not realising that the old box which we're looking
at is fine, it's just that you've moved the service to a different
server and something is subtly different, causing Nagios to barf.

Communication JD - that's all we need!!


-- 
Dave Page
EnterpriseDB UK: http://www.enterprisedb.com
The Oracle-compatible database company