Thread: Switching to XML

Switching to XML

From

David Blewett

Date:

08 December 2006, 10:18:33

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

Hi All:

I read a thread from July explaining the current status of moving to XML
[1].

Peter, you mention that if someone has a clear need you would be open to
switching. Let me explain the problems I've encountered with SGML.

A few months ago, I split up the 8.1 pdf into 2 volumes so that I could
get a hardbound copy from Lulu. You can get those here [2-3] (I'm not
making any money, it's the pure cost of production). For 8.2, I wanted
to go deeper and be able to distribute the final version to bookstores.
In order to do that, each volume needs to be under 700 pages.

I started work on modifying the SGML to use a set of 3 volumes, split at
roughly 500 page intervals. I wanted to generate individual ToC's and
indexes for each volume. I started to modify the SGML to include a
"role" attribute for each indexterm, to tell what volume it was part of.
This was done with a simple sed script. When I went to generate the
actual indexes, I hit a brick wall. Apparently, the SGML toolchain
cannot handle typed indexes as described here [4]. Only the XML
toolchain currently handles them.

I am working with Joshua Drake on creating a Lulu account for the
fundraising group. We would then move the volumes I did for 8.1 to their
account, and raise the price so that any profit went to them. The
ability to order the 8.2 manual from bookstores could increase the
visibility of the project as a whole.

David Blewett

1.
http://groups.google.com/group/pgsql.docs/browse_thread/thread/37ff3a011bb705d7
2. http://www.lulu.com/content/455020
3. http://www.lulu.com/content/464949
4. http://www.xml.com/pub/a/2004/07/14/dbndx.html?page=3
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.3 (MingW32)
Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org

iD8DBQFFeXIOZmlc6wNjtLYRAo2yAKCZT1NbsklCd8djADdv48MuLELG7wCfYYG3
ypUw8LU3g++GaY6Dz2vUi+I=
=lC+c
-----END PGP SIGNATURE-----

Re: Switching to XML

From

"Joshua D. Drake"

Date:

08 December 2006, 11:38:12

On Fri, 2006-12-08 at 09:09 -0500, David Blewett wrote:
> -----BEGIN PGP SIGNED MESSAGE-----
> Hash: SHA1
>
> Hi All:
>
> I read a thread from July explaining the current status of moving to XML
> [1].

To add to this, it would take minutes to transform XML versus the days
it takes the SGML.

Joshua D. Drake

>
> Peter, you mention that if someone has a clear need you would be open to
> switching. Let me explain the problems I've encountered with SGML.
>
> A few months ago, I split up the 8.1 pdf into 2 volumes so that I could
> get a hardbound copy from Lulu. You can get those here [2-3] (I'm not
> making any money, it's the pure cost of production). For 8.2, I wanted
> to go deeper and be able to distribute the final version to bookstores.
> In order to do that, each volume needs to be under 700 pages.
>
> I started work on modifying the SGML to use a set of 3 volumes, split at
> roughly 500 page intervals. I wanted to generate individual ToC's and
> indexes for each volume. I started to modify the SGML to include a
> "role" attribute for each indexterm, to tell what volume it was part of.
> This was done with a simple sed script. When I went to generate the
> actual indexes, I hit a brick wall. Apparently, the SGML toolchain
> cannot handle typed indexes as described here [4]. Only the XML
> toolchain currently handles them.
>
> I am working with Joshua Drake on creating a Lulu account for the
> fundraising group. We would then move the volumes I did for 8.1 to their
> account, and raise the price so that any profit went to them. The
> ability to order the 8.2 manual from bookstores could increase the
> visibility of the project as a whole.
>
> David Blewett
>
> 1.
> http://groups.google.com/group/pgsql.docs/browse_thread/thread/37ff3a011bb705d7
> 2. http://www.lulu.com/content/455020
> 3. http://www.lulu.com/content/464949
> 4. http://www.xml.com/pub/a/2004/07/14/dbndx.html?page=3
> -----BEGIN PGP SIGNATURE-----
> Version: GnuPG v1.4.3 (MingW32)
> Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org
>
> iD8DBQFFeXIOZmlc6wNjtLYRAo2yAKCZT1NbsklCd8djADdv48MuLELG7wCfYYG3
> ypUw8LU3g++GaY6Dz2vUi+I=
> =lC+c
> -----END PGP SIGNATURE-----
>
> ---------------------------(end of broadcast)---------------------------
> TIP 4: Have you searched our list archives?
>
>                http://archives.postgresql.org
>
--

      === The PostgreSQL Company: Command Prompt, Inc. ===
Sales/Support: +1.503.667.4564 || 24x7/Emergency: +1.800.492.2240
Providing the most comprehensive  PostgreSQL solutions since 1997
             http://www.commandprompt.com/

Donate to the PostgreSQL Project: http://www.postgresql.org/about/donate

Re: Switching to XML

From

Peter Eisentraut

Date:

08 December 2006, 14:02:14

David Blewett wrote:
> I started work on modifying the SGML to use a set of 3 volumes, split
> at roughly 500 page intervals. I wanted to generate individual ToC's
> and indexes for each volume. I started to modify the SGML to include
> a "role" attribute for each indexterm, to tell what volume it was
> part of. This was done with a simple sed script. When I went to
> generate the actual indexes, I hit a brick wall. Apparently, the SGML
> toolchain cannot handle typed indexes as described here [4]. Only the
> XML toolchain currently handles them.

But no one is forcing you to use "the SGML toolchain".  I requote the
message you cited:

 First of all, moving to DocBook XML will not do anything in the way of
 improving our output processing abilities.  Any tool that you can use
 on DocBook SGML can also be used on DocBook XML and vice versa.

--
Peter Eisentraut
http://developer.postgresql.org/~petere/

Re: Switching to XML

From

Josh Berkus

Date:

08 December 2006, 14:38:23

Peter,

>  First of all, moving to DocBook XML will not do anything in the way of
>  improving our output processing abilities.  Any tool that you can use
>  on DocBook SGML can also be used on DocBook XML and vice versa.
>

As I said then, this is absolutely untrue.  OpenOffice.org, for example,
works with DocBook XML but not SGML.  There are also a plethora of XML
editing and publishing tools which can been used for Docbook XML which
are not available for SGML.  A simple look at this page:
http://wiki.docbook.org/topic/DocBookAuthoringTools
.... shows that there are more than twice as many authoring tools which
support only XML as support SGML -- and that most of the tools which
support SGML are out-of-maintenance.

--Josh Berkus

Re: Switching to XML

From

"Joshua D. Drake"

Date:

08 December 2006, 15:06:18

On Fri, 2006-12-08 at 13:36 -0500, Josh Berkus wrote:
> Peter,
>
> >  First of all, moving to DocBook XML will not do anything in the way of
> >  improving our output processing abilities.  Any tool that you can use
> >  on DocBook SGML can also be used on DocBook XML and vice versa.
> >
>
> As I said then, this is absolutely untrue.

You are correct Josh. I am not sure why Peter ignores this simple fact.

>  OpenOffice.org, for example,
> works with DocBook XML but not SGML.  There are also a plethora of XML
> editing and publishing tools which can been used for Docbook XML which
> are not available for SGML.  A simple look at this page:
> http://wiki.docbook.org/topic/DocBookAuthoringTools
> .... shows that there are more than twice as many authoring tools which
> support only XML as support SGML -- and that most of the tools which
> support SGML are out-of-maintenance.

Yep.

Sincerely,

Joshua D. Drake


>
> --Josh Berkus
>
>
> ---------------------------(end of broadcast)---------------------------
> TIP 2: Don't 'kill -9' the postmaster
>
--

      === The PostgreSQL Company: Command Prompt, Inc. ===
Sales/Support: +1.503.667.4564 || 24x7/Emergency: +1.800.492.2240
Providing the most comprehensive  PostgreSQL solutions since 1997
             http://www.commandprompt.com/

Donate to the PostgreSQL Project: http://www.postgresql.org/about/donate

Re: Switching to XML

From

Peter Eisentraut

Date:

08 December 2006, 16:15:59

Josh Berkus wrote:
> As I said then, this is absolutely untrue.  OpenOffice.org, for
> example, works with DocBook XML but not SGML.

What does "works" mean?

> There are also a
> plethora of XML editing and publishing tools which can been used for
> Docbook XML which are not available for SGML.

That has nothing to do with processing.

--
Peter Eisentraut
http://developer.postgresql.org/~petere/

Re: Switching to XML

From

"Joshua D. Drake"

Date:

08 December 2006, 16:19:57

On Fri, 2006-12-08 at 21:15 +0100, Peter Eisentraut wrote:
> Josh Berkus wrote:
> > As I said then, this is absolutely untrue.  OpenOffice.org, for
> > example, works with DocBook XML but not SGML.
>
> What does "works" mean?

You can create, edit, convert, save, and open docbook xml in
OpenOfice.org.

>
> > There are also a
> > plethora of XML editing and publishing tools which can been used for
> > Docbook XML which are not available for SGML.
>
> That has nothing to do with processing.

You are correct but XML does give us more modern and efficient toolsets
for processing. XSLT, FOP + Xerces for example.

Sincerely,

Joshua D. Drake


>
--

      === The PostgreSQL Company: Command Prompt, Inc. ===
Sales/Support: +1.503.667.4564 || 24x7/Emergency: +1.800.492.2240
Providing the most comprehensive  PostgreSQL solutions since 1997
             http://www.commandprompt.com/

Donate to the PostgreSQL Project: http://www.postgresql.org/about/donate

Re: Switching to XML

From

Peter Eisentraut

Date:

08 December 2006, 16:58:11

Joshua D. Drake wrote:
> You can create, edit, convert, save, and open docbook xml in
> OpenOfice.org.

Sure, there are more editing options with DocBook XML.  No one disputes
that.  But the question at hand was about processing the DocBook.

> You are correct but XML does give us more modern and efficient
> toolsets for processing. XSLT, FOP + Xerces for example.

You can do that right now.

--
Peter Eisentraut
http://developer.postgresql.org/~petere/

Re: Switching to XML

From

"Joshua D. Drake"

Date:

08 December 2006, 17:24:16

On Fri, 2006-12-08 at 21:58 +0100, Peter Eisentraut wrote:
> Joshua D. Drake wrote:
> > You can create, edit, convert, save, and open docbook xml in
> > OpenOfice.org.
>
> Sure, there are more editing options with DocBook XML.  No one disputes
> that.  But the question at hand was about processing the DocBook.

Yes which is generated from our use of SGML which is the core of this
problem and the core of the question as a whole.

SGML is making working with the documentation *harder*.

We have people that *DO NOT* contribute because of this SGML
requirement. They have what I consider extremely valid reasons, namely
it is dumb to require a writer to use emacs or write tags explictly.

Hell, the only reason I have even bothered to contribute what little I
have to the docs is because I wrote a book in SGML, thus it is a no
brainer to me. Others aren't so tortured as to have done the same.

There is a long standing support within the community to move to XML
including:

Josh Berkus
Josh Drake
Robert Treat
Andrew Dunslane
David Blewett
David Fetter
Devrim Gunduz
Darcy Buskermolen

And that is just from #postgresql

The french team also uses Docbook XML and they can generate a PDF in 30
minutes... it takes us DAYS because of the SGML.

How about we stopping using assembling and move to C shall we?

Sincerely,

Joshua D. Drake

--

      === The PostgreSQL Company: Command Prompt, Inc. ===
Sales/Support: +1.503.667.4564 || 24x7/Emergency: +1.800.492.2240
Providing the most comprehensive  PostgreSQL solutions since 1997
             http://www.commandprompt.com/

Donate to the PostgreSQL Project: http://www.postgresql.org/about/donate

Re: Switching to XML

From

Scott Marlowe

Date:

08 December 2006, 17:46:02

On Fri, 2006-12-08 at 15:26, Joshua D. Drake wrote:
> On Fri, 2006-12-08 at 21:58 +0100, Peter Eisentraut wrote:
> > Joshua D. Drake wrote:
> > > You can create, edit, convert, save, and open docbook xml in
> > > OpenOfice.org.
> >
> > Sure, there are more editing options with DocBook XML.  No one disputes
> > that.  But the question at hand was about processing the DocBook.
>
> Yes which is generated from our use of SGML which is the core of this
> problem and the core of the question as a whole.
>
> SGML is making working with the documentation *harder*.
>
> We have people that *DO NOT* contribute because of this SGML
> requirement. They have what I consider extremely valid reasons, namely
> it is dumb to require a writer to use emacs or write tags explictly.

I'm one.  I had the vacuuming stuff about written, but couldn't figure
out the tool set this summer, and didn't have the time.  Does OO make it
pretty much painless to work with?  I couldn't find any tools that I
felt comfortable using for SGML / Docbooks.

Re: Switching to XML

From

"Joshua D. Drake"

Date:

08 December 2006, 17:50:17

On Fri, 2006-12-08 at 15:45 -0600, Scott Marlowe wrote:
> On Fri, 2006-12-08 at 15:26, Joshua D. Drake wrote:
> > On Fri, 2006-12-08 at 21:58 +0100, Peter Eisentraut wrote:
> > > Joshua D. Drake wrote:
> > > > You can create, edit, convert, save, and open docbook xml in
> > > > OpenOfice.org.
> > >
> > > Sure, there are more editing options with DocBook XML.  No one disputes
> > > that.  But the question at hand was about processing the DocBook.
> >
> > Yes which is generated from our use of SGML which is the core of this
> > problem and the core of the question as a whole.
> >
> > SGML is making working with the documentation *harder*.
> >
> > We have people that *DO NOT* contribute because of this SGML
> > requirement. They have what I consider extremely valid reasons, namely
> > it is dumb to require a writer to use emacs or write tags explictly.
>
> I'm one.  I had the vacuuming stuff about written, but couldn't figure
> out the tool set this summer, and didn't have the time.  Does OO make it
> pretty much painless to work with?  I couldn't find any tools that I
> felt comfortable using for SGML / Docbooks.

OO treats Docbook like a normal document. You will however loose styles
(like bold, italic). It does support tables, it understands
transformation from things like sect1 (from OO heading1) etc...

The style loss is to be expected because Docbook doesn't contain
representation data. That belongs to a style sheet.

Sincerely,

Joshua D. Drake



--

      === The PostgreSQL Company: Command Prompt, Inc. ===
Sales/Support: +1.503.667.4564 || 24x7/Emergency: +1.800.492.2240
Providing the most comprehensive  PostgreSQL solutions since 1997
             http://www.commandprompt.com/

Donate to the PostgreSQL Project: http://www.postgresql.org/about/donate

Re: Switching to XML

From

Scott Marlowe

Date:

08 December 2006, 17:54:06

On Fri, 2006-12-08 at 15:51, Joshua D. Drake wrote:
> On Fri, 2006-12-08 at 15:45 -0600, Scott Marlowe wrote:
> > On Fri, 2006-12-08 at 15:26, Joshua D. Drake wrote:
> > > On Fri, 2006-12-08 at 21:58 +0100, Peter Eisentraut wrote:
> > > > Joshua D. Drake wrote:
> > > > > You can create, edit, convert, save, and open docbook xml in
> > > > > OpenOfice.org.
> > > >
> > > > Sure, there are more editing options with DocBook XML.  No one disputes
> > > > that.  But the question at hand was about processing the DocBook.
> > >
> > > Yes which is generated from our use of SGML which is the core of this
> > > problem and the core of the question as a whole.
> > >
> > > SGML is making working with the documentation *harder*.
> > >
> > > We have people that *DO NOT* contribute because of this SGML
> > > requirement. They have what I consider extremely valid reasons, namely
> > > it is dumb to require a writer to use emacs or write tags explictly.
> >
> > I'm one.  I had the vacuuming stuff about written, but couldn't figure
> > out the tool set this summer, and didn't have the time.  Does OO make it
> > pretty much painless to work with?  I couldn't find any tools that I
> > felt comfortable using for SGML / Docbooks.
>
> OO treats Docbook like a normal document. You will however loose styles
> (like bold, italic). It does support tables, it understands
> transformation from things like sect1 (from OO heading1) etc...
>
> The style loss is to be expected because Docbook doesn't contain
> representation data. That belongs to a style sheet.

and let me add that I'm not really anti-sgml docbook, I just couldn't
find a "starter set" for editing the stuff.  It seemed like everything I
found on docbook xml was written for people who already use docbook xml.

Re: Switching to XML

From

"Joshua D. Drake"

Date:

08 December 2006, 18:01:24

> > OO treats Docbook like a normal document. You will however loose styles
> > (like bold, italic). It does support tables, it understands
> > transformation from things like sect1 (from OO heading1) etc...
> >
> > The style loss is to be expected because Docbook doesn't contain
> > representation data. That belongs to a style sheet.
>
> and let me add that I'm not really anti-sgml docbook, I just couldn't
> find a "starter set" for editing the stuff.  It seemed like everything I
> found on docbook xml was written for people who already use docbook xml.

Nor am I anti-sgml. I am however anti-noncontribution, if people are not
contributing because of our sgml but would if it is xml, that is a no
brainer.

Further, here is a real world problem that our toolset creates...

I take 5 minutes, change the stylesheet for SGML. I want to see what my
changes will look like... 3 days later, I will know.

That is stupid. If it was XML, it would be 30 minutes. That is a
workable timeframe.


Sincerely,

Joshua D. Drake


>
> ---------------------------(end of broadcast)---------------------------
> TIP 7: You can help support the PostgreSQL project by donating at
>
>                 http://www.postgresql.org/about/donate
>
--

      === The PostgreSQL Company: Command Prompt, Inc. ===
Sales/Support: +1.503.667.4564 || 24x7/Emergency: +1.800.492.2240
Providing the most comprehensive  PostgreSQL solutions since 1997
             http://www.commandprompt.com/

Donate to the PostgreSQL Project: http://www.postgresql.org/about/donate

Re: Switching to XML

From

Oisin Glynn

Date:

08 December 2006, 18:10:42

Joshua D. Drake wrote:
>>> OO treats Docbook like a normal document. You will however loose styles
>>> (like bold, italic). It does support tables, it understands
>>> transformation from things like sect1 (from OO heading1) etc...
>>>
>>> The style loss is to be expected because Docbook doesn't contain
>>> representation data. That belongs to a style sheet.
>>>
>> and let me add that I'm not really anti-sgml docbook, I just couldn't
>> find a "starter set" for editing the stuff.  It seemed like everything I
>> found on docbook xml was written for people who already use docbook xml.
>>
>
> Nor am I anti-sgml. I am however anti-noncontribution, if people are not
> contributing because of our sgml but would if it is xml, that is a no
> brainer.
>
> Further, here is a real world problem that our toolset creates...
>
> I take 5 minutes, change the stylesheet for SGML. I want to see what my
> changes will look like... 3 days later, I will know.
>
> That is stupid. If it was XML, it would be 30 minutes. That is a
> workable timeframe.
>
>
> Sincerely,
>
> Joshua D. Drake
>
>
>
>> ---------------------------(end of broadcast)---------------------------
>> TIP 7: You can help support the PostgreSQL project by donating at
>>
>>                 http://www.postgresql.org/about/donate
>>
>>
I am not an expert in any of this but would say the one time I had
something I would have added to a doc (windows Vs *nix differences in
copy command) I was in knots with the whole SGML thing and figured it
would be quicker to just post an answer on the mailing list so it would
be immortalized, it should not be so hard to contribute. I feel there
are people who maybe cannot code or add in that way  but could add to
docs if it was easier. I had an easier time setting up a pgAdmin build
than figuring out the SGML stuff.

Oisin

Attachment

smime.p7s

Re: Switching to XML

From

Mario

Date:

08 December 2006, 21:49:53

On 08/12/06, Joshua D. Drake <jd@commandprompt.com> wrote:
> There is a long standing support within the community to move to XML
> including:
>
> Josh Berkus
> Josh Drake
> Robert Treat
> Andrew Dunslane
> David Blewett
> David Fetter
> Devrim Gunduz
> Darcy Buskermolen
>
> And that is just from #postgresql
>

  And now in Chile, I wrote an app to translate postgres documentation
(l10n.postgresql.cl). I took SGML sources, transform to XML and to POT
files

  The results are good! but is it not the best. To move from SGML to
XML is a hard work.

>

--
http://www.advogato.org/person/mgonzalez/

Re: Switching to XML

From

Guillaume Lelarge

Date:

09 December 2006, 04:19:41

Joshua D. Drake a écrit :
> On Fri, 2006-12-08 at 21:58 +0100, Peter Eisentraut wrote:
>> Joshua D. Drake wrote:
>>> You can create, edit, convert, save, and open docbook xml in
>>> OpenOfice.org.
>> Sure, there are more editing options with DocBook XML.  No one disputes
>> that.  But the question at hand was about processing the DocBook.
>
> Yes which is generated from our use of SGML which is the core of this
> problem and the core of the question as a whole.
>
> SGML is making working with the documentation *harder*.
>

+1

> We have people that *DO NOT* contribute because of this SGML
> requirement. They have what I consider extremely valid reasons, namely
> it is dumb to require a writer to use emacs or write tags explictly.
>
> Hell, the only reason I have even bothered to contribute what little I
> have to the docs is because I wrote a book in SGML, thus it is a no
> brainer to me. Others aren't so tortured as to have done the same.
>

I'm not so sure it will help you find more contributors. I'm part of a
project which aims to translate HOWTO from TLDP. They don't find
contributors and we too have really hard times to find contributors
despite the fact we try to only use DocBook XML (TLDP use DocBook SGML,
DocBook XML and LinuxDoc formats).

Did you try to use OpenOffice.org with DocBook ? I tried once and it was
a complete disaster. But it was a long time ago. I will try again this
week-end.

> There is a long standing support within the community to move to XML
> including:
>
> Josh Berkus
> Josh Drake
> Robert Treat
> Andrew Dunslane
> David Blewett
> David Fetter
> Devrim Gunduz
> Darcy Buskermolen
>
> And that is just from #postgresql
>
> The french team also uses Docbook XML and they can generate a PDF in 30
> minutes... it takes us DAYS because of the SGML.
>

In fact, we need 15 minutes to build HTML files and 10 minutes to build
PDF file. To be completely honest, I don't seem to be able to build PDF
file for 8.2.0 release. I must have made a mistake (or perhaps a lot of
:) ).

Regards.

--
Guillaume.
<!-- http://abs.traduc.org/
     http://lfs.traduc.org/
     http://docs.postgresqlfr.org/ -->

Re: Switching to XML

From

Theo Kramer

Date:

09 December 2006, 05:07:43

On Fri, 2006-12-08 at 13:26 -0800, Joshua D. Drake wrote:
<snip>
> Yes which is generated from our use of SGML which is the core of this
> problem and the core of the question as a whole.
>
> SGML is making working with the documentation *harder*.

From a total outsider's point of view I have to disagree. It took me a
couple of minutes to figure out how to make the tiny change I did the
other day by looking at the rest of the sgml (managed to get the diff
the wrong way around but not the sgml :/).

> We have people that *DO NOT* contribute because of this SGML
> requirement. They have what I consider extremely valid reasons, namely
> it is dumb to require a writer to use emacs or write tags explictly.

Again would have to disagree - surely if someone really wants to
contribute they could provide their input in plain text, and someone on
the list could then integrate those contribs.

> Hell, the only reason I have even bothered to contribute what little I
> have to the docs is because I wrote a book in SGML, thus it is a no
> brainer to me. Others aren't so tortured as to have done the same.

I would hate to hand edit the stuff generated by something like
OpenOffice.org.

> There is a long standing support within the community to move to XML
> including:
>
> Josh Berkus
> Josh Drake
> Robert Treat
> Andrew Dunslane
> David Blewett
> David Fetter
> Devrim Gunduz
> Darcy Buskermolen
>
> And that is just from #postgresql
>
> The french team also uses Docbook XML and they can generate a PDF in 30
> minutes... it takes us DAYS because of the SGML.

Here I agree - 30 minutes vs days is a good reason - as long as editing
with an ascii editor is not taken away.

--
Regards
Theo

Re: Switching to XML

From

Peter Eisentraut

Date:

09 December 2006, 06:59:24

Joshua D. Drake wrote:
> Further, here is a real world problem that our toolset creates...
>
> I take 5 minutes, change the stylesheet for SGML. I want to see what
> my changes will look like... 3 days later, I will know.
>
> That is stupid. If it was XML, it would be 30 minutes. That is a
> workable timeframe.

I feel like I'm talkling to a wall here, but I'm going to say this one
last time:

Any processing toolchain that you can use with DocBook XML you can use
with DocBook SGML and vice versa.

If you know of a way to create a PDF off DocBook XML in 30 minutes,
please tell us.

--
Peter Eisentraut
http://developer.postgresql.org/~petere/

Re: Switching to XML

From

Tom Lane

Date:

09 December 2006, 14:19:00

Josh Berkus <josh@agliodbs.com> writes:
> As I said then, this is absolutely untrue.  OpenOffice.org, for example,
> works with DocBook XML but not SGML.  There are also a plethora of XML
> editing and publishing tools which can been used for Docbook XML which
> are not available for SGML.  A simple look at this page:
> http://wiki.docbook.org/topic/DocBookAuthoringTools
> .... shows that there are more than twice as many authoring tools which
> support only XML as support SGML -- and that most of the tools which
> support SGML are out-of-maintenance.

This is confusing authoring tools (ie, stuff for more or less WYSIWYG
editing of the document source) with output generation tools.

As for authoring tools, show me one that produces SGML or XML that's
reasonably readable, and I might worry about allowing people to use it.
Most of the ones I've seen would render the doc sources unreadable for
anyone not using an authoring tool (possibly even the very same
authoring tool).  We are not going to move in that direction
because it would piss off the people who do the bulk of the work now.

            regards, tom lane

Re: Switching to XML

From

Tom Lane

Date:

09 December 2006, 14:32:43

>> The french team also uses Docbook XML and they can generate a PDF in 30
>> minutes... it takes us DAYS because of the SGML.

Has anyone looked into actually fixing the performance problem?

oprofile results for jade trying to produce tex output from our docs are
suggestive of a localized performance issue:

samples  %        symbol name
2082917  98.5829  OpenJade_DSSSL::PairNodeListObj::nodeListFirst(OpenJade_DSSSL::EvalContext&,
OpenJade_DSSSL::Interpreter&)
9713      0.4597  OpenJade_DSSSL::PairNodeListObj::nodeListRest(OpenJade_DSSSL::EvalContext&,
OpenJade_DSSSL::Interpreter&)
5019      0.2375  OpenJade_DSSSL::AppendSosofoObj::traceSubObjects(Collector&) const
3571      0.1690  Collector::collect()
1938      0.0917  OpenJade_DSSSL::FlowObj::traceSubObjects(Collector&) const

I attached to the process with gdb and found it nested four thousand (!)
call levels deep in OpenJade_DSSSL::PairNodeListObj::nodeListFirst and
OpenJade_DSSSL::PairNodeListObj::nodeListRest calls.  Meanwhile, looking
at the output-so-far-emitted makes me think it was working on a fairly
large <programlisting> example.  The last little bit is:

{asis}\def\InputWhitespaceTreatment%
{preserve}}\Seq%
{}\Seq%
{}~~~~\endSeq{}/*
\Seq%
{}~~~~\endSeq{}~*~testlibpq2.c
\Seq%
{}~~~~\endSeq{}~*~~~~~~Test~of~the~asynchronous~notification~interface
\Seq%
{}~~~~\endSeq{}~*
\Seq%
{}~~~~\endSeq{}~*~Start~this~program,~then~from~psql~in~another~window~do
\Seq%
{}~~~~\endSeq{}~*~~~NOTIFY~TBL2;
\Seq%
{}~~~~\endSeq{}~*~Repeat~four~times~to~get~this~program~to~exit.
\Seq%
{}~~~~\endSeq{}~*
\Seq%
{}~~~~\endSeq{}~*~Or,~if~you~want~to~get~fancy,~try~this:
\Seq%
{}~~~~\endSeq{}~*~populate~a~database~with~th

What it looks like to me is that there is some bit of stupidity that is
producing a deeply nested list representation of a <programlisting>
section, probably one list level per character in the text, making the
runtime O(N^2) or worse in the length of the <programlisting>.  (The
particular example it's stuck on here is about 10K characters.)

Since jade does not go into this kind of spiral when producing html
output from the same sources, I suggest that it's not jade's fault,
but rather crummy coding in the sgml-to-tex conversion scripts it's
using.  I don't know enough about those to know where to look, but maybe
someone here does?

            regards, tom lane

Re: Switching to XML

From

"Joshua D. Drake"

Date:

09 December 2006, 15:59:56

>
> > Hell, the only reason I have even bothered to contribute what little I
> > have to the docs is because I wrote a book in SGML, thus it is a no
> > brainer to me. Others aren't so tortured as to have done the same.
>
> I would hate to hand edit the stuff generated by something like
> OpenOffice.org.

As would I.

> >
> > The french team also uses Docbook XML and they can generate a PDF in 30
> > minutes... it takes us DAYS because of the SGML.
>
> Here I agree - 30 minutes vs days is a good reason - as long as editing
> with an ascii editor is not taken away.

I would never expect to take editing away.. I personally would likely
just use bluefish, not Openoffice.

Sincerely,

Joshua D. Drake


>
--

      === The PostgreSQL Company: Command Prompt, Inc. ===
Sales/Support: +1.503.667.4564 || 24x7/Emergency: +1.800.492.2240
Providing the most comprehensive  PostgreSQL solutions since 1997
             http://www.commandprompt.com/

Donate to the PostgreSQL Project: http://www.postgresql.org/about/donate

Re: Switching to XML

From

"Joshua D. Drake"

Date:

09 December 2006, 17:03:39

> If you know of a way to create a PDF off DocBook XML in 30 minutes,
> please tell us.

See the post on this thread from :

Guillaume Lelarge

10 minutes to generate a pdf.

Sincerely,

Joshua D. Drake



>
--

      === The PostgreSQL Company: Command Prompt, Inc. ===
Sales/Support: +1.503.667.4564 || 24x7/Emergency: +1.800.492.2240
Providing the most comprehensive  PostgreSQL solutions since 1997
             http://www.commandprompt.com/

Donate to the PostgreSQL Project: http://www.postgresql.org/about/donate

Re: Switching to XML

From

Peter Eisentraut

Date:

09 December 2006, 19:33:02

Joshua D. Drake wrote:
> > If you know of a way to create a PDF off DocBook XML in 30 minutes,
> > please tell us.
>
> See the post on this thread from :
>
> Guillaume Lelarge
>
> 10 minutes to generate a pdf.

Right.  So why don't we just use that?

--
Peter Eisentraut
http://developer.postgresql.org/~petere/

Re: Switching to XML

From

"Joshua D. Drake"

Date:

09 December 2006, 21:50:12

On Sun, 2006-12-10 at 00:32 +0100, Peter Eisentraut wrote:
> Joshua D. Drake wrote:
> > > If you know of a way to create a PDF off DocBook XML in 30 minutes,
> > > please tell us.
> >
> > See the post on this thread from :
> >
> > Guillaume Lelarge
> >
> > 10 minutes to generate a pdf.
>
> Right.  So why don't we just use that?

How about the fact that we are putting undue burden on our fellow
regional projects?

How about it isn't in english?

Joshua D. Drake


>
--

      === The PostgreSQL Company: Command Prompt, Inc. ===
Sales/Support: +1.503.667.4564 || 24x7/Emergency: +1.800.492.2240
Providing the most comprehensive  PostgreSQL solutions since 1997
             http://www.commandprompt.com/

Donate to the PostgreSQL Project: http://www.postgresql.org/about/donate

Re: Switching to XML

From

Peter Eisentraut

Date:

09 December 2006, 22:13:55

Joshua D. Drake wrote:
> On Sun, 2006-12-10 at 00:32 +0100, Peter Eisentraut wrote:
> > Joshua D. Drake wrote:
> > > > If you know of a way to create a PDF off DocBook XML in 30
> > > > minutes, please tell us.
> > >
> > > See the post on this thread from :
> > >
> > > Guillaume Lelarge
> > >
> > > 10 minutes to generate a pdf.
> >
> > Right.  So why don't we just use that?
>
> How about the fact that we are putting undue burden on our fellow
> regional projects?
>
> How about it isn't in english?

Well, I didn't actually find a post titled "10 minutes to generate a
pdf." anywhere, so I assumed you referred to another one of his posts
which described how they used FOP to build PDF documentation.  That
method does not burden anyone unduly and it is in English.  But maybe
we're not talking about the same thing.

--
Peter Eisentraut
http://developer.postgresql.org/~petere/

Re: Switching to XML

From

Peter Eisentraut

Date:

09 December 2006, 22:15:51

Tom Lane wrote:
> Since jade does not go into this kind of spiral when producing html
> output from the same sources, I suggest that it's not jade's fault,
> but rather crummy coding in the sgml-to-tex conversion scripts it's
> using.

Right.  I fixed that, so now it takes about 15 minutes to build the
whole thing.

--
Peter Eisentraut
http://developer.postgresql.org/~petere/

Re: Switching to XML

From

Mark Kirkwood

Date:

09 December 2006, 22:54:51

Tom Lane wrote:

>
> As for authoring tools, show me one that produces SGML or XML that's
> reasonably readable, and I might worry about allowing people to use it.
> Most of the ones I've seen would render the doc sources unreadable for
> anyone not using an authoring tool (possibly even the very same
> authoring tool).  We are not going to move in that direction
> because it would piss off the people who do the bulk of the work now.
>

+1

With regards to the current setup discouraging contribution - I've done
one (or maybe two) doc submissions and ISTR that it was not very
difficult - even as a complete SGML newbie - to get up and running, make
my changes and build the (HTML) docs.

Cheers

Mark

Re: Switching to XML

From

Tom Lane

Date:

09 December 2006, 23:19:12

Peter Eisentraut <peter_e@gmx.net> writes:
> Tom Lane wrote:
>> Since jade does not go into this kind of spiral when producing html
>> output from the same sources, I suggest that it's not jade's fault,
>> but rather crummy coding in the sgml-to-tex conversion scripts it's
>> using.

> Right.  I fixed that, so now it takes about 15 minutes to build the
> whole thing.

Actually, I just finished fixing what seems to be the underlying problem
in jade: it's got a spectacularly bad implementation of linked lists.
In PG-code terms, if the list is built by successive lappends(), then
both lfirst() and lnext() take O(N) time for an N-element list, thus
iterating through the list in the usual way is O(N^2) ... and not even
with a reasonably small multiplier, because it stresses the hell out of
memory allocation and garbage collection while it's at it.  (Which may
well mean that it's really more like O(N^3), since the garbage collector
will iterate over every allocated object every so often.)

With the patch it takes me about 5 minutes to do the jade step of the
PDF build, using this morning's SGML sources.  (I don't know how to set
the TeX configuration to get the pdfjadetex steps to go through, so I
dunno about total time.)

However, I have no idea what it'll take to get this patch propagated
into the copies people actually use, so your fix sounds good for the
short term.

            regards, tom lane

diff -cr openjade-1.3.2.orig/style/ELObj.cxx openjade-1.3.2/style/ELObj.cxx
*** openjade-1.3.2.orig/style/ELObj.cxx    Fri Jan 11 10:48:38 2002
--- openjade-1.3.2/style/ELObj.cxx    Sat Dec  9 21:58:36 2006
***************
*** 1044,1049 ****
--- 1044,1054 ----
    return new (interp) ReverseNodeListObj(this);
  }

+ NodeListObj *NodeListObj::nodeListAppend(NodeListObj *newTail, Interpreter &interp)
+ {
+   return new (interp) HeadNodeListObj(this, newTail, interp);
+ }
+
  long NodeListObj::nodeListLength(EvalContext &context, Interpreter &interp)
  {
    NodeListObj *nl = this;
***************
*** 1217,1236 ****

  NodeListObj *PairNodeListObj::nodeListRest(EvalContext &context, Interpreter &interp)
  {
!   if (!head_ || !head_->nodeListFirst(context, interp))
      return tail_->nodeListRest(context, interp);
    NodeListObj *tem = head_->nodeListRest(context, interp);
    ELObjDynamicRoot protect(interp, tem);
!   return new (interp) PairNodeListObj(tem, tail_);
  }

  NodeListObj *PairNodeListObj::nodeListChunkRest(EvalContext &context, Interpreter &interp, bool &chunk)
  {
!   if (!head_ || !head_->nodeListFirst(context, interp))
      return tail_->nodeListChunkRest(context, interp, chunk);
    NodeListObj *tem = head_->nodeListChunkRest(context, interp, chunk);
    ELObjDynamicRoot protect(interp, tem);
!   return new (interp) PairNodeListObj(tem, tail_);
  }

  void PairNodeListObj::traceSubObjects(Collector &c) const
--- 1222,1260 ----

  NodeListObj *PairNodeListObj::nodeListRest(EvalContext &context, Interpreter &interp)
  {
!   if (!head_)
!     return tail_->nodeListRest(context, interp);
!   if (!head_->nodeListFirst(context, interp)) {
!     head_ = 0;
      return tail_->nodeListRest(context, interp);
+   }
    NodeListObj *tem = head_->nodeListRest(context, interp);
    ELObjDynamicRoot protect(interp, tem);
!   if (!tem || !tem->nodeListFirst(context, interp))
!     return tail_;
!   return tem->nodeListAppend(tail_, interp);
  }

  NodeListObj *PairNodeListObj::nodeListChunkRest(EvalContext &context, Interpreter &interp, bool &chunk)
  {
!   if (!head_)
!     return tail_->nodeListChunkRest(context, interp, chunk);
!   if (!head_->nodeListFirst(context, interp)) {
!     head_ = 0;
      return tail_->nodeListChunkRest(context, interp, chunk);
+   }
    NodeListObj *tem = head_->nodeListChunkRest(context, interp, chunk);
    ELObjDynamicRoot protect(interp, tem);
!   if (!tem || !tem->nodeListFirst(context, interp))
!     return tail_;
!   return tem->nodeListAppend(tail_, interp);
! }
!
! NodeListObj *PairNodeListObj::nodeListAppend(NodeListObj *newTail, Interpreter &interp)
! {
!   NodeListObj *tem = new (interp) PairNodeListObj(tail_, newTail);
!   ELObjDynamicRoot protect(interp, tem);
!   return new (interp) PairNodeListObj(head_, tem);
  }

  void PairNodeListObj::traceSubObjects(Collector &c) const
***************
*** 1239,1244 ****
--- 1263,1384 ----
    c.trace(tail_);
  }

+ CellNodeListObj::CellNodeListObj(NodeListObj *head)
+ : head_(head), next_(0)
+ {
+   hasSubObjects_ = 1;
+ }
+
+ NodePtr CellNodeListObj::nodeListFirst(EvalContext &context, Interpreter &interp)
+ {
+   if (head_) {
+     NodePtr nd(head_->nodeListFirst(context, interp));
+     if (nd)
+       return nd;
+     head_ = 0;
+   }
+   if (next_)
+     return next_->nodeListFirst(context, interp);
+   return NodePtr();
+ }
+
+ NodeListObj *CellNodeListObj::nodeListRest(EvalContext &context, Interpreter &interp)
+ {
+   if (!head_) {
+     if (next_)
+       return next_->nodeListRest(context, interp);
+     return interp.makeEmptyNodeList();
+   }
+   if (!head_->nodeListFirst(context, interp)) {
+     head_ = 0;
+     if (next_)
+       return next_->nodeListRest(context, interp);
+     return interp.makeEmptyNodeList();
+   }
+   NodeListObj *tem = head_->nodeListRest(context, interp);
+   ELObjDynamicRoot protect(interp, tem);
+   if (!tem || !tem->nodeListFirst(context, interp)) {
+     if (next_)
+       return next_;
+     return interp.makeEmptyNodeList();
+   }
+   if (next_)
+     return tem->nodeListAppend(next_, interp);
+   return tem;
+ }
+
+ NodeListObj *CellNodeListObj::nodeListChunkRest(EvalContext &context, Interpreter &interp, bool &chunk)
+ {
+   if (!head_) {
+     if (next_)
+       return next_->nodeListChunkRest(context, interp, chunk);
+     chunk = 0;
+     return interp.makeEmptyNodeList();
+   }
+   if (!head_->nodeListFirst(context, interp)) {
+     head_ = 0;
+     if (next_)
+       return next_->nodeListChunkRest(context, interp, chunk);
+     chunk = 0;
+     return interp.makeEmptyNodeList();
+   }
+   NodeListObj *tem = head_->nodeListChunkRest(context, interp, chunk);
+   ELObjDynamicRoot protect(interp, tem);
+   if (!tem || !tem->nodeListFirst(context, interp)) {
+     if (next_)
+       return next_;
+     return interp.makeEmptyNodeList();
+   }
+   if (next_)
+     return tem->nodeListAppend(next_, interp);
+   return tem;
+ }
+
+ void CellNodeListObj::traceSubObjects(Collector &c) const
+ {
+   c.trace(head_);
+   c.trace(next_);
+ }
+
+ HeadNodeListObj::HeadNodeListObj(NodeListObj *first, NodeListObj *second, Interpreter &interp)
+ : first_(0), last_(0)
+ {
+   hasSubObjects_ = 1;
+   ELObjDynamicRoot protect(interp, this);
+   first_ = new (interp) CellNodeListObj(first);
+   last_ = new (interp) CellNodeListObj(second);
+   first_->next_ = last_;
+ }
+
+ NodePtr HeadNodeListObj::nodeListFirst(EvalContext &context, Interpreter &interp)
+ {
+   return first_->nodeListFirst(context, interp);
+ }
+
+ NodeListObj *HeadNodeListObj::nodeListRest(EvalContext &context, Interpreter &interp)
+ {
+   return first_->nodeListRest(context, interp);
+ }
+
+ NodeListObj *HeadNodeListObj::nodeListChunkRest(EvalContext &context, Interpreter &interp, bool &chunk)
+ {
+   return first_->nodeListChunkRest(context, interp, chunk);
+ }
+
+ NodeListObj *HeadNodeListObj::nodeListAppend(NodeListObj *newTail, Interpreter &interp)
+ {
+   CellNodeListObj *tem = new (interp) CellNodeListObj(newTail);
+   last_->next_ = tem;
+   last_ = tem;
+   return this;
+ }
+
+ void HeadNodeListObj::traceSubObjects(Collector &c) const
+ {
+   c.trace(first_);
+   // first_ will recurse to the rest
+ }
+
  ReverseNodeListObj::ReverseNodeListObj(NodeListObj *nl)
  : nl_(nl), reversed_(0)
  {
diff -cr openjade-1.3.2.orig/style/ELObj.h openjade-1.3.2/style/ELObj.h
*** openjade-1.3.2.orig/style/ELObj.h    Fri Jan 11 10:48:38 2002
--- openjade-1.3.2/style/ELObj.h    Sat Dec  9 20:14:23 2006
***************
*** 426,431 ****
--- 426,432 ----
    virtual NodeListObj *nodeListChunkRest(EvalContext &, Interpreter &, bool &chunk);
    virtual NodePtr nodeListRef(long, EvalContext &, Interpreter &);
    virtual NodeListObj *nodeListReverse(EvalContext &, Interpreter &);
+   virtual NodeListObj *nodeListAppend(NodeListObj *, Interpreter &);
    virtual long nodeListLength(EvalContext &, Interpreter &);
    virtual bool suppressError();
  };
***************
*** 493,504 ****
--- 494,533 ----
    NodePtr nodeListFirst(EvalContext &, Interpreter &);
    NodeListObj *nodeListRest(EvalContext &, Interpreter &);
    NodeListObj *nodeListChunkRest(EvalContext &, Interpreter &, bool &);
+   NodeListObj *nodeListAppend(NodeListObj *, Interpreter &);
    void traceSubObjects(Collector &) const;
  private:
    NodeListObj *head_; // may be null
    NodeListObj *tail_;
  };

+ class CellNodeListObj : public NodeListObj {
+ public:
+   CellNodeListObj(NodeListObj *);
+   NodePtr nodeListFirst(EvalContext &, Interpreter &);
+   NodeListObj *nodeListRest(EvalContext &, Interpreter &);
+   NodeListObj *nodeListChunkRest(EvalContext &, Interpreter &, bool &);
+   void traceSubObjects(Collector &) const;
+ private:
+   NodeListObj *head_; // may be null
+   CellNodeListObj *next_; // may be null
+   friend class HeadNodeListObj; // needs access to next_ link
+ };
+
+ class HeadNodeListObj : public NodeListObj {
+ public:
+   HeadNodeListObj(NodeListObj *, NodeListObj *, Interpreter &);
+   NodePtr nodeListFirst(EvalContext &, Interpreter &);
+   NodeListObj *nodeListRest(EvalContext &, Interpreter &);
+   NodeListObj *nodeListChunkRest(EvalContext &, Interpreter &, bool &);
+   NodeListObj *nodeListAppend(NodeListObj *, Interpreter &);
+   void traceSubObjects(Collector &) const;
+ private:
+   // there are always at least two cells in the list, hence first_ != last_
+   CellNodeListObj *first_;
+   CellNodeListObj *last_;
+ };
+
  inline
  bool ELObj::equal(ELObj &obj1, ELObj &obj2)
  {
diff -cr openjade-1.3.2.orig/style/primitive.cxx openjade-1.3.2/style/primitive.cxx
*** openjade-1.3.2.orig/style/primitive.cxx    Wed Mar 13 07:21:22 2002
--- openjade-1.3.2/style/primitive.cxx    Sat Dec  9 21:48:01 2006
***************
*** 3846,3870 ****

  DEFPRIMITIVE(NodeList, argc, argv, context, interp, loc)
  {
!   if (argc == 0)
      return interp.makeEmptyNodeList();
!   int i = argc - 1;
!   NodeListObj *nl = argv[i]->asNodeList();
    if (!nl)
      return argError(interp, loc,
!             InterpreterMessages::notANodeList, i, argv[i]);
!   if (i > 0) {
!     ELObjDynamicRoot protect(interp, nl);
!     for (;;) {
!       i--;
        NodeListObj *tem = argv[i]->asNodeList();
        if (!tem)
!         return argError(interp, loc,
!                     InterpreterMessages::notANodeList, i, argv[i]);
!       nl = new (interp) PairNodeListObj(tem, nl);
!       if (i == 0)
!     break;
!       protect = nl;
      }
    }
    return nl;
--- 3846,3868 ----

  DEFPRIMITIVE(NodeList, argc, argv, context, interp, loc)
  {
!   if (argc <= 0)
      return interp.makeEmptyNodeList();
!   NodeListObj *nl = argv[0]->asNodeList();
    if (!nl)
      return argError(interp, loc,
!             InterpreterMessages::notANodeList, 0, argv[0]);
!   if (argc > 1) {
!     ELObjDynamicRoot protect(interp);
!     ELObjDynamicRoot protect2(interp);
!     for (int i = 1; i < argc; i++) {
!       protect = nl;
        NodeListObj *tem = argv[i]->asNodeList();
        if (!tem)
!     return argError(interp, loc,
!             InterpreterMessages::notANodeList, i, argv[i]);
!       protect2 = tem;
!       nl = nl->nodeListAppend(tem, interp);
      }
    }
    return nl;

Re: Switching to XML

From

Tom Lane

Date:

10 December 2006, 00:53:13

I wrote:
> With the patch it takes me about 5 minutes to do the jade step of the
> PDF build, using this morning's SGML sources.

Ooops, that was with an -O0 build for debugging.  After rebuilding with
the normal -O2 optimization, it takes about 70 seconds, just about
exactly the same as for the HTML-output case.

            regards, tom lane

Re: Switching to XML

From

"Joshua D. Drake"

Date:

10 December 2006, 01:11:41

On Sun, 2006-12-10 at 03:13 +0100, Peter Eisentraut wrote:
> Joshua D. Drake wrote:
> > On Sun, 2006-12-10 at 00:32 +0100, Peter Eisentraut wrote:
> > > Joshua D. Drake wrote:
> > > > > If you know of a way to create a PDF off DocBook XML in 30
> > > > > minutes, please tell us.
> > > >
> > > > See the post on this thread from :
> > > >
> > > > Guillaume Lelarge
> > > >
> > > > 10 minutes to generate a pdf.
> > >
> > > Right.  So why don't we just use that?
> >
> > How about the fact that we are putting undue burden on our fellow
> > regional projects?
> >
> > How about it isn't in english?
>
> Well, I didn't actually find a post titled "10 minutes to generate a
> pdf." anywhere, so I assumed you referred to another one of his posts
> which described how they used FOP to build PDF documentation.  That
> method does not burden anyone unduly and it is in English.  But maybe
> we're not talking about the same thing.

I meant, *their* xml file is not english. So it doesn't do any good.
Remember this whole current thread started because we are trying to
create print ready documentation, which we currently do not have.

Sincerely,

Joshua D. Drake


>
--

      === The PostgreSQL Company: Command Prompt, Inc. ===
Sales/Support: +1.503.667.4564 || 24x7/Emergency: +1.800.492.2240
Providing the most comprehensive  PostgreSQL solutions since 1997
             http://www.commandprompt.com/

Donate to the PostgreSQL Project: http://www.postgresql.org/about/donate

Re: Switching to XML

From

Tom Lane

Date:

10 December 2006, 01:36:46

I wrote:
> With the patch it takes me about 5 minutes to do the jade step of the
> PDF build, using this morning's SGML sources.  (I don't know how to set
> the TeX configuration to get the pdfjadetex steps to go through, so I
> dunno about total time.)

OK, I read Peter's notes about suitable TeX configuration, and with
that plus suitably patched openjade I get this from a standing start:

$ time make postgres.pdf
... much output ...
Output written on postgres.pdf (1670 pages, 13561269 bytes).
Transcript written on postgres.log.

real    8m23.949s
user    8m14.263s
sys     0m9.241s
$

But my goodness it produces a lot of TeX warnings :-(.  I can understand
that no one was gonna look into those with a three-day turnaround time,
but maybe now we can think about cleaning them up?  Personally I
haven't used TeX/LaTeX in anger in about ten years, but most of what
scrolled past looked easily fixable, and at least some of it would
translate to visible ugliness in the output.

            regards, tom lane

Re: Switching to XML

From

"Joshua D. Drake"

Date:

10 December 2006, 01:36:51

On Sat, 2006-12-09 at 09:21 +0100, Guillaume Lelarge wrote:
> Joshua D. Drake a écrit :
> > On Fri, 2006-12-08 at 21:58 +0100, Peter Eisentraut wrote:
> >> Joshua D. Drake wrote:
> >>> You can create, edit, convert, save, and open docbook xml in
> >>> OpenOfice.org.
> >> Sure, there are more editing options with DocBook XML.  No one disputes
> >> that.  But the question at hand was about processing the DocBook.
> >
> > Yes which is generated from our use of SGML which is the core of this
> > problem and the core of the question as a whole.
> >
> > SGML is making working with the documentation *harder*.
> >
>
> +1

I believe this point is being overlooked is people's willingness to code
a solution ;).

Guillaume could you please tell us *why* you ported the SGML to XML
before doing your translation and processing? Perhaps that will carry
more weight to this discussion then my hand waving.

Sincerely,

Joshua D. Drake



--

      === The PostgreSQL Company: Command Prompt, Inc. ===
Sales/Support: +1.503.667.4564 || 24x7/Emergency: +1.800.492.2240
Providing the most comprehensive  PostgreSQL solutions since 1997
             http://www.commandprompt.com/

Donate to the PostgreSQL Project: http://www.postgresql.org/about/donate

Re: Switching to XML

From

"Joshua D. Drake"

Date:

10 December 2006, 01:37:47

On Fri, 2006-12-08 at 22:49 -0300, Mario wrote:
> On 08/12/06, Joshua D. Drake <jd@commandprompt.com> wrote:
> > There is a long standing support within the community to move to XML
> > including:
> >
> > Josh Berkus
> > Josh Drake
> > Robert Treat
> > Andrew Dunslane
> > David Blewett
> > David Fetter
> > Devrim Gunduz
> > Darcy Buskermolen
> >
> > And that is just from #postgresql
> >
>
>   And now in Chile, I wrote an app to translate postgres documentation
> (l10n.postgresql.cl). I took SGML sources, transform to XML and to POT
> files
>
>   The results are good! but is it not the best. To move from SGML to
> XML is a hard work.

I believe this point is being overlooked is people's willingness to code
a solution ;).

Mario could you please tell us *why* you ported the SGML to XML
before doing your translation and processing? Perhaps that will carry
more weight to this discussion then my hand waving.

Sincerely,

Joshua D. Drake



>
> >
>
--

      === The PostgreSQL Company: Command Prompt, Inc. ===
Sales/Support: +1.503.667.4564 || 24x7/Emergency: +1.800.492.2240
Providing the most comprehensive  PostgreSQL solutions since 1997
             http://www.commandprompt.com/

Donate to the PostgreSQL Project: http://www.postgresql.org/about/donate

Re: Switching to XML

From

David Fetter

Date:

10 December 2006, 01:40:49

On Sat, Dec 09, 2006 at 09:21:12AM +0100, Guillaume Lelarge wrote:
> Joshua D. Drake a écrit :
> > On Fri, 2006-12-08 at 21:58 +0100, Peter Eisentraut wrote:
> >> Joshua D. Drake wrote:
> >>> You can create, edit, convert, save, and open docbook xml in
> >>> OpenOfice.org.
> >> Sure, there are more editing options with DocBook XML.  No one disputes
> >> that.  But the question at hand was about processing the DocBook.
> >
> > Yes which is generated from our use of SGML which is the core of this
> > problem and the core of the question as a whole.
> >
> > SGML is making working with the documentation *harder*.
>
> +1

Thanks to Peter and Tom for making the PDF build faster, but the more
general problem, which is that the SGML does not actually do the same
things that XML does, no matter how many times Peter so asserts,
remains.

In addition to the long-standing problem that there is no way to edit
the SGML docs with any known GUI tool, we have a particular use case,
namely producing a multi-volume set suitable for printing as books.

> > We have people that *DO NOT* contribute because of this SGML
> > requirement.  They have what I consider extremely valid reasons,
> > namely it is dumb to require a writer to use emacs or write tags
> > explictly.

Peter, if you have a working example of a GUI tool that can be used
with the SGML source in its current form, the burden of proof is on
you to demonstrate it.  Another flat assertion from you of some kind
of mathematical equivalence between SGML and XML will *not* do the
trick.

> > Hell, the only reason I have even bothered to contribute what
> > little I have to the docs is because I wrote a book in SGML, thus
> > it is a no brainer to me. Others aren't so tortured as to have
> > done the same.
>
> I'm not so sure it will help you find more contributors. I'm part of
> a project which aims to translate HOWTO from TLDP. They don't find
> contributors and we too have really hard times to find contributors
> despite the fact we try to only use DocBook XML (TLDP use DocBook
> SGML, DocBook XML and LinuxDoc formats).
>
> Did you try to use OpenOffice.org with DocBook ? I tried once and it
> was a complete disaster. But it was a long time ago. I will try
> again this week-end.

I gave it a try post-patches, and it's still a disaster 3284 pages of
un-rendered XML.

> > There is a long standing support within the community to move to XML
> > including:
> >
> > Josh Berkus
> > Josh Drake
> > Robert Treat
> > Andrew Dunslane
> > David Blewett
> > David Fetter
> > Devrim Gunduz
> > Darcy Buskermolen
> >
> > And that is just from #postgresql
> >
> > The french team also uses Docbook XML and they can generate a PDF
> > in 30 minutes... it takes us DAYS because of the SGML.
>
> In fact, we need 15 minutes to build HTML files and 10 minutes to
> build PDF file. To be completely honest, I don't seem to be able to
> build PDF file for 8.2.0 release. I must have made a mistake (or
> perhaps a lot of :) ).

It'll be nice to have the document building cycle shorter, but the
point here is that we need to enter the 21st century.  That Tom found
a need to fork a document tool, i.e. take ownership of a whole large
piece of software, that being what forking means, is a neon sign that
means, "we're stuck with broken tools."

Cheers,
D
--
David Fetter <david@fetter.org> http://fetter.org/
phone: +1 415 235 3778        AIM: dfetter666
                              Skype: davidfetter

Remember to vote!

Re: Switching to XML

From

"Joshua D. Drake"

Date:

10 December 2006, 01:54:59

> In addition to the long-standing problem that there is no way to edit
> the SGML docs with any known GUI tool, we have a particular use case,
> namely producing a multi-volume set suitable for printing as books.

Well yes and no. There isn't really any good WYSIWYG tools, but there
are plenty of graphical validating editors for Docbook XML. I actually
don't agree with JoshBs argument on this one as most major FOSS (Gnome,
KDE, Linux) have moved to Docbook XML without the need of graphical
editors.

I do think the tools in general can be a little daunting and perhaps a
pgfoundry project that had a perl script that would download and
configure a self contained jail of tools for working with the PostgreSQL
docs would be good.

>
> > > We have people that *DO NOT* contribute because of this SGML
> > > requirement.  They have what I consider extremely valid reasons,
> > > namely it is dumb to require a writer to use emacs or write tags
> > > explictly.
>
> Peter, if you have a working example of a GUI tool that can be used
> with the SGML source in its current form, the burden of proof is on
> you to demonstrate it.  Another flat assertion from you of some kind
> of mathematical equivalence between SGML and XML will *not* do the
> trick.

I have to agree here, the reality is that the tools that are designed to
work with XML work with XML. Yes you *can* in theory make them work with
SGML, and some may work directly out of the box (Emacs would for editing
for example), not all of them would.


> > Did you try to use OpenOffice.org with DocBook ? I tried once and it
> > was a complete disaster. But it was a long time ago. I will try
> > again this week-end.
>
> I gave it a try post-patches, and it's still a disaster 3284 pages of
> un-rendered XML.

Well keep in mind that all XML is unrendered :). OpenOffice will do
things like this:

<emphasis>Hi</emphasis>
<para></para>
<para></para>
Bye

Which equates to:

Hi


Bye


We don't do that in the PostgreSQL sgml (thank goodness) so it comes out
really narly :). However it wouldn't take much to clean up small
sections for people.

> > In fact, we need 15 minutes to build HTML files and 10 minutes to
> > build PDF file. To be completely honest, I don't seem to be able to
> > build PDF file for 8.2.0 release. I must have made a mistake (or
> > perhaps a lot of :) ).
>
> It'll be nice to have the document building cycle shorter, but the
> point here is that we need to enter the 21st century.  That Tom found
> a need to fork a document tool, i.e. take ownership of a whole large
> piece of software, that being what forking means, is a neon sign that
> means, "we're stuck with broken tools."

That is a big concern for me as well. I don't think Tom expects us to
maintain the code, and we can certainly try to get the patch submitted
to the open jade project which would definitely help. However, the
movement of the open jade projects appears basically dead.

Moving to XML based tools does give us a more modern and developed
toolset.

Sincerely,

Joshua D. Drake




>
> Cheers,
> D
--

      === The PostgreSQL Company: Command Prompt, Inc. ===
Sales/Support: +1.503.667.4564 || 24x7/Emergency: +1.800.492.2240
Providing the most comprehensive  PostgreSQL solutions since 1997
             http://www.commandprompt.com/

Donate to the PostgreSQL Project: http://www.postgresql.org/about/donate

Re: Switching to XML

From

Tom Lane

Date:

10 December 2006, 02:03:02

David Fetter <david@fetter.org> writes:
> In addition to the long-standing problem that there is no way to edit
> the SGML docs with any known GUI tool,

The point not in evidence is that there exists a GUI tool we'd accept
for editing XML-format docs.  Can you point to some that don't mess up
XML source code to the point of being unreadable?

> ... That Tom found
> a need to fork a document tool, i.e. take ownership of a whole large
> piece of software, that being what forking means, is a neon sign that
> means, "we're stuck with broken tools."

Um, I spend all day every day on making sometimes-comparable
improvements in Postgres.  Will you abandon Postgres as soon as
someone points out a(nother) serious performance bug in it?

As for "fork", I have no intention of forking anything --- that patch
is already submitted upstream.

            regards, tom lane

Re: Switching to XML

From

Peter Eisentraut

Date:

10 December 2006, 04:43:50

Joshua D. Drake wrote:
> I do think the tools in general can be a little daunting and perhaps
> a pgfoundry project that had a perl script that would download and
> configure a self contained jail of tools for working with the
> PostgreSQL docs would be good.

Do you find any deficiencies in the setup instructions we have in the
documentation?

--
Peter Eisentraut
http://developer.postgresql.org/~petere/

Re: Switching to XML

From

Peter Eisentraut

Date:

10 December 2006, 05:08:17

David Fetter wrote:
> Peter, if you have a working example of a GUI tool that can be used
> with the SGML source in its current form, the burden of proof is on
> you to demonstrate it.  Another flat assertion from you of some kind
> of mathematical equivalence between SGML and XML will *not* do the
> trick.

Well, most of you are obviously led into twisting every single thread on
this mailing list related to some sort of tool trouble into a thread
that SGML is completely unusable for unrelated reasons such as that
there are no editing tools or that it takes too long to build.  I have
no interest in these sorts of uninformed discussions.

The question here was by David Blewett about creating multiple indexes.
Has anyone bothered to look into that?  My experimentation, by the way,
shows that it works exactly as described in the cited document.

> It'll be nice to have the document building cycle shorter, but the
> point here is that we need to enter the 21st century.  That Tom found
> a need to fork a document tool, i.e. take ownership of a whole large
> piece of software, that being what forking means, is a neon sign that
> means, "we're stuck with broken tools."

And which nonbroken tools for creating print output do you have up your
sleeve in the new XML world?  And what is so wonderous about these
tools that we can't use them today?

--
Peter Eisentraut
http://developer.postgresql.org/~petere/

Re: Switching to XML

From

Guillaume Lelarge

Date:

10 December 2006, 06:32:25

Joshua D. Drake a écrit :
> On Sat, 2006-12-09 at 09:21 +0100, Guillaume Lelarge wrote:
>> Joshua D. Drake a écrit :
>>> On Fri, 2006-12-08 at 21:58 +0100, Peter Eisentraut wrote:
>>>> Joshua D. Drake wrote:
>>>>> You can create, edit, convert, save, and open docbook xml in
>>>>> OpenOfice.org.
>>>> Sure, there are more editing options with DocBook XML.  No one disputes
>>>> that.  But the question at hand was about processing the DocBook.
>>> Yes which is generated from our use of SGML which is the core of this
>>> problem and the core of the question as a whole.
>>>
>>> SGML is making working with the documentation *harder*.
>>>
>> +1
>
> I believe this point is being overlooked is people's willingness to code
> a solution ;).
>
> Guillaume could you please tell us *why* you ported the SGML to XML
> before doing your translation and processing? Perhaps that will carry
> more weight to this discussion then my hand waving.
>

I use DocBook XML a lot more than SGML on some translation's projects.

I switch to DocBook XML because I find it easier to customize
stylesheets. Take a look at DSSSL and take a look at XSLT : DSSL just
hurts my eyes, I found XSLT more simple to understand *and* customize.
There's been a lot of posts on this subject on some of the Linux From
Scratch project. And they've done a pretty good job with HTML and PDF
builts. So I took their stylesheets, Makefiles. I tried to understand
what they do, customize a little more to get what the french team have now.

So first argument, the one that makes me switch, is ease of
customization. Second one is also really important to me. XSLT
stylesheets and xsltproc are under active development. You can take a
look here :
  http://sourceforge.net/project/showfiles.php?group_id=21935#files
DSSSL 1.79 has been packaged one 2004-11-03, XSL 1.71.1 has been
released on 2006-10-19. And don't get me started on tools. I've seen
recent releases of xsltproc, none of jade, openjade or any of those tools.

Last point : just look at other free projects. Take a look at Gnome and
KDE for example. They all use DocBook XML with customized stylesheets. I
don't mean that because lots of projects use DocBook XML, they all are
right and us wrong. I don't even mean that we should switch to follow
new buzz. I just mean people use it, publish information on what they do
and how they do it. Can someone find more informations on how to build
documentation with SGML than he can with XML ?

I don't want to say we can't do all these with DocBook SGML. Tom fixed
some stuff on openjade to make it process quickier. Great. We can also
customize DSSSL stylesheets. We can, we can... yes we can do a lot of
things with SGML and DSSSL. When I say "We", I should say "You" because
I don't want to get this burden. I mean, fixing a tool is great... but I
really prefer to see Tom working on PostgreSQL new astounding features
than on an old unmaintained and unused tool. I can't code as Tom. But I
can help on documentation with tools, but tools that are maintained. You
want to put more documentation on the manual but don't want to write it
with DocBook XML ? I'm OK with it. Just give me a plain ASCII file or an
openoffice file, I will handle the tagging, check correctness of the XML
file, build the documentation.

I say "I" but perhaps "we" would be more appropriate.

Perhaps it is time to create a real documentation team. More people will
be able to get involved with this great project.

--
Guillaume.
<!-- http://abs.traduc.org/
     http://lfs.traduc.org/
     http://docs.postgresqlfr.org/ -->

Re: Switching to XML

From

Alvaro Herrera

Date:

10 December 2006, 08:20:40

Joshua D. Drake wrote:
> On Fri, 2006-12-08 at 22:49 -0300, Mario wrote:

> >   And now in Chile, I wrote an app to translate postgres documentation
> > (l10n.postgresql.cl). I took SGML sources, transform to XML and to POT
> > files
> >
> >   The results are good! but is it not the best. To move from SGML to
> > XML is a hard work.
>
> I believe this point is being overlooked is people's willingness to code
> a solution ;).
>
> Mario could you please tell us *why* you ported the SGML to XML
> before doing your translation and processing? Perhaps that will carry
> more weight to this discussion then my hand waving.

The reason is that he uses xml2pot, which AFAIK only handles XML and not
SGML.

--
Alvaro Herrera                                http://www.CommandPrompt.com/
PostgreSQL Replication, Consulting, Custom Development, 24x7 support

Re: Switching to XML

From

David Blewett

Date:

10 December 2006, 10:41:47

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA256

Guillaume Lelarge wrote:
<snip>

> Second one is also really important to me. XSLT
> stylesheets and xsltproc are under active development. You can take a
> look here :
>   http://sourceforge.net/project/showfiles.php?group_id=21935#files
> DSSSL 1.79 has been packaged one 2004-11-03, XSL 1.71.1 has been
> released on 2006-10-19. And don't get me started on tools. I've seen
> recent releases of xsltproc, none of jade, openjade or any of those tools.

<snip>

>
> I don't want to say we can't do all these with DocBook SGML. Tom fixed
> some stuff on openjade to make it process quickier. Great. We can also
> customize DSSSL stylesheets. We can, we can... yes we can do a lot of
> things with SGML and DSSSL. When I say "We", I should say "You" because
> I don't want to get this burden. I mean, fixing a tool is great... but I
> really prefer to see Tom working on PostgreSQL new astounding features
> than on an old unmaintained and unused tool.

I think this is a good point. I've heard a saying in #postgresql on
IRC: "First rate database, second rate tools..." I think that is a
shame.

David
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.5 (MingW32)
Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org

iD8DBQFFfBx2Zmlc6wNjtLYRCOI4AJ9/v0+k3+zNDS/LjwPzYSM70ILBtwCeImbA
z8T/bF8aE/qOV1eUnHeFyCc=
=Hf45
-----END PGP SIGNATURE-----

Re: Switching to XML

From

David Blewett

Date:

10 December 2006, 10:44:55

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA256

Peter Eisentraut wrote:
> > The question here was by David Blewett about creating multiple
indexes.
> > Has anyone bothered to look into that?  My experimentation, by
the way,
> > shows that it works exactly as described in the cited document.

Peter:

Can you explain the process to me then? I couldn't figure out how to
 generate different HTML.index files, or how to split the one
generated into different volumeindex.sgml files. Can you enlighten me?

David

PS:
Sorry if this gets posted twice, I'm having issues this morning...
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.5 (MingW32)
Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org

iD8DBQFFfB01Zmlc6wNjtLYRCCeAAJ9DULf4miTCGljCNy37pm6YMaN5/ACgjELj
kplVuFLBstJeXFwM8N087u8=
=N3Zu
-----END PGP SIGNATURE-----

Re: Switching to XML

From

"Joshua D. Drake"

Date:

10 December 2006, 12:02:05

On Sun, 2006-12-10 at 09:43 +0100, Peter Eisentraut wrote:
> Joshua D. Drake wrote:
> > I do think the tools in general can be a little daunting and perhaps
> > a pgfoundry project that had a perl script that would download and
> > configure a self contained jail of tools for working with the
> > PostgreSQL docs would be good.
>
> Do you find any deficiencies in the setup instructions we have in the
> documentation?

Where are they in the documentation?

Sincerely,

Joshua D. Drake

>
--

      === The PostgreSQL Company: Command Prompt, Inc. ===
Sales/Support: +1.503.667.4564 || 24x7/Emergency: +1.800.492.2240
Providing the most comprehensive  PostgreSQL solutions since 1997
             http://www.commandprompt.com/

Donate to the PostgreSQL Project: http://www.postgresql.org/about/donate

Re: Switching to XML

From

"Joshua D. Drake"

Date:

10 December 2006, 12:08:52

On Sun, 2006-12-10 at 10:08 +0100, Peter Eisentraut wrote:
> David Fetter wrote:
> > Peter, if you have a working example of a GUI tool that can be used
> > with the SGML source in its current form, the burden of proof is on
> > you to demonstrate it.  Another flat assertion from you of some kind
> > of mathematical equivalence between SGML and XML will *not* do the
> > trick.
>
> Well, most of you are obviously led into twisting every single thread on
> this mailing list related to some sort of tool trouble into a thread
> that SGML is completely unusable for unrelated reasons such as that
> there are no editing tools or that it takes too long to build.  I have
> no interest in these sorts of uninformed discussions.

Bah! That is certainly not what is happening here. No one has said that
they are completely unusable. Even I have said they are usable but that
they can be daunting.

The point of this discussion comes down to some very simple things.

1. Tools. SGML tools are not as actively developed as the XML ones. No
you can not use XML tools with SGML as effectively.

If I am wrong on #1 Peter, fine. Prove it.  You keep telling us we are
wrong but are unwilling to share where or how.

2. We have two regional projects that take a lot of hard earned time to
work around the lackluster state of our source documentation, namely
SGML. This hard earned time would better be spent on other things and
could be if you would stop stonewalling and let us move to XML.

3. You are the only one that is aruging *for* staying SGML.

>
> The question here was by David Blewett about creating multiple indexes.
> Has anyone bothered to look into that?  My experimentation, by the way,
> shows that it works exactly as described in the cited document.

Which cited document? However this still doesn't solve #2

Lastly, even if I am completely cranked on #1 and #3, #2 is reason
enough to move to XML regardless of your opinion.

We need to move to XML for the regional projects.

Sincerely,

Joshua D. Drake

--

      === The PostgreSQL Company: Command Prompt, Inc. ===
Sales/Support: +1.503.667.4564 || 24x7/Emergency: +1.800.492.2240
Providing the most comprehensive  PostgreSQL solutions since 1997
             http://www.commandprompt.com/

Donate to the PostgreSQL Project: http://www.postgresql.org/about/donate

Re: Switching to XML

From

Peter Eisentraut

Date:

10 December 2006, 12:37:16

Joshua D. Drake wrote:
> 1. Tools. SGML tools are not as actively developed as the XML ones.
> No you can not use XML tools with SGML as effectively.
>
> If I am wrong on #1 Peter, fine. Prove it.  You keep telling us we
> are wrong but are unwilling to share where or how.

Well, such statements are quite impossible to reason about because
a "tool" can be just about anything.  There is no doubt that SGML tools
are not as actively developed as the XML ones.  That could, for
example, be because SGML is much older and stabilized and the tools
have matured.

But none of this interests me unless someone can come up with a specific
completion of the following sentence: "We need to do $ACTION because
that allows us to use tool $TOOL, which is more efficient at doing
$TASK than the currently used tool."

The only completion I have heard about so far is ACTION='switch source
to XML', TOOL='OpenOffice', and TASK='editing'.  Others have pointed
out that while OpenOffice can indeed edit DocBook XML, it wouldn't be
very useful for that purpose in our case.  But I'm interested in
hearing more about that.

> 2. We have two regional projects that take a lot of hard earned time
> to work around the lackluster state of our source documentation,
> namely SGML.

I know of one regional project that has done documentation work, the
French one.  (I'm sorry that I missed the other one.)  Their reason for
moving to XML was (a) to use FOP for (b) producing print output more
efficiently.  But you can use FOP right now, so (a) is not a good
reason, and we have fixed the print output generation now, so (b) isn't
a reason either.

> This hard earned time would better be spent on other
> things and could be if you would stop stonewalling and let us move to
> XML.

I'm not stonewalling anything.  If someone wants to prepare a case for
moving to XML, be my guest.  I'm just here to debunk the reasons for
moving to XML which are wrong.

--
Peter Eisentraut
http://developer.postgresql.org/~petere/

Re: Switching to XML

From

Peter Eisentraut

Date:

10 December 2006, 12:38:35

Joshua D. Drake wrote:
> > Do you find any deficiencies in the setup instructions we have in
> > the documentation?
>
> Where are they in the documentation?

Appendix G

--
Peter Eisentraut
http://developer.postgresql.org/~petere/

Re: Switching to XML

From

"Joshua D. Drake"

Date:

10 December 2006, 12:48:03

On Sun, 2006-12-10 at 17:37 +0100, Peter Eisentraut wrote:
> Joshua D. Drake wrote:
> > 1. Tools. SGML tools are not as actively developed as the XML ones.
> > No you can not use XML tools with SGML as effectively.
> >
> > If I am wrong on #1 Peter, fine. Prove it.  You keep telling us we
> > are wrong but are unwilling to share where or how.
>
> Well, such statements are quite impossible to reason about because
> a "tool" can be just about anything.  There is no doubt that SGML tools
> are not as actively developed as the XML ones.  That could, for
> example, be because SGML is much older and stabilized and the tools
> have matured.

O.k. again... what tools? Either provide some specific solutions or back
out of the conversation.

The standard tools used in the FOSS arena (specifically linux) for
Docbook are OpenJade and DSSSL. OpenJade sees very little development.

What tools do you use? Are they the same listed in Appendix G (I am
assuming so)

>
> But none of this interests me unless someone can come up with a specific
> completion of the following sentence: "We need to do $ACTION because
> that allows us to use tool $TOOL, which is more efficient at doing
> $TASK than the currently used tool."

We need to do migrate our docs to XML because that allows us to use
xml-po, which is more efficient at helping translation than the
currently used process.

Close enough?


> > 2. We have two regional projects that take a lot of hard earned time
> > to work around the lackluster state of our source documentation,
> > namely SGML.
>
> I know of one regional project that has done documentation work, the
> French one.  (I'm sorry that I missed the other one.)  Their reason for
> moving to XML was (a) to use FOP for (b) producing print output more
> efficiently.  But you can use FOP right now, so (a) is not a good
> reason, and we have fixed the print output generation now, so (b) isn't
> a reason either.

No, they also wanted easier stylesheet maintenance.

Sincerely,

Joshua D. Drake







--

      === The PostgreSQL Company: Command Prompt, Inc. ===
Sales/Support: +1.503.667.4564 || 24x7/Emergency: +1.800.492.2240
Providing the most comprehensive  PostgreSQL solutions since 1997
             http://www.commandprompt.com/

Donate to the PostgreSQL Project: http://www.postgresql.org/about/donate

Re: Switching to XML

From

"Joshua D. Drake"

Date:

10 December 2006, 12:49:44

On Sun, 2006-12-10 at 17:37 +0100, Peter Eisentraut wrote:

> I'm not stonewalling anything.  If someone wants to prepare a case for
> moving to XML, be my guest.  I'm just here to debunk the reasons for
> moving to XML which are wrong.
>

O.k. how about this:

We want to move to XML. We have community members that want to move to
XML. We have regional projects that want to move to XML.

You can not debunk any of those reasons.

The fact is, they want to.

Until you can come up with a reason why we *shouldn't* move to XML. I
believe this issue is settled.

Sincerely,

Joshua D. Drake

--

      === The PostgreSQL Company: Command Prompt, Inc. ===
Sales/Support: +1.503.667.4564 || 24x7/Emergency: +1.800.492.2240
Providing the most comprehensive  PostgreSQL solutions since 1997
             http://www.commandprompt.com/

Donate to the PostgreSQL Project: http://www.postgresql.org/about/donate

Re: Switching to XML

From

"Joshua D. Drake"

Date:

10 December 2006, 12:50:32

On Sun, 2006-12-10 at 17:38 +0100, Peter Eisentraut wrote:
> Joshua D. Drake wrote:
> > > Do you find any deficiencies in the setup instructions we have in
> > > the documentation?
> >
> > Where are they in the documentation?
>
> Appendix G

These are good instruction, but not anywhere as simple as:

Download this script.
Run this script.
type make, make pdf, make rtf etc...


Sincerely,

Joshua D. Drake



>
--

      === The PostgreSQL Company: Command Prompt, Inc. ===
Sales/Support: +1.503.667.4564 || 24x7/Emergency: +1.800.492.2240
Providing the most comprehensive  PostgreSQL solutions since 1997
             http://www.commandprompt.com/

Donate to the PostgreSQL Project: http://www.postgresql.org/about/donate

Re: Switching to XML

From

Guillaume Lelarge

Date:

10 December 2006, 12:54:00

Peter Eisentraut a écrit :
> Joshua D. Drake wrote:
>> 2. We have two regional projects that take a lot of hard earned time
>> to work around the lackluster state of our source documentation,
>> namely SGML.
>
> I know of one regional project that has done documentation work, the
> French one.  (I'm sorry that I missed the other one.)  Their reason for
> moving to XML was (a) to use FOP for (b) producing print output more
> efficiently.  But you can use FOP right now, so (a) is not a good
> reason, and we have fixed the print output generation now, so (b) isn't
> a reason either.
>

We didn't switch to use FOP. We switched to build better PDF files
(better --> prettier). Perhaps we could have done the same with SGML but
I had a great way to do it with XML and XSLT (and FOP), a way I can
understand, customize, enhance, etc.

The last PDF available on the website has some bad issues. Just take a
look at table 8-13 (page 805), table 8-18 (page 113). One more time, I
don't say you can't fix this in SGML (and I think we already talk of
this some time ago). But *I* won't do it. And I think some people are
willing to work on this but they also want to have XML documentation.

Perhaps we can switch to XML just because people who want to get
involved with PostgreSQL documentation want to use XML ?

>> This hard earned time would better be spent on other
>> things and could be if you would stop stonewalling and let us move to
>> XML.
>
> I'm not stonewalling anything.  If someone wants to prepare a case for
> moving to XML, be my guest.  I'm just here to debunk the reasons for
> moving to XML which are wrong.
>

--
Guillaume.
<!-- http://abs.traduc.org/
     http://lfs.traduc.org/
     http://docs.postgresqlfr.org/ -->

Re: Switching to XML

From

Tom Lane

Date:

10 December 2006, 13:11:43

"Joshua D. Drake" <jd@commandprompt.com> writes:
> Until you can come up with a reason why we *shouldn't* move to XML. I
> believe this issue is settled.

How about "the people who are actually doing the work today don't
want to change to XML"?

I'm not very excited by hopes that all sorts of documentation manpower
will suddenly materialize if only the docs were in XML not SGML.  *That*
is the part of this argument for which no evidence has been produced.
Where are the people who know the code well enough to document it
but refuse to work on the docs because they're not XML?

(And no, Peter is not the only one objecting.)

            regards, tom lane

Re: Switching to XML

From

"Joshua D. Drake"

Date:

10 December 2006, 13:21:39

On Sun, 2006-12-10 at 12:11 -0500, Tom Lane wrote:
> "Joshua D. Drake" <jd@commandprompt.com> writes:
> > Until you can come up with a reason why we *shouldn't* move to XML. I
> > believe this issue is settled.
>
> How about "the people who are actually doing the work today don't
> want to change to XML"?

O.k. that's obviously reasonable enough but nobody has stepped up and
said. "Hey, I am doing the work here, I don't want to use XML."

I can easily provide the counter argument that says, "Why don't the
users of SGML want to migrate to XML?" Not one person has stepped up and
said it will cause me undue amounts of grief, pain, or even... I just
hate XML.

>
> I'm not very excited by hopes that all sorts of documentation manpower
> will suddenly materialize if only the docs were in XML not SGML.

I am not a believe of suddenly materializing manpower either ;). However
I do know that the regional projects would be greatly appreciative.

>   *That*
> is the part of this argument for which no evidence has been produced.
> Where are the people who know the code well enough to document it
> but refuse to work on the docs because they're not XML?

Where are the people that know the code well enough to document it but
refuse to work on the docs because they are XML?

I am not trying to be sarcastic here, honestly but not even in this
email have you said: "I don't like XML, I don't want to move to it
because..."

>
> (And no, Peter is not the only one objecting.)

O.k. why then? Why are you objecting? Peter has yet to provide a reason
either?

What we have here is a dozen or so people that want to move to XML and
have stated publicly that they want to do so, as well as the reasons
why.

We also have two people who have said, that they want to stay with SGML
but have provided zero reason why.

Sincerely,

Joshua D. Drake

>
>             regards, tom lane
>
--

      === The PostgreSQL Company: Command Prompt, Inc. ===
Sales/Support: +1.503.667.4564 || 24x7/Emergency: +1.800.492.2240
Providing the most comprehensive  PostgreSQL solutions since 1997
             http://www.commandprompt.com/

Donate to the PostgreSQL Project: http://www.postgresql.org/about/donate

Re: Switching to XML

From

Peter Eisentraut

Date:

10 December 2006, 13:23:21

Joshua D. Drake wrote:
> O.k. again... what tools? Either provide some specific solutions or
> back out of the conversation.

No, that is up to you, because you started handwaving about "tools".  I
know my tools, but then I'm not the one complaining that we don't have
enough or the right ones or whatever.

> We need to do migrate our docs to XML because that allows us to use
> xml-po, which is more efficient at helping translation than the
> currently used process.

Well, that seems reasonable, but I looked around for something
called "xmlpo" or "xml-po", but I could find neither a binary nor a
package by that name on my system.  Googling suggests that it is a
KDE-specific tool.  Can someone clarify?

> No, they also wanted easier stylesheet maintenance.

$EDITOR stylesheet.xsl

--
Peter Eisentraut
http://developer.postgresql.org/~petere/

Re: Switching to XML

From

Peter Eisentraut

Date:

10 December 2006, 13:25:21

Joshua D. Drake wrote:

> We want to move to XML. We have community members that want to move
> to XML. We have regional projects that want to move to XML.

"Because I want to" is not a reason in my mind.

> Until you can come up with a reason why we *shouldn't* move to XML. I
> believe this issue is settled.

I have previously listed the issues in favor and against.  Check the
start of this thread.

--
Peter Eisentraut
http://developer.postgresql.org/~petere/

Re: Switching to XML

From

Peter Eisentraut

Date:

10 December 2006, 13:26:41

Joshua D. Drake wrote:
> These are good instruction, but not anywhere as simple as:
>
> Download this script.
> Run this script.
> type make, make pdf, make rtf etc...

Then start writing that script.

--
Peter Eisentraut
http://developer.postgresql.org/~petere/

Re: Switching to XML

From

Peter Eisentraut

Date:

10 December 2006, 13:28:56

Guillaume Lelarge wrote:
> We didn't switch to use FOP. We switched to build better PDF files
> (better --> prettier). Perhaps we could have done the same with SGML
> but I had a great way to do it with XML and XSLT (and FOP), a way I
> can understand, customize, enhance, etc.

Well, we've got XSLT stylesheets, so use them.

--
Peter Eisentraut
http://developer.postgresql.org/~petere/

Re: Switching to XML

From

"Joshua D. Drake"

Date:

10 December 2006, 13:38:16

On Sun, 2006-12-10 at 18:26 +0100, Peter Eisentraut wrote:
> Joshua D. Drake wrote:
> > These are good instruction, but not anywhere as simple as:
> >
> > Download this script.
> > Run this script.
> > type make, make pdf, make rtf etc...
>
> Then start writing that script.

I plan on it. That is why I brought it up.

Joshua D. Drake

>
--

      === The PostgreSQL Company: Command Prompt, Inc. ===
Sales/Support: +1.503.667.4564 || 24x7/Emergency: +1.800.492.2240
Providing the most comprehensive  PostgreSQL solutions since 1997
             http://www.commandprompt.com/

Donate to the PostgreSQL Project: http://www.postgresql.org/about/donate

Re: Switching to XML

From

"Joshua D. Drake"

Date:

10 December 2006, 13:40:19

>
> > We need to do migrate our docs to XML because that allows us to use
> > xml-po, which is more efficient at helping translation than the
> > currently used process.
>
> Well, that seems reasonable, but I looked around for something
> called "xmlpo" or "xml-po", but I could find neither a binary nor a
> package by that name on my system.  Googling suggests that it is a
> KDE-specific tool.  Can someone clarify?

Sorry, xml2pot


Sincerely,

Joshua D. Drake


>
--

      === The PostgreSQL Company: Command Prompt, Inc. ===
Sales/Support: +1.503.667.4564 || 24x7/Emergency: +1.800.492.2240
Providing the most comprehensive  PostgreSQL solutions since 1997
             http://www.commandprompt.com/

Donate to the PostgreSQL Project: http://www.postgresql.org/about/donate

Re: Switching to XML

From

"Joshua D. Drake"

Date:

10 December 2006, 13:49:41

> > Until you can come up with a reason why we *shouldn't* move to XML. I
> > believe this issue is settled.
>
> I have previously listed the issues in favor and against.  Check the
> start of this thread.

The only thing I see is that you don't want to move because all the
tools that are available for docbook sgml are also available for docbook
xml.

Am I missing anything?

Joshua D. Drake


>
--

      === The PostgreSQL Company: Command Prompt, Inc. ===
Sales/Support: +1.503.667.4564 || 24x7/Emergency: +1.800.492.2240
Providing the most comprehensive  PostgreSQL solutions since 1997
             http://www.commandprompt.com/

Donate to the PostgreSQL Project: http://www.postgresql.org/about/donate

Re: Switching to XML

From

Peter Eisentraut

Date:

10 December 2006, 14:07:18

Joshua D. Drake wrote:
> The only thing I see is that you don't want to move because all the
> tools that are available for docbook sgml are also available for
> docbook xml.

Link posted by David Blewett:
http://groups.google.com/group/pgsql.docs/browse_thread/thread/37ff3a011bb705d7
--
Peter Eisentraut
http://developer.postgresql.org/~petere/

Re: Switching to XML

From

"Joshua D. Drake"

Date:

10 December 2006, 14:18:24

On Sun, 2006-12-10 at 19:07 +0100, Peter Eisentraut wrote:
> Joshua D. Drake wrote:
> > The only thing I see is that you don't want to move because all the
> > tools that are available for docbook sgml are also available for
> > docbook xml.
>
> Link posted by David Blewett:
> http://groups.google.com/group/pgsql.docs/browse_thread/thread/37ff3a011bb705d7

Ahh that would explain why I couldn't find it. :). Thanks.

Sincerely,

Joshua D. Drake


--

      === The PostgreSQL Company: Command Prompt, Inc. ===
Sales/Support: +1.503.667.4564 || 24x7/Emergency: +1.800.492.2240
Providing the most comprehensive  PostgreSQL solutions since 1997
             http://www.commandprompt.com/

Donate to the PostgreSQL Project: http://www.postgresql.org/about/donate

Re: Switching to XML

From

Peter Eisentraut

Date:

10 December 2006, 16:40:27

Joshua D. Drake wrote:
> Sorry, xml2pot

OK, found it.  Works fine.

--
Peter Eisentraut
http://developer.postgresql.org/~petere/

Re: Switching to XML

From

Mario

Date:

10 December 2006, 17:30:02

On 10/12/06, Joshua D. Drake <jd@commandprompt.com> wrote:
>
> >
> > Well, that seems reasonable, but I looked around for something
> > called "xmlpo" or "xml-po", but I could find neither a binary nor a
> > package by that name on my system.  Googling suggests that it is a
> > KDE-specific tool.  Can someone clarify?
>
> Sorry, xml2pot
>

  no, is xml2po  and you can find in debian in a package called gnome-doc-utils

>

--
http://www.advogato.org/person/mgonzalez/

Re: Switching to XML

From

Mario

Date:

10 December 2006, 18:30:41

On 10/12/06, Alvaro Herrera <alvherre@commandprompt.com> wrote:
> >
> > Mario could you please tell us *why* you ported the SGML to XML
> > before doing your translation and processing? Perhaps that will carry
> > more weight to this discussion then my hand waving.
>
> The reason is that he uses xml2pot, which AFAIK only handles XML and not
> SGML.
>

  Yes, that's right, I used and I'm using xml2po. Cause, we wanted to
create an easy way to translate manuals and the only tool I know for
that is xml2po. That tool AFAIK works with XML only, that's way I did
that port and also we wanted to translate online, so the current
configuration is

  SGML -> XML -> POT -> WEBAPP -> PO_TRANSLATED -> XML_TRANSLATED

  AND from the XML_TRANSLATED we are able to (thanks to openjade)
generate the HTML files in +10 minutes

>

--
http://www.advogato.org/person/mgonzalez/

Re: Switching to XML

From

David Blewett

Date:

10 December 2006, 21:18:49

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA256

Peter Eisentraut wrote:
> But none of this interests me unless someone can come up with a specific
> completion of the following sentence: "We need to do $ACTION because
> that allows us to use tool $TOOL, which is more efficient at doing
> $TASK than the currently used tool."

Sorry to butt in, but I'm still not sure how to do multiple indexes
in one volume using the DSSSL stylesheets. The docs I pointed to
were for the XSL stylesheets. After some discussion in #docbook, it
was said that it was a recent addition to those stylesheets.

David
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.5 (MingW32)
Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org

iD8DBQFFfLHPZmlc6wNjtLYRCGp8AJ0aVEbI/Ztlx5UsAY7KSW7vWz6IqACgudT5
VuD8FcZzajFUMNgAdPCPXJE=
=AD66
-----END PGP SIGNATURE-----

Re: Switching to XML

From

Peter Eisentraut

Date:

11 December 2006, 05:13:46

Am Montag, 11. Dezember 2006 02:18 schrieb David Blewett:
> Sorry to butt in, but I'm still not sure how to do multiple indexes
> in one volume using the DSSSL stylesheets.

I'm not sure either.

> The docs I pointed to were for the XSL stylesheets.

Right.  So using the XSL(T) stylesheet instead of the DSSSL stylesheet would
be a step in the right direction.  Actually the 8.2 branch currently only has
an XSLT stylesheet for HTML output.  In 8.3devel I've added one for XSL-FO
output, which you can invoke by "make postgres-{A4|US}.fo".  (You will
probably want yet a different paper format for your book.)

--
Peter Eisentraut
http://developer.postgresql.org/~petere/

Re: Authoring Tools WAS: Switching to XML

From

Josh Berkus

Date:

11 December 2006, 13:54:39

Folks,

Tom and Peter correctly point out that discussion of production tools and
authoring tools are separate, and only come together if there are tools for
XML which solve both issues, an assertion which is not yet proven.

I am one of the champions of XML simply because I know for a fact that there
are no real authoring tools* for SGML.  However, I do *not* know that the
tools for XML produce acceptable output.

So, questions to answer:

1) Are there enhanced tools for Docbook XML, WYSWYG or otherwise, which make
doc authoring easier and produce correct output for PostgreSQL Docs?
1) a) is there some way we can try various tools and check output?

2) If (1), would switching to XML make doc writing with Emacs/VIM harder?

3) If (1), will having authoring tools available increase the number of doc
contributors?

Regarding (3), I believe that having authoring tools would increase our pool
based on two things:
a) Anectodal evidence, including several people (like me) who were prevented
from doing significant doc clean-up by the amount of labor required to
restructure large SGML documents.
b) That the vast majority of the OSS world is no longer using SGML, meaning
that even for hand-editing people who work on other projects will be more
familiar with XML and thus better able to edit it.

While these two may not be enough to answer "should we migrate", they are
enough to make it worth trying to find the answer to (1), since if all of the
Docbook XML authoring tools are crappy the whole discussion is moot.

So, can someone help me come up with a way to validate a Docbook XML version
of the PostgreSQL docs so that I (and others, hopefully) can start testing
tools?


* = an authoring tool is one which makes generation of the document
easier/faster than hand-editing text files.  No such tool exists for SGML --
even the Emacs toolkit merely does validation.

--
Josh Berkus
PostgreSQL @ Sun
San Francisco

Re: Authoring Tools WAS: Switching to XML

From

"Joshua D. Drake"

Date:

11 December 2006, 14:11:09

> So, questions to answer:
>
> 1) Are there enhanced tools for Docbook XML, WYSWYG or otherwise, which make
> doc authoring easier and produce correct output for PostgreSQL Docs?

Anything that you produce from a WYSWYG editor is going to have to be
massaged to work with PostgreSQL.Org docs.

> 1) a) is there some way we can try various tools and check output?

OpenOffice.Org, WordPerfect, Kword, Abiword. The first two (last I
checked) were the only mature software supporting Docbook output.

>
> 2) If (1), would switching to XML make doc writing with Emacs/VIM harder?

Absolutely not.

>
> 3) If (1), will having authoring tools available increase the number of doc
> contributors?

It might. However, getting authors will always be difficult. Having
clear direction from the website on how to contribute documentation.
This should include how to contribute if you don't know Docbook. Right
now, you have to dig all the way to Appendix G (who reads apepedixes?).

Something like this would also be helpful:

http://www.tldp.org/LDP/LDP-Author-Guide/html/index.html


> Regarding (3), I believe that having authoring tools would increase our pool
> based on two things:
> a) Anectodal evidence, including several people (like me) who were prevented
> from doing significant doc clean-up by the amount of labor required to
> restructure large SGML documents.

Well to be honest, there isn't really anyway around that that I know of.
OpenOffice.Org will allow you to do what you want, but someone is going
to have to clean up after you.

Now, that is not neccessarily a road block. It is certainly plausible
that someone could write a section (although I think plain text would be
better here) or rewrite a section and submit it for factoring. Factoring
would involve any Docbook that needed to be done.

Usually the Docbook that is required is trivial.


> b) That the vast majority of the OSS world is no longer using SGML, meaning
> that even for hand-editing people who work on other projects will be more
> familiar with XML and thus better able to edit it.

Well that is certainly true.

> So, can someone help me come up with a way to validate a Docbook XML version
> of the PostgreSQL docs so that I (and others, hopefully) can start testing
> tools?

From the source tree:

make postgres.xml

Then validate as you see fit. Just FYI, it does validate.


>
>
> * = an authoring tool is one which makes generation of the document
> easier/faster than hand-editing text files.  No such tool exists for SGML --
> even the Emacs toolkit merely does validation.
>

What is it specifically you are looking for then? Because that is what
Docbook is, XML (or sgml) the only thing you need to do is validate.

http://www.commandprompt.com/community/emacs_sgml/

If you use OpenOffice you will have to set up a style to correct mirror
the PostgreSQL Docbook usage.

Is that perhaps useful? An OpenOffice style that people could download
that is documented and can be used for creating reasonably useful
Docbook for the community?


Sincerely,

Joshua D. Drake




--

      === The PostgreSQL Company: Command Prompt, Inc. ===
Sales/Support: +1.503.667.4564 || 24x7/Emergency: +1.800.492.2240
Providing the most comprehensive  PostgreSQL solutions since 1997
             http://www.commandprompt.com/

Donate to the PostgreSQL Project: http://www.postgresql.org/about/donate

Re: Switching to XML

From

David Blewett

Date:

11 December 2006, 14:12:39

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA256

Peter Eisentraut wrote:
> Right.  So using the XSL(T) stylesheet instead of the DSSSL stylesheet would
> be a step in the right direction.  Actually the 8.2 branch currently only has
> an XSLT stylesheet for HTML output.  In 8.3devel I've added one for XSL-FO
> output, which you can invoke by "make postgres-{A4|US}.fo".  (You will
> probably want yet a different paper format for your book.)

Excuse my ignorance, but I thought XSLT stylesheets can only be used
on XML files?

David Blewett

-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.5 (MingW32)
Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org

iD8DBQFFfZ8KZmlc6wNjtLYRCEsTAJ0YF/5/enV2w7ibvjSYENTdKjZs3QCgophl
tZVTZsrFceHD3f+KKn6Drqo=
=zhwv
-----END PGP SIGNATURE-----

Re: Switching to XML

From

"Joshua D. Drake"

Date:

11 December 2006, 14:19:36

On Mon, 2006-12-11 at 13:10 -0500, David Blewett wrote:
> -----BEGIN PGP SIGNED MESSAGE-----
> Hash: SHA256
>
> Peter Eisentraut wrote:
> > Right.  So using the XSL(T) stylesheet instead of the DSSSL stylesheet would
> > be a step in the right direction.  Actually the 8.2 branch currently only has
> > an XSLT stylesheet for HTML output.  In 8.3devel I've added one for XSL-FO
> > output, which you can invoke by "make postgres-{A4|US}.fo".  (You will
> > probably want yet a different paper format for your book.)
>
> Excuse my ignorance, but I thought XSLT stylesheets can only be used
> on XML files?

AFAIK they can only be used on XML files, but you can get XML from the
postgresql sgml sources by using make postgres.xml .

Of course your changes that you make from release to release for volumes
won't be permanent and we will continue to waste time because people
have an odd affection for SGML like it was some kind of pet dog.

Joshua D. Drake


>
> David Blewett
>
> -----BEGIN PGP SIGNATURE-----
> Version: GnuPG v1.4.5 (MingW32)
> Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org
>
> iD8DBQFFfZ8KZmlc6wNjtLYRCEsTAJ0YF/5/enV2w7ibvjSYENTdKjZs3QCgophl
> tZVTZsrFceHD3f+KKn6Drqo=
> =zhwv
> -----END PGP SIGNATURE-----
>
> ---------------------------(end of broadcast)---------------------------
> TIP 4: Have you searched our list archives?
>
>                http://archives.postgresql.org
>
--

      === The PostgreSQL Company: Command Prompt, Inc. ===
Sales/Support: +1.503.667.4564 || 24x7/Emergency: +1.800.492.2240
Providing the most comprehensive  PostgreSQL solutions since 1997
             http://www.commandprompt.com/

Donate to the PostgreSQL Project: http://www.postgresql.org/about/donate

Re: Authoring Tools WAS: Switching to XML

From

Tom Lane

Date:

11 December 2006, 14:34:11

"Joshua D. Drake" <jd@commandprompt.com> writes:
>> 1) a) is there some way we can try various tools and check output?

> OpenOffice.Org, WordPerfect, Kword, Abiword. The first two (last I
> checked) were the only mature software supporting Docbook output.

I tried opening postgres.xml in OpenOffice 2.0, and it just showed me
the raw text and markup --- no indication that it understood xml at all.
Is there some special incantation needed?

            regards, tom lane

Re: Authoring Tools WAS: Switching to XML

From

"Joshua D. Drake"

Date:

11 December 2006, 14:38:45

On Mon, 2006-12-11 at 13:34 -0500, Tom Lane wrote:
> "Joshua D. Drake" <jd@commandprompt.com> writes:
> >> 1) a) is there some way we can try various tools and check output?
>
> > OpenOffice.Org, WordPerfect, Kword, Abiword. The first two (last I
> > checked) were the only mature software supporting Docbook output.
>
> I tried opening postgres.xml in OpenOffice 2.0, and it just showed me
> the raw text and markup --- no indication that it understood xml at all.
> Is there some special incantation needed?

I found that I had to remove the <? xml declartion and only have the
docbook dtd declartion.

Warning, it is ugly :) but it does indeed work.

Sincerely,

Joshua D. Drake


>
>             regards, tom lane
>
--

      === The PostgreSQL Company: Command Prompt, Inc. ===
Sales/Support: +1.503.667.4564 || 24x7/Emergency: +1.800.492.2240
Providing the most comprehensive  PostgreSQL solutions since 1997
             http://www.commandprompt.com/

Donate to the PostgreSQL Project: http://www.postgresql.org/about/donate

Re: Switching to XML

From

"Joshua D. Drake"

Date:

11 December 2006, 14:46:26

On Sun, 2006-12-10 at 19:07 +0100, Peter Eisentraut wrote:
> Joshua D. Drake wrote:
> > The only thing I see is that you don't want to move because all the
> > tools that are available for docbook sgml are also available for
> > docbook xml.
>
> Link posted by David Blewett:
> http://groups.google.com/group/pgsql.docs/browse_thread/thread/37ff3a011bb705d7

Per this link Peter you cite:
"""
- Marked sections don't work anymore; would need to use DocBook-specific
profiling mechanism, which isn't all that elegant.

- More typing: Things like <abc>foo</> and other abbreviations won't
work anymore; all attributes would need to be quoted, etc.

- doc/src/sgml/*.sgml will look silly for filenames.

- Someone would need to do the conversion.  I understand that the
French
translation team might have patches available.

So that is it.  In my mind, there is no clear winner, but if someone has
a concrete need for XML, I don't see a problem with it.
"""

Your first disadvantage I can not comment on.

Your second disadvantage is fixed with any reasonable editor including
emacs.

Your third comment is a filename change.

Your fourth is resolved, we have the conversion per make postgres.xml.
We could even reasonably break it all out again.

I would add to advantages that:

1. We wouldn't have to have this discussion every six months :)

2. It will make life easier for regional projects as they are already
going through the pain of moving to XML for each release.

3. It would help some sub projects including the printed documentation
project that Blewitt and I are working on because we could submit our
patches as XML directly back to the community.

Lastly: There are several who have mentioned a concrete need. Is the
need solved with the existing tools? Yes, but not without extra steps.

So I would ask that you either give us further reason why you don't want
to move to XML or honor what you say above, in that you don't have a
problem with it.

Sincerely,

Joshua D. Drake

--

      === The PostgreSQL Company: Command Prompt, Inc. ===
Sales/Support: +1.503.667.4564 || 24x7/Emergency: +1.800.492.2240
Providing the most comprehensive  PostgreSQL solutions since 1997
             http://www.commandprompt.com/

Donate to the PostgreSQL Project: http://www.postgresql.org/about/donate

Re: Authoring Tools WAS: Switching to XML

From

Tom Lane

Date:

11 December 2006, 15:08:12

"Joshua D. Drake" <jd@commandprompt.com> writes:
> On Mon, 2006-12-11 at 13:34 -0500, Tom Lane wrote:
>> I tried opening postgres.xml in OpenOffice 2.0, and it just showed me
>> the raw text and markup --- no indication that it understood xml at all.
>> Is there some special incantation needed?

> I found that I had to remove the <? xml declartion and only have the
> docbook dtd declartion.

> Warning, it is ugly :) but it does indeed work.

Only for exceedingly small values of "work".  I tried that, and got
a display showing all the text run together in a single paragraph.
I changed one word and re-saved, and got a file that had basically
nothing to do with the original --- it's been arbitrarily reformatted,
and most of the original markup is gone.

Perhaps later versions are better, but the OO version currently available
in Fedora 5 appears utterly useless for our purposes.

Can you show me an authoring tool that does *not* think it's OK to
mangle the low-level text in "semantically irrelevant" ways?  We might
as well forget about the concept of a doc patch if people start using
tools like this --- if a one-word change results in diffs across the
whole file, the tool is not usable.

            regards, tom lane

Re: Authoring Tools WAS: Switching to XML

From

"Joshua D. Drake"

Date:

11 December 2006, 15:21:40

> Can you show me an authoring tool that does *not* think it's OK to
> mangle the low-level text in "semantically irrelevant" ways?

No. :) I mentioned this previously. Any word processor is going to blow
stuff away in an ugly way. The closest we could get is:

Create a custom style in OpenOffice that works in a similar fashion to
PostgreSQL.

Use Tidy and or a custom script to clean after the fact.

IMHO the only tools for Docbook that are usable are editors, preferrably
validating editors such as Emacs, Bluefish (actually Word Perfect does
this too). We are not going to get what we want out of WYSIWYG.


>   We might
> as well forget about the concept of a doc patch if people start using
> tools like this --- if a one-word change results in diffs across the
> whole file, the tool is not usable.

You are correct, we would have to run some kind of clean script and then
diff between the two versions.


Sincerely,

Joshua D. Drake


--

      === The PostgreSQL Company: Command Prompt, Inc. ===
Sales/Support: +1.503.667.4564 || 24x7/Emergency: +1.800.492.2240
Providing the most comprehensive  PostgreSQL solutions since 1997
             http://www.commandprompt.com/

Donate to the PostgreSQL Project: http://www.postgresql.org/about/donate

Re: Authoring Tools WAS: Switching to XML

From

Josh Berkus

Date:

11 December 2006, 15:25:34

Josh,

> Anything that you produce from a WYSWYG editor is going to have to be
> massaged to work with PostgreSQL.Org docs.

*sigh* too bad Lyx only writes DocBook and doesn't read it.

> > * = an authoring tool is one which makes generation of the document
> > easier/faster than hand-editing text files.  No such tool exists for
> > SGML -- even the Emacs toolkit merely does validation.
>
> What is it specifically you are looking for then? Because that is what
> Docbook is, XML (or sgml) the only thing you need to do is validate.

"Authoring Tool" means "not always hand-editing tags".  Right now, I can't
do anything with Emacs SGML that I couldn't do with Wordpad or Pico,
except validate.

Let me give you an example of a doc change which needing to hand-edit SGML
prevented me from making:  I wanted to clean up runtime.sgml by adding an
alpha index, and clearly breaking out data like defaults, set time, etc.
This meant changing 70% of the document, including some significant
re-arranging.

After 5 hours of editing, I discovered that I was only 1/3 done ... and
that there had been incompatible changes in CVS in the meantime.  I gave
up.

If the docs were WYSWYG-editable or if we had any authoring tool which
would automatically fix tags, I would have been able to make the changes
and submit a patch in < 4 hours, not 15 hours+ it would have taken to
complete the project.

--
--Josh

Josh Berkus
PostgreSQL @ Sun
San Francisco

Re: Authoring Tools WAS: Switching to XML

From

"Joshua D. Drake"

Date:

11 December 2006, 15:34:14

> "Authoring Tool" means "not always hand-editing tags".  Right now, I can't
> do anything with Emacs SGML that I couldn't do with Wordpad or Pico,
> except validate.

That is certainly not true. Emacs will correctly associate your tags
without having to write those tags, if you use Emacs correctly. As will
Bluefish.

http://www.commandprompt.com/community/emacs_sgml/x108

Bluefish is a little easier with this IMHO but it gives the same
capabilities.

>
> Let me give you an example of a doc change which needing to hand-edit SGML
> prevented me from making:  I wanted to clean up runtime.sgml by adding an
> alpha index, and clearly breaking out data like defaults, set time, etc.
> This meant changing 70% of the document, including some significant
> re-arranging.
>
> After 5 hours of editing, I discovered that I was only 1/3 done ... and
> that there had been incompatible changes in CVS in the meantime.  I gave
> up.

Well the CVS issue will be one that always exists.

>
> If the docs were WYSWYG-editable or if we had any authoring tool which
> would automatically fix tags,

O.k. well you will you lost my support here. I guarantee you I would not
submit patches if I had to work with problems associated with doing
things in OpenOffice.Org format. I would not have a problem helping get
a clean up script together that would work with Docbook output from
OpenOffice.org though.

Docbook is specifically designed for technical documentation.


>  I would have been able to make the changes
> and submit a patch in < 4 hours, not 15 hours+ it would have taken to
> complete the project.

Wait... You want a tool to automatically fix tags? You do need to know
Docbook to write the docs JoshB. That means you need to know which tags
are relevant to what. Something like emacs will make this easier because
it will tell you what tags are valid for each section of the document
you are in.

Sincerely,

Joshua D. Drake



>
--

      === The PostgreSQL Company: Command Prompt, Inc. ===
Sales/Support: +1.503.667.4564 || 24x7/Emergency: +1.800.492.2240
Providing the most comprehensive  PostgreSQL solutions since 1997
             http://www.commandprompt.com/

Donate to the PostgreSQL Project: http://www.postgresql.org/about/donate

Re: Switching to XML

From

Tom Lane

Date:

11 December 2006, 15:54:08

"Joshua D. Drake" <jd@commandprompt.com> writes:
> Your fourth is resolved, we have the conversion per make postgres.xml.

We do not have that --- have you looked at the output?  It's nigh
unreadable.  Same problem as with the authoring tools: that software
doesn't think that formatting of the XML source text is important.

            regards, tom lane

Re: Switching to XML

From

"Joshua D. Drake"

Date:

11 December 2006, 15:57:22

On Mon, 2006-12-11 at 14:53 -0500, Tom Lane wrote:
> "Joshua D. Drake" <jd@commandprompt.com> writes:
> > Your fourth is resolved, we have the conversion per make postgres.xml.
>
> We do not have that --- have you looked at the output?  It's nigh
> unreadable.

You are correct which is actually the argument that Gleu made to me
about moving to XML. I did find however that if you ran postgres.xml
through tidy it cleaned things up nicely.

Secondly the make postgres.xml would be a one time thing. I would
happilly take the time to get postgres.xml into a format that was
reasonable and similar to how we currently do the postgres.sgml docs.

>   Same problem as with the authoring tools: that software
> doesn't think that formatting of the XML source text is important.

Well not authoring editors though. Emacs or Bluefish won't blow that
stuff away (just realized Bluefish doesn't validate :( ).

>
>             regards, tom lane
>
--

      === The PostgreSQL Company: Command Prompt, Inc. ===
Sales/Support: +1.503.667.4564 || 24x7/Emergency: +1.800.492.2240
Providing the most comprehensive  PostgreSQL solutions since 1997
             http://www.commandprompt.com/

Donate to the PostgreSQL Project: http://www.postgresql.org/about/donate

Re: Authoring Tools WAS: Switching to XML

From

Josh Berkus

Date:

11 December 2006, 16:05:18

Josh,

> Wait... You want a tool to automatically fix tags? You do need to know
> Docbook to write the docs JoshB. That means you need to know which tags
> are relevant to what. Something like emacs will make this easier because
> it will tell you what tags are valid for each section of the document
> you are in.

Sure, but that doesn't do any good if you want to cut-and-paste a section
and fix the tags.

Anyway, from the sound of it whether or not there are better authoring
tools for XML than SGML is "undetermined pending further investigation."

--
--Josh

Josh Berkus
PostgreSQL @ Sun
San Francisco

Re: Switching to XML

From

Tom Lane

Date:

11 December 2006, 16:07:44

"Joshua D. Drake" <jd@commandprompt.com> writes:
> Secondly the make postgres.xml would be a one time thing.

Agreed, the cost of conversion is one-time ... but it's not small.
Aside from getting the files themselves converted, there's the effort
for people to find, install, and learn suitable tools, not to mention
come up to speed on the differences between xml and sgml.  (I assume
there are some significant ones, else why are we having this discussion?)

The real problem here is that you've still failed to establish any
sizable benefit from converting.  As best I can tell at the moment,
the acceptable options for editing XML will be about the same as they
are for SGML: emacs, and not a lot else.  I don't really see why I
should have to start spelling out every closing tag for a no-op like
that.

            regards, tom lane

Re: Switching to XML

From

"Joshua D. Drake"

Date:

11 December 2006, 16:41:42

On Mon, 2006-12-11 at 15:07 -0500, Tom Lane wrote:
> "Joshua D. Drake" <jd@commandprompt.com> writes:
> > Secondly the make postgres.xml would be a one time thing.
>
> Agreed, the cost of conversion is one-time ... but it's not small.
> Aside from getting the files themselves converted, there's the effort
> for people to find, install, and learn suitable tools, not to mention
> come up to speed on the differences between xml and sgml.  (I assume
> there are some significant ones, else why are we having this discussion?)

Well this actually comes back to one of Peter's points :). You shouldn't
need to update your tools. If you installed Docbook tools, it is going
to come with DSSSL and XSL. OpenJade will process the XML as will any of
the current db2, docbook2 etc.. scripts.

As far as learning the differences between xml and sgml, they are very
subtle because it is still Docbook.

http://www.oreilly.com/catalog/docbook/chapter/book/appb.html
http://www.docbook.org/tdg/en/html/ch02.html

For most things that anyone would be doing, they will see extremely
little difference. I didn't even notice the difference because I never
used short tags.

> The real problem here is that you've still failed to establish any
> sizable benefit from converting.  As best I can tell at the moment,
> the acceptable options for editing XML will be about the same as they
> are for SGML: emacs, and not a lot else.

Well basically, editing :) any editor although some make it easier as I
mentioned Emacs, BlueFish, Kate etc...

The benefits are (in my mind):

1. At least some of our regionals are already using XML. Let's make it a
little easier for them to do so.

2. It would make the new printed documentation project easier for David
Blewitt (the guy that is doing it)

3. DocBook XML is where FOSS documentation is moving or has moved
depending on the project. Should any project want to integrate any part
of our docs to their docs or vice versa, it will be easier for them if
we follow the similar standard of using Docbook XML.

4. Their is active and progressive development with XSL and Docbook.
There is not (that I can see) the same level with DSSSL and Docbbok.

5. The format objects development (FOP) seems to be better maintained
then the dblatex/db2latex/pdflatex stuff. FOP does not appear to work
with SGML.

6. Nobody has really come up with a reason *not* to move to XML that
can't easily be addressed.

7. And this may be ethereal but I do believe that if we were following
in the footsteps of other FOSS projects we may be able to get writers
from those projects. Even only 1 of them.

>   I don't really see why I
> should have to start spelling out every closing tag for a no-op like
> that.

Fair enough but if you are using a decent editor, it will take care of
that for you.

To be honest. I do not care. I have zero preference to XML or SGML
because I know them both. I do however care, when I have regionals and
contributors trying to give to the community that are having difficulty
doing so *because* we use SGML versus XML.

It is enough for me, that I have PostgreSQLFR, Chile and the printed
book project all clammoring for XML support to give more weight to their
arguments. Each of the three have presented their specific problems with
the SGML. Not to mention people such as Andrew Dunstan and the others.

Maybe the people demand isn't the concern. Okay, but switching to XML is
not hurting anyone. It's not like we are trying to change languages...
only the DTD :)

Sincerely,

Joshua D. Drake

>
>             regards, tom lane
>
--

      === The PostgreSQL Company: Command Prompt, Inc. ===
Sales/Support: +1.503.667.4564 || 24x7/Emergency: +1.800.492.2240
Providing the most comprehensive  PostgreSQL solutions since 1997
             http://www.commandprompt.com/

Donate to the PostgreSQL Project: http://www.postgresql.org/about/donate

Re: Switching to XML

From

"Magnus Hagander"

Date:

11 December 2006, 16:56:31

> The real problem here is that you've still failed to
> establish any sizable benefit from converting.  As best I can
> tell at the moment, the acceptable options for editing XML
> will be about the same as they are for SGML: emacs, and not a
> lot else.  I don't really see why I should have to start
> spelling out every closing tag for a no-op like that.

Visual Studio has a *very* nice XML editor these days, and they don't do
SGML...

(ok, ok, that's a joke. It's true, sure, but it's not very relevant...)

//Magnus

Re: Switching to XML

From

"Joshua D. Drake"

Date:

11 December 2006, 17:05:16

On Mon, 2006-12-11 at 21:56 +0100, Magnus Hagander wrote:
> > The real problem here is that you've still failed to
> > establish any sizable benefit from converting.  As best I can
> > tell at the moment, the acceptable options for editing XML
> > will be about the same as they are for SGML: emacs, and not a
> > lot else.  I don't really see why I should have to start
> > spelling out every closing tag for a no-op like that.
>
> Visual Studio has a *very* nice XML editor these days, and they don't do
> SGML...
>
> (ok, ok, that's a joke. It's true, sure, but it's not very relevant...)

Actually it might be for Windows developers wishing to develop Windows
specific sections. How does it work with Docbook?

Sincerely,

Joshua D. Drake


>
> //Magnus
>
> ---------------------------(end of broadcast)---------------------------
> TIP 6: explain analyze is your friend
>
--

      === The PostgreSQL Company: Command Prompt, Inc. ===
Sales/Support: +1.503.667.4564 || 24x7/Emergency: +1.800.492.2240
Providing the most comprehensive  PostgreSQL solutions since 1997
             http://www.commandprompt.com/

Donate to the PostgreSQL Project: http://www.postgresql.org/about/donate

Re: Switching to XML

From

David Blewett

Date:

11 December 2006, 17:09:48

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA256

Tom Lane wrote:
> "Joshua D. Drake" <jd@commandprompt.com> writes:
>> Secondly the make postgres.xml would be a one time thing.
>
> Agreed, the cost of conversion is one-time ... but it's not small.
> Aside from getting the files themselves converted, there's the effort
> for people to find, install, and learn suitable tools, not to mention
> come up to speed on the differences between xml and sgml.  (I assume
> there are some significant ones, else why are we having this discussion?)
>
> The real problem here is that you've still failed to establish any
> sizable benefit from converting.  As best I can tell at the moment,
> the acceptable options for editing XML will be about the same as they
> are for SGML: emacs, and not a lot else.  I don't really see why I
> should have to start spelling out every closing tag for a no-op like
> that.
>
>             regards, tom lane

In regards to cost of conversion: The osx tool that the makefile
uses to generate postgres.xml has more options available to tweak
the output (not resolve external files into the main file, no
newlines in tags, etc.) After adding a few of those options +
htmltidy, the result is pretty close to the original SGML, barring a
bare newline between <para>'s for example.

The benefit I'm talking about is the ability to do multiple indexes
in one document, which is not possible with DSSSL. In order to do
this, I would have to run the conversion *every time* I wanted to
work on a new release of the manual to bookstores.

David Blewett
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.5 (MingW32)
Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org

iD8DBQFFfci5Zmlc6wNjtLYRCFYwAJ45TMs/YY0RoxHi+oMCB7shPGghlQCfa+oA
hDRctNczvnrFweSV4yI2lG0=
=2NoW
-----END PGP SIGNATURE-----

Re: Switching to XML

From

Scott Marlowe

Date:

11 December 2006, 17:21:08

On Mon, 2006-12-11 at 13:53, Tom Lane wrote:
> "Joshua D. Drake" <jd@commandprompt.com> writes:
> > Your fourth is resolved, we have the conversion per make postgres.xml.
>
> We do not have that --- have you looked at the output?  It's nigh
> unreadable.  Same problem as with the authoring tools: that software
> doesn't think that formatting of the XML source text is important.

Would this help:

http://xmlindent.sourceforge.net/

I use it all the time at work, where we have LOTS of tools that make big
one line xml files for us.

Re: Switching to XML

From

Tom Lane

Date:

11 December 2006, 17:23:33

David Blewett <david@dawninglight.net> writes:
> The benefit I'm talking about is the ability to do multiple indexes
> in one document, which is not possible with DSSSL. In order to do
> this, I would have to run the conversion *every time* I wanted to
> work on a new release of the manual to bookstores.

Yeah, but for purely downstream work like that, what's the objection to
just running the osx conversion?  It seems to take just a few seconds.

I'm still not seeing where we get return on our investment for
converting the master source files from one format to the other.

            regards, tom lane

Re: Switching to XML

From

"Magnus Hagander"

Date:

11 December 2006, 18:00:38

> > > The real problem here is that you've still failed to
> establish any
> > > sizable benefit from converting.  As best I can tell at
> the moment,
> > > the acceptable options for editing XML will be about the same as
> > > they are for SGML: emacs, and not a lot else.  I don't really see
> > > why I should have to start spelling out every closing tag for a
> > > no-op like that.
> >
> > Visual Studio has a *very* nice XML editor these days, and
> they don't
> > do SGML...
> >
> > (ok, ok, that's a joke. It's true, sure, but it's not very
> > relevant...)
>
> Actually it might be for Windows developers wishing to
> develop Windows specific sections. How does it work with Docbook?

It works just fine with any XML inlcluding docbook. It does your basic
syntax highlighting, autoindenting and things like that. You can teach
it about things like docbook to get autocomplete on tags and attributes,
but I don't know if anybody has done that (I certainly haven't).

//Magnus

Re: Switching to XML

From

Tom Lane

Date:

11 December 2006, 18:02:29

"Magnus Hagander" <mha@sollentuna.net> writes:
>>> Visual Studio has a *very* nice XML editor these days, and
>>> they don't do SGML...
>>>
>>> (ok, ok, that's a joke. It's true, sure, but it's not very
>>> relevant...)
>>
>> Actually it might be for Windows developers wishing to
>> develop Windows specific sections. How does it work with Docbook?

> It works just fine with any XML inlcluding docbook. It does your basic
> syntax highlighting, autoindenting and things like that. You can teach
> it about things like docbook to get autocomplete on tags and attributes,
> but I don't know if anybody has done that (I certainly haven't).

How is it on the point of not introducing random reformatting into the
XML source text?

            regards, tom lane

Re: Switching to XML

From

"Magnus Hagander"

Date:

11 December 2006, 18:05:03

> >>> Visual Studio has a *very* nice XML editor these days, and they
> >>> don't do SGML...
> >>>
> >>> (ok, ok, that's a joke. It's true, sure, but it's not very
> >>> relevant...)
> >>
> >> Actually it might be for Windows developers wishing to develop
> >> Windows specific sections. How does it work with Docbook?
>
> > It works just fine with any XML inlcluding docbook. It does
> your basic
> > syntax highlighting, autoindenting and things like that.
> You can teach
> > it about things like docbook to get autocomplete on tags and
> > attributes, but I don't know if anybody has done that (I
> certainly haven't).
>
> How is it on the point of not introducing random reformatting
> into the XML source text?

it doesn't do it, period. Just to be clear, it's a XML *source* editor.
Just like emacs, excpet prettier ;-)

(and just for the record, these days even the GUI editors in Visual
Studio generally do a very good job of not breaking your manually tuned
markup, but there is no GUI editor for docbook)

//Magnus

Re: Switching to XML

From

Guillaume Lelarge

Date:

11 December 2006, 20:29:45

David Blewett a écrit :
> Peter Eisentraut wrote:
>> Right.  So using the XSL(T) stylesheet instead of the DSSSL stylesheet would
>> be a step in the right direction.  Actually the 8.2 branch currently only has
>> an XSLT stylesheet for HTML output.  In 8.3devel I've added one for XSL-FO
>> output, which you can invoke by "make postgres-{A4|US}.fo".  (You will
>> probably want yet a different paper format for your book.)
>
> Excuse my ignorance, but I thought XSLT stylesheets can only be used
> on XML files?
>

That's right. To use Peter's work, you need to checkout CVS HEAD, cd
into doc/src/sgml, "make postgres.xml" which build a valid XML file.

Next step depends on the output format you want to get. To get HTML
output from XSLT file, execute "make xslthtml". This target use
stylesheet.xsl which customizes a bit the XSLT stylesheets for XHTML
available from docbook.sourceforge.net. To get .fo output (required to
build a PDF file), "make postgres-A4.fo". This command will use the
stylesheet-fo.xsl file. This file is just a pointer to XSLT stylesheets
for PDF available from docbook.sourceforge.net. Simple but effective.

A summary :
 * to build HTML manual with XSLT :
   make postgres.xml
   make xslthtml
 * to build PDF manual with XSLT :
   make postgres.xml
   make postgres-A4.fo
   fop.sh postgres-A4.fo postgres-A4.pdf

I think my fop install is broken because I'm not able to build the PDF
file...

Peter, if I said something stupid, please, tell it. This is just what I
understood from your mail and the Makefiles/xslt files you provide.

Regards.

--
Guillaume.
<!-- http://abs.traduc.org/
     http://lfs.traduc.org/
     http://docs.postgresqlfr.org/ -->

Re: Switching to XML

From

David Blewett

Date:

11 December 2006, 21:39:55

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA256

Tom Lane wrote:
> Yeah, but for purely downstream work like that, what's the objection to
> just running the osx conversion?  It seems to take just a few seconds.
>
> I'm still not seeing where we get return on our investment for
> converting the master source files from one format to the other.
>
>             regards, tom lane

Does the designation of "downstream work" apply here? I was
intending to contribute back any changes required, to enable anyone
else to build the multiple volume version for printing. I wanted it
to become an easily maintainable way for the project to have an
up-to-date version of our manual available to bookstores, or
directly available through lulu.com.

I also would like to reiterate the fact that the localization
efforts would appreciate not having to re-do their work for each
release. Do we really want to continue considering this addition to
the project as well as their work to be "second-class citizens" wrt
the rest of the distribution?

David Blewett
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.5 (MingW32)
Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org

iD8DBQFFfgg/Zmlc6wNjtLYRCK9pAKCtN+UDuiYMprloV6Kx9XM4X3jCXQCgvhZ3
sRgVSsSAwUhHhQ2aMMC4ehA=
=XA3B
-----END PGP SIGNATURE-----

Re: Switching to XML

From

Tom Lane

Date:

11 December 2006, 21:47:37

David Blewett <david@dawninglight.net> writes:
> Does the designation of "downstream work" apply here? I was
> intending to contribute back any changes required, to enable anyone
> else to build the multiple volume version for printing.

Changes required where?  AFAICS the useful content of your work would
probably be in the stylesheets, which there is already an XML version
of.

> I also would like to reiterate the fact that the localization
> efforts would appreciate not having to re-do their work for each
> release.

How exactly does working from an osx translation rather than hand-edited
sources handicap them?  Or even more to the point, which part of their
work don't they have to redo anyway?  We certainly have never hesitated
to whack the documentation around in small ways and large...

            regards, tom lane

Re: Switching to XML

From

Guillaume Lelarge

Date:

12 December 2006, 03:58:26

Tom Lane a ecrit le 12/12/2006 02:47:
> David Blewett <david@dawninglight.net> writes:
> [...]
>> I also would like to reiterate the fact that the localization
>> efforts would appreciate not having to re-do their work for each
>> release.
>
> How exactly does working from an osx translation rather than hand-edited
> sources handicap them?  Or even more to the point, which part of their
> work don't they have to redo anyway?  We certainly have never hesitated
> to whack the documentation around in small ways and large...
>

I don't know for others but I think I can speak for the french team :)

I also don't quite understand the whole point on the translation. I use
Kompare to... hmmm... compare one release to another. I merge *manually*
the differences, and then I translate them. I also need to transform
SGML tags to XML tags (</> stuff and no ending tags). Actually, the
merge/translate step is much more difficult.

If some projects prefer to use po (with xml2pot or po4all), "make
postgres.xml" is the required step. The XML file built this way seems
valid to me.

Regards.

--
Guillaume.

Re: Switching to XML

From

Peter Eisentraut

Date:

12 December 2006, 03:59:45

Joshua D. Drake wrote:
> 5. The format objects development (FOP) seems to be better maintained
> then the dblatex/db2latex/pdflatex stuff.

Check the facts.  The latest stable FOP release was more than three
years ago.  The latest stable dblatex release was less than a month
ago.

--
Peter Eisentraut
http://developer.postgresql.org/~petere/

Re: Switching to XML

From

Peter Eisentraut

Date:

12 December 2006, 04:07:18

Joshua D. Drake wrote:
> Of course your changes that you make from release to release for
> volumes won't be permanent

Well, of course he'd make his changes in the sources, not in the
intermediate XML file.  Or he is talking about a kind of changes which
I cannot fathom at the moment.

--
Peter Eisentraut
http://developer.postgresql.org/~petere/

Re: Switching to XML

From

Peter Eisentraut

Date:

12 December 2006, 04:08:07

David Blewett wrote:
> Excuse my ignorance, but I thought XSLT stylesheets can only be used
> on XML files?

FWIW, you can use XSLT stylesheets on SGML as well, but we don't do
that.

--
Peter Eisentraut
http://developer.postgresql.org/~petere/

Re: Switching to XML

From

Peter Eisentraut

Date:

12 December 2006, 04:10:23

Guillaume Lelarge wrote:
> A summary :
>  * to build HTML manual with XSLT :
>    make postgres.xml
>    make xslthtml

Just the last command should be enough.

>  * to build PDF manual with XSLT :
>    make postgres.xml
>    make postgres-A4.fo

Same here.

>    fop.sh postgres-A4.fo postgres-A4.pdf
>
> I think my fop install is broken because I'm not able to build the
> PDF file...

I have been using FOP on and off for many years and I've never seen it
produce any real document longer than two pages, so I'm not surprised.

--
Peter Eisentraut
http://developer.postgresql.org/~petere/

Re: Switching to XML

From

Guillaume Lelarge

Date:

12 December 2006, 04:16:45

Peter Eisentraut a ecrit le 12/12/2006 09:10:
> Guillaume Lelarge wrote:
>>    fop.sh postgres-A4.fo postgres-A4.pdf
>>
>> I think my fop install is broken because I'm not able to build the
>> PDF file...
>
> I have been using FOP on and off for many years and I've never seen it
> produce any real document longer than two pages, so I'm not surprised.
>

It produces PDF file for the PostgreSQL french manual. I really have no
problem to build the 1098 pages PDF file for 8.1.5 release.


--
Guillaume.

Re: Authoring Tools WAS: Switching to XML

From

Peter Eisentraut

Date:

12 December 2006, 11:26:56

Josh Berkus wrote:
> Anyway, from the sound of it whether or not there are better
> authoring tools for XML than SGML is "undetermined pending further
> investigation."

I'll be the first to welcome better editing options, but so far I have
only heard them being theorized about.

--
Peter Eisentraut
http://developer.postgresql.org/~petere/

Re: Switching to XML

From

"Joshua D. Drake"

Date:

12 December 2006, 13:40:58

On Tue, 2006-12-12 at 08:59 +0100, Peter Eisentraut wrote:
> Joshua D. Drake wrote:
> > 5. The format objects development (FOP) seems to be better maintained
> > then the dblatex/db2latex/pdflatex stuff.
>
> Check the facts.  The latest stable FOP release was more than three
> years ago.

Actually it was about 8 months ago. 0.92 Beta.

>  The latest stable dblatex release was less than a month
> ago.

I have found dblatex to not work very well (as in at all) but that could
very well be my installation.

Joshua D. Drake


>
--

      === The PostgreSQL Company: Command Prompt, Inc. ===
Sales/Support: +1.503.667.4564 || 24x7/Emergency: +1.800.492.2240
Providing the most comprehensive  PostgreSQL solutions since 1997
             http://www.commandprompt.com/

Donate to the PostgreSQL Project: http://www.postgresql.org/about/donate

Re: Switching to XML

From

"Joshua D. Drake"

Date:

12 December 2006, 13:42:29

On Tue, 2006-12-12 at 09:10 +0100, Peter Eisentraut wrote:
> Guillaume Lelarge wrote:
> > A summary :
> >  * to build HTML manual with XSLT :
> >    make postgres.xml
> >    make xslthtml
>
> Just the last command should be enough.
>
> >  * to build PDF manual with XSLT :
> >    make postgres.xml
> >    make postgres-A4.fo
>
> Same here.
>
> >    fop.sh postgres-A4.fo postgres-A4.pdf
> >
> > I think my fop install is broken because I'm not able to build the
> > PDF file...
>
> I have been using FOP on and off for many years and I've never seen it
> produce any real document longer than two pages, so I'm not surprised.

I was able to easily (once I got the environment sorted out) generate
the entire french documentation with FOP.

Joshua D. Drake

>
--

      === The PostgreSQL Company: Command Prompt, Inc. ===
Sales/Support: +1.503.667.4564 || 24x7/Emergency: +1.800.492.2240
Providing the most comprehensive  PostgreSQL solutions since 1997
             http://www.commandprompt.com/

Donate to the PostgreSQL Project: http://www.postgresql.org/about/donate

Re: Switching to XML

From

Peter Eisentraut

Date:

12 December 2006, 15:28:09

Joshua D. Drake wrote:
> > Check the facts.  The latest stable FOP release was more than three
> > years ago.
>
> Actually it was about 8 months ago. 0.92 Beta.

That is not a stable release.  And the fact that the latest beta is 8
months old doesn't impress me either.
--
Peter Eisentraut
http://developer.postgresql.org/~petere/

Re: Switching to XML

From

"Joshua D. Drake"

Date:

12 December 2006, 15:47:34

On Tue, 2006-12-12 at 20:27 +0100, Peter Eisentraut wrote:
> Joshua D. Drake wrote:
> > > Check the facts.  The latest stable FOP release was more than three
> > > years ago.
> >
> > Actually it was about 8 months ago. 0.92 Beta.
>
> That is not a stable release.  And the fact that the latest beta is 8
> months old doesn't impress me either.

I am not going to argue with you further. I have better things to do.

Sincerely,

Joshua D. Drake


--

      === The PostgreSQL Company: Command Prompt, Inc. ===
Sales/Support: +1.503.667.4564 || 24x7/Emergency: +1.800.492.2240
Providing the most comprehensive  PostgreSQL solutions since 1997
             http://www.commandprompt.com/

Donate to the PostgreSQL Project: http://www.postgresql.org/about/donate

Re: Switching to XML

From

Bruce Momjian

Date:

12 December 2006, 18:45:04

Tom Lane wrote:
> Josh Berkus <josh@agliodbs.com> writes:
> > As I said then, this is absolutely untrue.  OpenOffice.org, for example,
> > works with DocBook XML but not SGML.  There are also a plethora of XML
> > editing and publishing tools which can been used for Docbook XML which
> > are not available for SGML.  A simple look at this page:
> > http://wiki.docbook.org/topic/DocBookAuthoringTools
> > .... shows that there are more than twice as many authoring tools which
> > support only XML as support SGML -- and that most of the tools which
> > support SGML are out-of-maintenance.
>
> This is confusing authoring tools (ie, stuff for more or less WYSIWYG
> editing of the document source) with output generation tools.
>
> As for authoring tools, show me one that produces SGML or XML that's
> reasonably readable, and I might worry about allowing people to use it.
> Most of the ones I've seen would render the doc sources unreadable for
> anyone not using an authoring tool (possibly even the very same
> authoring tool).  We are not going to move in that direction
> because it would piss off the people who do the bulk of the work now.

And diffs would be either very large or useless.

--
  Bruce Momjian   bruce@momjian.us
  EnterpriseDB    http://www.enterprisedb.com

  + If your life is a hard drive, Christ can be your backup. +

Re: Authoring Tools WAS: Switching to XML

From

Bruce Momjian

Date:

13 December 2006, 13:14:33

I have read the two long XML threads, and see these issues for
converting to XML:

o authoring, in two parts:

    o WYSIWYG - this seems unattainable because such editors are
      going to modify the surrounding XML, which will affect
      hand-edited changes

    o tag assistance - this seems possible, e.g. tag completion,
      drop down list of tags available for this section

o output generation - the XML toolchain is more widely used, but Tom has
  fixed the performance issue we have had for years.  The fact that the
  tool maintainers were unable to fix it themselves reflects badly on
  their maintenance of this toolchain.

o translation - does XML have better tools to assist in translating our
  manuals, and is this better than outputting XML via our makefile?

It would be nice to use htmltidy to clean up the xml before each
release, but I wonder whether it understand the tags well enough to
provide better formatting than we do now manually.

Tom, I think we need that SGML patch placed on our ftp site and a
reference to it made in our documentation.  I can do it of you want.

--
  Bruce Momjian   bruce@momjian.us
  EnterpriseDB    http://www.enterprisedb.com

  + If your life is a hard drive, Christ can be your backup. +

Re: Authoring Tools WAS: Switching to XML

From

Josh Berkus

Date:

14 December 2006, 16:56:43

Bruce,

>     o WYSIWYG - this seems unattainable because such editors are
>       going to modify the surrounding XML, which will affect
>       hand-edited changes

It would be more accurate to say that we have not identified a WYSWYG tool
which does not mess up the source.  There may be one, it would just take a
fair amount of testing to find it.

>     o tag assistance - this seems possible, e.g. tag completion,
>       drop down list of tags available for this section

Yes, and graphically-enhanced tag support using XML IDEs.  Command-line tag
completion is available via Emacs-SGML, but there are a lot more tools
which do it for XML (including Emacs).

--
--Josh

Josh Berkus
PostgreSQL @ Sun
San Francisco

Re: Authoring Tools WAS: Switching to XML

From

Scott Marlowe

Date:

14 December 2006, 17:23:28

On Thu, 2006-12-14 at 14:58, Josh Berkus wrote:
> Bruce,
>
> >     o WYSIWYG - this seems unattainable because such editors are
> >       going to modify the surrounding XML, which will affect
> >       hand-edited changes
>
> It would be more accurate to say that we have not identified a WYSWYG tool
> which does not mess up the source.  There may be one, it would just take a
> fair amount of testing to find it.

Is this strictly a question of indentation, or one of actually mangling
tags and such?

Re: Authoring Tools WAS: Switching to XML

From

Tom Lane

Date:

14 December 2006, 17:34:23

Scott Marlowe <smarlowe@g2switchworks.com> writes:
> On Thu, 2006-12-14 at 14:58, Josh Berkus wrote:
>> It would be more accurate to say that we have not identified a WYSWYG tool
>> which does not mess up the source.  There may be one, it would just take a
>> fair amount of testing to find it.

> Is this strictly a question of indentation, or one of actually mangling
> tags and such?

What we need is something that does not change regions of the file that
the user did not intend to modify.  I think horizonal-white-space-only
changes could be worked around if the user is careful to use diff
--ignore-space-change when submitting the patch, but I suspect that an
editor that thinks it can mangle whitespace will also figure that it can
change line boundaries, and then diff will never be able to extract any
signal from that noise.

            regards, tom lane

Re: Authoring Tools WAS: Switching to XML

From

Scott Marlowe

Date:

14 December 2006, 17:37:08

On Thu, 2006-12-14 at 15:34, Tom Lane wrote:
> Scott Marlowe <smarlowe@g2switchworks.com> writes:
> > On Thu, 2006-12-14 at 14:58, Josh Berkus wrote:
> >> It would be more accurate to say that we have not identified a WYSWYG tool
> >> which does not mess up the source.  There may be one, it would just take a
> >> fair amount of testing to find it.
>
> > Is this strictly a question of indentation, or one of actually mangling
> > tags and such?
>
> What we need is something that does not change regions of the file that
> the user did not intend to modify.  I think horizonal-white-space-only
> changes could be worked around if the user is careful to use diff
> --ignore-space-change when submitting the patch, but I suspect that an
> editor that thinks it can mangle whitespace will also figure that it can
> change line boundaries, and then diff will never be able to extract any
> signal from that noise.

Ahhhh.  I see.

Well, I don't mind teaching myself some docbook, but the first time I
looked at adding to the docs I was overwhelmed with what I didn't know,
and didn't even see a starting point.  I'll look in the appendices and
see what I can see.

Re: Switching to XML

From

Bruce Momjian

Date:

14 December 2006, 22:46:00

I have applied the attached documentation patch to HEAD and 8.2.X to
document the use of Tom's patch to openjade to get ouput in a reasonable
amount of time.

---------------------------------------------------------------------------

Tom Lane wrote:
> Peter Eisentraut <peter_e@gmx.net> writes:
> > Tom Lane wrote:
> >> Since jade does not go into this kind of spiral when producing html
> >> output from the same sources, I suggest that it's not jade's fault,
> >> but rather crummy coding in the sgml-to-tex conversion scripts it's
> >> using.
>
> > Right.  I fixed that, so now it takes about 15 minutes to build the
> > whole thing.
>
> Actually, I just finished fixing what seems to be the underlying problem
> in jade: it's got a spectacularly bad implementation of linked lists.
> In PG-code terms, if the list is built by successive lappends(), then
> both lfirst() and lnext() take O(N) time for an N-element list, thus
> iterating through the list in the usual way is O(N^2) ... and not even
> with a reasonably small multiplier, because it stresses the hell out of
> memory allocation and garbage collection while it's at it.  (Which may
> well mean that it's really more like O(N^3), since the garbage collector
> will iterate over every allocated object every so often.)
>
> With the patch it takes me about 5 minutes to do the jade step of the
> PDF build, using this morning's SGML sources.  (I don't know how to set
> the TeX configuration to get the pdfjadetex steps to go through, so I
> dunno about total time.)
>
> However, I have no idea what it'll take to get this patch propagated
> into the copies people actually use, so your fix sounds good for the
> short term.
>
>             regards, tom lane
>
> diff -cr openjade-1.3.2.orig/style/ELObj.cxx openjade-1.3.2/style/ELObj.cxx
> *** openjade-1.3.2.orig/style/ELObj.cxx    Fri Jan 11 10:48:38 2002
> --- openjade-1.3.2/style/ELObj.cxx    Sat Dec  9 21:58:36 2006
> ***************
> *** 1044,1049 ****
> --- 1044,1054 ----

--
  Bruce Momjian   bruce@momjian.us
  EnterpriseDB    http://www.enterprisedb.com

  + If your life is a hard drive, Christ can be your backup. +
Index: doc/src/sgml/docguide.sgml
===================================================================
RCS file: /cvsroot/pgsql/doc/src/sgml/docguide.sgml,v
retrieving revision 1.61
diff -c -c -r1.61 docguide.sgml
*** doc/src/sgml/docguide.sgml    10 Dec 2006 20:46:03 -0000    1.61
--- doc/src/sgml/docguide.sgml    15 Dec 2006 02:39:37 -0000
***************
*** 115,120 ****
--- 115,123 ----
         <acronym>DSSSL</acronym> stylesheets), as well as a number of
         related tools.  <productname>Jade</productname> is now being
         maintained by the OpenJade group, no longer by James Clark.
+        (If generating Postscript or PDF output, you will need to
+        compile from source and use a special patch to get output
+        in a reasonable amount of time.)
        </para>
       </listitem>
      </varlistentry>
***************
*** 295,300 ****
--- 298,311 ----
          installed and you want to install the rest of the tool chain
          locally.)
         </para>
+
+        <para>
+         OpenJade release 1.3.2 and perhaps earlier and later releases
+         have a known bug that causes Postscript and PDF output
+         generation to take days.  This <ulink
+         url="http://archives.postgresql.org/pgsql-docs/2006-12/msg00064.php">patch</ulink>
+         fixes the problem and generates output in a few minutes.
+        </para>
        </step>

        <step id="doc-openjade-install">

Re: Authoring Tools WAS: Switching to XML

From

David Fetter

Date:

15 December 2006, 02:15:26

On Thu, Dec 14, 2006 at 04:34:06PM -0500, Tom Lane wrote:
> Scott Marlowe <smarlowe@g2switchworks.com> writes:
> > On Thu, 2006-12-14 at 14:58, Josh Berkus wrote:
> >> It would be more accurate to say that we have not identified a
> >> WYSWYG tool which does not mess up the source.  There may be one,
> >> it would just take a fair amount of testing to find it.
>
> > Is this strictly a question of indentation, or one of actually
> > mangling tags and such?
>
> What we need is something that does not change regions of the file
> that the user did not intend to modify.  I think
> horizonal-white-space-only changes could be worked around if the
> user is careful to use diff --ignore-space-change when submitting
> the patch, but I suspect that an editor that thinks it can mangle
> whitespace will also figure that it can change line boundaries, and
> then diff will never be able to extract any signal from that noise.

Could post-processing with tidy clean this up?  I'm pretty sure that
tidy's output is deterministic, at least if the editor hasn't mangled
tags...

Cheers,
D
--
David Fetter <david@fetter.org> http://fetter.org/
phone: +1 415 235 3778        AIM: dfetter666
                              Skype: davidfetter

Remember to vote!

Re: Authoring Tools WAS: Switching to XML

From

"Joshua D. Drake"

Date:

15 December 2006, 02:37:45

> > What we need is something that does not change regions of the file
> > that the user did not intend to modify.  I think
> > horizonal-white-space-only changes could be worked around if the
> > user is careful to use diff --ignore-space-change when submitting
> > the patch, but I suspect that an editor that thinks it can mangle
> > whitespace will also figure that it can change line boundaries, and
> > then diff will never be able to extract any signal from that noise.
>
> Could post-processing with tidy clean this up?  I'm pretty sure that
> tidy's output is deterministic, at least if the editor hasn't mangled
> tags...
>

Yes tidy makes things pretty darn clean. However, there is one thing
that it won't do and that is remove extra tags. Thus something like
OpenOffice is still a no-op.

Joshua D. Drake

> Cheers,
> D
--

      === The PostgreSQL Company: Command Prompt, Inc. ===
Sales/Support: +1.503.667.4564 || 24x7/Emergency: +1.800.492.2240
Providing the most comprehensive  PostgreSQL solutions since 1997
             http://www.commandprompt.com/

Donate to the PostgreSQL Project: http://www.postgresql.org/about/donate

Re: Switching to XML

From

Peter Eisentraut

Date:

15 December 2006, 05:07:59

Am Freitag, 15. Dezember 2006 03:45 schrieb Bruce Momjian:
> I have applied the attached documentation patch to HEAD and 8.2.X to
> document the use of Tom's patch to openjade to get ouput in a reasonable
> amount of time.

This is no longer accurate in 8.3.  A workaround has been installed in the
stylesheets to arrive at the same result.

--
Peter Eisentraut
http://developer.postgresql.org/~petere/

Re: Switching to XML

From

Bruce Momjian

Date:

15 December 2006, 12:50:29

Peter Eisentraut wrote:
> Am Freitag, 15. Dezember 2006 03:45 schrieb Bruce Momjian:
> > I have applied the attached documentation patch to HEAD and 8.2.X to
> > document the use of Tom's patch to openjade to get ouput in a reasonable
> > amount of time.
>
> This is no longer accurate in 8.3.  A workaround has been installed in the
> stylesheets to arrive at the same result.

OK, I have talked to Peter via IM and now understand that his change to
stylesheet.dsl avoids the bug Tom found and fixed, so instead of
mentioning the patch in the SGML docs, I added a comment in the
stylesheet.dsl file and pointed to the URL.

Also, the double-run of the PDF does not fix the index.  Only a proper
bookindex.sgml does, that, so I removed the double runs and either Peter
or I will work on issuing a warning when the bookindex.sgml file isn't
proper, so people know to run it again.

Revert patch attached and applied to HEAD and 8.2.X.

--
  Bruce Momjian   bruce@momjian.us
  EnterpriseDB    http://www.enterprisedb.com

  + If your life is a hard drive, Christ can be your backup. +
Index: doc/src/sgml/Makefile
===================================================================
RCS file: /cvsroot/pgsql/doc/src/sgml/Makefile,v
retrieving revision 1.85
diff -c -c -r1.85 Makefile
*** doc/src/sgml/Makefile    15 Dec 2006 01:20:59 -0000    1.85
--- doc/src/sgml/Makefile    15 Dec 2006 16:29:03 -0000
***************
*** 99,104 ****
--- 99,107 ----

  COLLATEINDEX := LC_ALL=C $(PERL) $(COLLATEINDEX) -f -g

+ # If HTML.index does not exist, create a dummy bookindex.sgml.  During the
+ # next build, create bookindex.sgml with the proper index contents.  A proper
+ # bookindex.sgml is required to have an index in the output.
  ifeq (,$(wildcard HTML.index))
  bookindex.sgml:
      $(COLLATEINDEX) -o $@ -N
***************
*** 136,164 ****

  JADE.tex.call = $(JADE) $(JADEFLAGS) $(SGMLINCLUDE) $(CATALOG) -d $(srcdir)/stylesheet.dsl -t tex -V tex-backend -i
output-print

! %-A4.tex-ps: %.sgml $(ALLSGML) stylesheet.dsl
!     # multiple runs are necessary to create proper index entries
!     $(JADE.tex.call) -V texdvi-output -V '%paper-type%'=A4 -o $@ $<
      $(JADE.tex.call) -V texdvi-output -V '%paper-type%'=A4 -o $@ $<

! %-US.tex-ps: %.sgml $(ALLSGML) stylesheet.dsl
!     # multiple runs are necessary to create proper index entries
!     $(JADE.tex.call) -V texdvi-output -V '%paper-type%'=USletter -o $@ $<
      $(JADE.tex.call) -V texdvi-output -V '%paper-type%'=USletter -o $@ $<

! %-A4.tex-pdf: %.sgml $(ALLSGML) stylesheet.dsl
!     # multiple runs are necessary to create proper index entries
!     $(JADE.tex.call) -V texpdf-output -V '%paper-type%'=A4 -o $@ $<
      $(JADE.tex.call) -V texpdf-output -V '%paper-type%'=A4 -o $@ $<

! %-US.tex-pdf: %.sgml $(ALLSGML) stylesheet.dsl
!     # multiple runs are necessary to create proper index entries
!     $(JADE.tex.call) -V texpdf-output -V '%paper-type%'=USletter -o $@ $<
      $(JADE.tex.call) -V texpdf-output -V '%paper-type%'=USletter -o $@ $<

  %.dvi: %.tex-ps
      @rm -f $*.aux $*.log
!     # multiple runs are necessary to create proper index entries
      jadetex $<
      jadetex $<
      jadetex $<
--- 139,159 ----

  JADE.tex.call = $(JADE) $(JADEFLAGS) $(SGMLINCLUDE) $(CATALOG) -d $(srcdir)/stylesheet.dsl -t tex -V tex-backend -i
output-print

! %-A4.tex-ps: %.sgml $(ALLSGML) stylesheet.dsl bookindex.sgml
      $(JADE.tex.call) -V texdvi-output -V '%paper-type%'=A4 -o $@ $<

! %-US.tex-ps: %.sgml $(ALLSGML) stylesheet.dsl bookindex.sgml
      $(JADE.tex.call) -V texdvi-output -V '%paper-type%'=USletter -o $@ $<

! %-A4.tex-pdf: %.sgml $(ALLSGML) stylesheet.dsl bookindex.sgml
      $(JADE.tex.call) -V texpdf-output -V '%paper-type%'=A4 -o $@ $<

! %-US.tex-pdf: %.sgml $(ALLSGML) stylesheet.dsl bookindex.sgml
      $(JADE.tex.call) -V texpdf-output -V '%paper-type%'=USletter -o $@ $<

  %.dvi: %.tex-ps
      @rm -f $*.aux $*.log
!     # multiple runs are necessary to create proper intra-document links
      jadetex $<
      jadetex $<
      jadetex $<
***************
*** 169,175 ****

  %.pdf: %.tex-pdf
      @rm -f $*.aux $*.log $*.out
!     # multiple runs are necessary to create proper index entries
      pdfjadetex $<
      pdfjadetex $<
      pdfjadetex $<
--- 164,170 ----

  %.pdf: %.tex-pdf
      @rm -f $*.aux $*.log $*.out
!     # multiple runs are necessary to create proper intra-document links
      pdfjadetex $<
      pdfjadetex $<
      pdfjadetex $<
Index: doc/src/sgml/docguide.sgml
===================================================================
RCS file: /cvsroot/pgsql/doc/src/sgml/docguide.sgml,v
retrieving revision 1.62
diff -c -c -r1.62 docguide.sgml
*** doc/src/sgml/docguide.sgml    15 Dec 2006 02:44:28 -0000    1.62
--- doc/src/sgml/docguide.sgml    15 Dec 2006 16:29:03 -0000
***************
*** 115,123 ****
         <acronym>DSSSL</acronym> stylesheets), as well as a number of
         related tools.  <productname>Jade</productname> is now being
         maintained by the OpenJade group, no longer by James Clark.
-        (If generating Postscript or PDF output, you will need to
-        compile from source and use a special patch to get output
-        in a reasonable amount of time.)
        </para>
       </listitem>
      </varlistentry>
--- 115,120 ----
***************
*** 298,311 ****
          installed and you want to install the rest of the tool chain
          locally.)
         </para>
-
-        <para>
-         OpenJade release 1.3.2 and perhaps earlier and later releases
-         have a known bug that causes Postscript and PDF output
-         generation to take days.  This <ulink
-         url="http://archives.postgresql.org/pgsql-docs/2006-12/msg00064.php">patch</ulink>
-         fixes the problem and generates output in a few minutes.
-        </para>
        </step>

        <step id="doc-openjade-install">
--- 295,300 ----
Index: doc/src/sgml/stylesheet.dsl
===================================================================
RCS file: /cvsroot/pgsql/doc/src/sgml/stylesheet.dsl,v
retrieving revision 1.31
diff -c -c -r1.31 stylesheet.dsl
*** doc/src/sgml/stylesheet.dsl    10 Dec 2006 01:55:29 -0000    1.31
--- doc/src/sgml/stylesheet.dsl    15 Dec 2006 16:29:03 -0000
***************
*** 280,285 ****
--- 280,293 ----
  (define %refentry-new-page%     #t)
  (define %refentry-keep%         #f)

+ ;; Indentation of verbatim environments.  (This should really be done
+ ;; with start-indent in DSSSL.)
+ ;; Use of indentation in this area exposes a bug in openjade,
+ ;; http://archives.postgresql.org/pgsql-docs/2006-12/msg00064.php
+ ;; (define %indent-programlisting-lines% "    ")
+ ;; (define %indent-screen-lines% "    ")
+ ;; (define %indent-synopsis-lines% "    ")
+

  ;; Default graphic format: Jadetex wants eps, pdfjadetex wants pdf.
  ;; (Note that pdfjadetex will not accept eps, that's why we need to

Re: Switching to XML

From

Peter Eisentraut

Date:

15 December 2006, 17:39:42

Bruce Momjian wrote:
> Also, the double-run of the PDF does not fix the index.  Only a
> proper bookindex.sgml does, that, so I removed the double runs and
> either Peter or I will work on issuing a warning when the
> bookindex.sgml file isn't proper, so people know to run it again.

Producing a warning isn't hard, but the question is where to put it so
it doesn't get buried in all the other output.

--
Peter Eisentraut
http://developer.postgresql.org/~petere/

Re: Switching to XML

From

Bruce Momjian

Date:

16 December 2006, 23:18:03

Peter Eisentraut wrote:
> Bruce Momjian wrote:
> > Also, the double-run of the PDF does not fix the index.  Only a
> > proper bookindex.sgml does, that, so I removed the double runs and
> > either Peter or I will work on issuing a warning when the
> > bookindex.sgml file isn't proper, so people know to run it again.
>
> Producing a warning isn't hard, but the question is where to put it so
> it doesn't get buried in all the other output.

I think the Makefile rule just emits it at the end of its processing.

Also, for testing, I was thinking you could use 'wc -l bookindex.sgml',
and if the file length is less thatn 25 lines, it is a fake file.


--
  Bruce Momjian   bruce@momjian.us
  EnterpriseDB    http://www.enterprisedb.com

  + If your life is a hard drive, Christ can be your backup. +

Re: Switching to XML

From

David Blewett

Date:

20 December 2006, 21:33:52

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA256

Tom Lane wrote:
> However, I have no idea what it'll take to get this patch propagated
> into the copies people actually use, so your fix sounds good for the
> short term.
>
>             regards, tom lane


I submitted the patch to gentoo's bugzilla:
http://bugs.gentoo.org/show_bug.cgi?id=158605

David Blewett

-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.5 (MingW32)
Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org

iD8DBQFFieRjZmlc6wNjtLYRCH2GAJ4rfMXrcPAceJoL5rTPjmpY9EAndACcCfes
oxBVK5jqghkxKVXEbwRD8m8=
=bR1k
-----END PGP SIGNATURE-----

Re: Switching to XML

From

David Blewett

Date:

22 December 2006, 06:39:39

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA256

David Blewett wrote:
> Tom Lane wrote:
>> However, I have no idea what it'll take to get this patch propagated
>> into the copies people actually use, so your fix sounds good for the
>> short term.
>
>>             regards, tom lane
>
>
> I submitted the patch to gentoo's bugzilla:
> http://bugs.gentoo.org/show_bug.cgi?id=158605
>
> David Blewett
>

Hi Tom:

There's been a question posted to that bug. Would you mind replying
to it? You don't have to create an account there. I can post your reply.

David

Quoted,  leonardop@gentoo.org:
"Very interesting. I'd of course love to add a patch like this,
however, it seems not so easy to follow. Not that the patch itself
is overly complicated, but openjade sources are, let's say, less
than ideal.

If you or the original author of the patch could comment the
changes, specially those in style/primitive.cxx regarding to the
protect and protect2 objects, it would help a lot.

Also, for future reference, please create your patches as unified
diffs when submitting them to resources such as this bugzilla (many
other projects encourage this as well).

Thanks."

-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.5 (MingW32)
Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org

iD8DBQFFi7W0Zmlc6wNjtLYRCBUPAJ0X2y99yxnFsike34Vo+JHckPcI6wCfVzdc
HuCmC6It6UvVa6a/5Z1QuGU=
=fJ3G
-----END PGP SIGNATURE-----

Re: Switching to XML

From

Tom Lane

Date:

22 December 2006, 11:18:36

David Blewett <david@dawninglight.net> writes:
> There's been a question posted to that bug. Would you mind replying
> to it? You don't have to create an account there. I can post your reply.

I haven't gotten any reply from my posting to the upstream mailing list
at sourceforge :-( so it's hard to tell if anything will happen there.
I don't really approve of fixing this on a distro-by-distro basis, but
maybe we have no other choice.

Anyway, what I said to them was

------- Forwarded Message

Date:    Sat, 09 Dec 2006 23:35:54 -0500
From:    Tom Lane <tgl@sss.pgh.pa.us>
To:      openjade-devel@lists.sourceforge.net
Subject: Performance problems with NodeListObj lists

It's been folklore for some time at the PostgreSQL project
(www.postgresql.org) that running our documentation through jade to
produce TeX output requires about three days on current hardware :-(
Today I got motivated to look into why, and what I find is a pretty
localized problem, as illustrated by this oprofile trace:

samples  %        symbol name
2082917  98.5829  OpenJade_DSSSL::PairNodeListObj::nodeListFirst(OpenJade_DSSSL::EvalContext&,
OpenJade_DSSSL::Interpreter&)
9713      0.4597  OpenJade_DSSSL::PairNodeListObj::nodeListRest(OpenJade_DSSSL::EvalContext&,
OpenJade_DSSSL::Interpreter&)
5019      0.2375  OpenJade_DSSSL::AppendSosofoObj::traceSubObjects(Collector&) const
3571      0.1690  Collector::collect()
1938      0.0917  OpenJade_DSSSL::FlowObj::traceSubObjects(Collector&) const

While I've not traced through the stylesheets to determine exactly what
causes this, the behavior seen at the C++ level is that long NodeListObj
lists (on the order of 10K elements) are built by successive "append"
operations using the NodeList primitive.  This produces a
PairNodeListObj tree that's nested in the wrong direction; if you'll
pardon some crude ASCII art:

                PairNodeListObj
                  /           \
            PairNodeListObj    e
              /           \
        PairNodeListObj    d
          /           \
    PairNodeListObj    c
       /      \
      a        b

This means that for an N-element list, nodeListFirst requires O(N) time
to recurse down to the first element, and so does nodeListRest.  So
iterating through the list in the usual way requires O(N^2) time.
To add to the problem, nodeListRest generates another PairNodeListObj
on each call, imposing a ton of mostly-useless load on the memory
allocation/garbage collection machinery.

Attached is a first-cut attempt at fixing this.  With this patch,
jade gets through the Postgres doc sources in 5 minutes rather than
the reported three days.  (I've never actually built the sources on
the machine I'm testing on, so these numbers are not strictly
comparable.)

As patched, PairNodeListObj is used only in ReverseNodeListObj;
it's tempting to get rid of it entirely by merging it somehow
with the new CellNodeListObj class.  But since I've never looked
at the jade sources before today, I figured I'd better send it
in without any more ado, so someone who knows better can tell me
why this is not gonna work ...

            regards, tom lane

[ plus patch, stripped ]

------- End of Forwarded Message

The point of the patch is to introduce a version of NodeListObj that
tracks both the head and the tail of the list it represents, so that
successive appends can produce a tree that's right-deep instead of
left-deep.  The individual list cells (CellNodeListObj) just have
Lisp-style car and cdr pointers, while HeadNodeListObj is the top of
the list and remembers the current first and last cells.

As for the changes in primitive.cxx, assuming that I have wrapped my
head around the usage of ELObjDynamicRoot correctly, the original
coding is actually wrong because it invokes PairNodeListObj() without
protecting the "tem" pointer.  If that were to point to a
freshly-created, not-otherwise-referenced object, then if any garbage
collection occurred during PairNodeListObj()'s memory allocation request
you'd have a problem.  I suppose this bug has not been seen because
no version of asNodeList() that is actually used here returns a freshly
created object, but it sure seems like an accident waiting to happen.
So I modified the code to be sure that both nl and tem are
ELObjDynamicRoot-protected during the nodeListAppend call.

            regards, tom lane

Re: Switching to XML

From

Alvaro Herrera

Date:

22 December 2006, 12:56:05

Guillaume Lelarge wrote:

> The last PDF available on the website has some bad issues. Just take a
> look at table 8-13 (page 805), table 8-18 (page 113). One more time, I
> don't say you can't fix this in SGML (and I think we already talk of
> this some time ago). But *I* won't do it. And I think some people are
> willing to work on this but they also want to have XML documentation.

I checked the PDF available on
http://www.postgresql.org/files/documentation/pdf/8.2/postgresql-8.2-A4.pdf
and it does have the issues Guillaume mentions: table 8-13 (on page 105)
is certainly mangled (columns too wide), and table 8-18 is mangled too
(a row's content is too wide for the columns, so the text on one column
overlaps the text on the next).  This problem can also be seen on other
tables, for example 9-5 and 9-6.

I printed one page and it looks just as silly, so it's not just a
display problem.

Also, the US version is lacking the index -- I suppose it was built
without the complete index.sgml file.

--
Alvaro Herrera                                http://www.CommandPrompt.com/
PostgreSQL Replication, Consulting, Custom Development, 24x7 support

Re: Switching to XML

From

Guillaume Lelarge

Date:

22 December 2006, 13:09:37

Alvaro Herrera a ecrit le 22/12/2006 17:54:
> Guillaume Lelarge wrote:
>
>> The last PDF available on the website has some bad issues. Just take a
>> look at table 8-13 (page 805), table 8-18 (page 113). One more time, I
>> don't say you can't fix this in SGML (and I think we already talk of
>> this some time ago). But *I* won't do it. And I think some people are
>> willing to work on this but they also want to have XML documentation.
>
> I checked the PDF available on
> http://www.postgresql.org/files/documentation/pdf/8.2/postgresql-8.2-A4.pdf
> and it does have the issues Guillaume mentions: table 8-13 (on page 105)
> is certainly mangled (columns too wide), and table 8-18 is mangled too
> (a row's content is too wide for the columns, so the text on one column
> overlaps the text on the next).  This problem can also be seen on other
> tables, for example 9-5 and 9-6.
>

There's a way to fix this : adding colspec's tags for each column which
need a specific col width.

For example :
   <colspec colnum="1" colwidth="1.3*"/>

> I printed one page and it looks just as silly, so it's not just a
> display problem.
>
> Also, the US version is lacking the index -- I suppose it was built
> without the complete index.sgml file.
>

Regards.


--
Guillaume.