Re: sgml cleanup: unescaped '>' characters - Mailing list pgsql-docs

From Peter Eisentraut
Subject Re: sgml cleanup: unescaped '>' characters
Date
Msg-id 1314478125.2349.39.camel@vanquo.pezone.net
Whole thread Raw
In response to sgml cleanup: unescaped '>' characters  (Josh Kupershmidt <schmiddy@gmail.com>)
Responses Re: sgml cleanup: unescaped '>' characters
List pgsql-docs
On ons, 2011-08-24 at 23:28 -0400, Josh Kupershmidt wrote:
> I found myself rewriting the ./src/tools/find_gt_lt script in Perl
> this evening, since the existing script was quite broken (the main
> problem is it's not capable of understanding CDATA or sgml comment
> sections, and hence produces a bunch of noise).
>
> The rewritten version picked up a few stylistic inconsistencies in the
> SGML, such as:
>  * breaking the trailing '>' of an SGML marker across lines. AFAIK
> this is legal, but is a bit inconsistent and just confuses simplistic
> tools like find_gt_lt

The cases you show don't appear to be terribly useful, but I think on
occasion this can be necessary to work around some arcane whitespace
rules in SGML or XML.  (Just look at the generated HTML; it uses this
technique throughout.)

>  * using single quotes instead of double quotes to surround a node
> attribute, as in <orderedlist numeration='loweralpha'>

It would be better if the tool could handle that, because sometimes you
want to use single quotes if the value contains double quotes.

> as well as seemingly-invalid SGML, such as using '>' unescaped inside
> normal SGML entries.

Unescaped > is valid, AFAIK.



pgsql-docs by date:

Previous
From: Peter Eisentraut
Date:
Subject: Re: [GENERAL] documentation suggestion
Next
From: Josh Kupershmidt
Date:
Subject: Re: sgml cleanup: unescaped '>' characters