Thread: Cognitive dissonance

Cognitive dissonance

From
John Gage
Date:
Unix is a text-based operating system with unbelievably helpful text
manipulation tools.

Postgres is a creature of Unix which happens to have unbelievable text
searching and manipulation tools.

Yet, the only one file edition of the Postgres documentation is
in...pdf format.  Huh?

I know.  I know.  I have already brought this up.  And various ways of
creating a one file text edition of the documentation have been
proposed to me.  I know.

But either I am a visitor from the Crab Nebula, or there is someone
else out there who would like to have a text file of the entire
documentation.

Two examples from other applications.

I use Vim.  Vim's documentation is as easy to access as any
documentation on earth...as long as you know exactly what you are
looking for.  Otherwise, it is a tremendous pain.

I also use the National Library of Medicine's MeSH subject headings.
25,000 descriptors with definitions, synonyms and a lot of other
things.  They give it to you in single files either as text, xml, or
other ways.  Big files.  Hundreds of megabytes.  That makes it so that
you can do just about anything with it you want.  It is one of the
seven wonders of the world.

I do suggest that a plain text file of the entire documentation be
made part of the documentation armamentarium.

Respectfully,

John Gage



Re: Cognitive dissonance

From
Lew
Date:
John Gage wrote:
> I also use the National Library of Medicine's MeSH subject headings.
> 25,000 descriptors with definitions, synonyms and a lot of other things.
> They give it to you in single files either as text, xml, or other ways.
> Big files. Hundreds of megabytes. That makes it so that you can do just
> about anything with it you want. It is one of the seven wonders of the
> world.
>
> I do suggest that a plain text file of the entire documentation be made
> part of the documentation armamentarium.

 From <http://www.postgresql.org/docs/manuals/>:
"The DocBook SGML source for the manuals is available as part of the
PostgreSQL source download available in the FTP area."

--
Lew

Re: Cognitive dissonance

From
Peter Hunsberger
Date:
On Tue, Jun 8, 2010 at 4:04 AM, John Gage <jsmgage@numericable.fr> wrote:
> Unix is a text-based operating system with unbelievably helpful text
> manipulation tools.
>
> Postgres is a creature of Unix which happens to have unbelievable text
> searching and manipulation tools.
>
> Yet, the only one file edition of the Postgres documentation is in...pdf
> format.  Huh?
>

I suppose the next thing you'll be suggesting is that, because
Postgres is a database, the documentation should be stored as some
form of searchable table within the database itself....?

<runs and hides/>

--
Peter Hunsberger

Re: Cognitive dissonance

From
Justin Graf
Date:
On 6/8/2010 9:23 AM, Peter Hunsberger wrote:
> On Tue, Jun 8, 2010 at 4:04 AM, John Gage<jsmgage@numericable.fr>  wrote:
>
>> Unix is a text-based operating system with unbelievably helpful text
>> manipulation tools.
>>
>> Postgres is a creature of Unix which happens to have unbelievable text
>> searching and manipulation tools.
>>
>> Yet, the only one file edition of the Postgres documentation is in...pdf
>> format.  Huh?
>>
>>
> I suppose the next thing you'll be suggesting is that, because
> Postgres is a database, the documentation should be stored as some
> form of searchable table within the database itself....?
>
> <runs and hides/>
>
>

Its also available in chm  windows help file format.  Which i find allot
more useful
http://www.postgresql.org/docs/manuals/
you could print chm to a text file.

also it not hard to dump a PDF document into a text file.




All legitimate Magwerks Corporation quotations are sent in a .PDF file attachment with a unique ID number generated by
ourproprietary quotation system. Quotations received via any other form of communication will not be honored. 

CONFIDENTIALITY NOTICE: This e-mail, including attachments, may contain legally privileged, confidential or other
informationproprietary to Magwerks Corporation and is intended solely for the use of the individual to whom it
addresses.If the reader of this e-mail is not the intended recipient or authorized agent, the reader is hereby notified
thatany unauthorized viewing, dissemination, distribution or copying of this e-mail is strictly prohibited. If you have
receivedthis e-mail in error, please notify the sender by replying to this message and destroy all occurrences of this
e-mailimmediately. 
Thank you.

Attachment

Re: Cognitive dissonance

From
Stephen Frost
Date:
* John Gage (jsmgage@numericable.fr) wrote:
> But either I am a visitor from the Crab Nebula, or there is someone else
> out there who would like to have a text file of the entire
> documentation.

Soo..  there are quite a few man pages, and in-psql's help is also
pretty nice (\h <command> and \?).  That's certainly what I typically
use.  I admit that we don't include the full command description in the
\h (just the syntax), but that's still extremely useful.

Would a \h+ that gave you the text from the web-page be useful..?  That,
plus the various man pages, would cover an awful lot of what's in SGML..

    Thanks,

        Stephen

Attachment

Re: Cognitive dissonance

From
John Gage
Date:
Thank you all for your suggestions.  Thank you very much.

John


1) I suppose the next thing you'll be suggesting is that, because
Postgres is a database, the documentation should be stored as some
form of searchable table within the database itself....?

<runs and hides/>

------Well, that is exactly what I have done with the MeSH subject
headings.  And it works like a charm.


2) Its also available in chm  windows help file format.  Which i find
allot
more useful
http://www.postgresql.org/docs/manuals/
you could print chm to a text file.

------I'll have to boot over to XP, ugh.  Will do.


3)  also it not hard to dump a PDF document into a text file.

------I would print out what the dump looks like, but this is a family
program


4) Would a \h+ that gave you the text from the web-page be useful..?
That,
plus the various man pages, would cover an awful lot of what's in SGML..

 From <http://www.postgresql.org/docs/manuals/>:
"The DocBook SGML source for the manuals is available as part of the
PostgreSQL source download available in the FTP area."


-----I'm headed there.  It's just that given the incredibly good
documentation and the fact that it's available in just about every
format except a text file, I was sort of hoping for a policy change on
the part of the powers that be.





Re: Cognitive dissonance

From
Chris Browne
Date:
justin@magwerks.com (Justin Graf) writes:
> Its also available in chm  windows help file format.  Which i find allot
> more useful
> http://www.postgresql.org/docs/manuals/
> you could print chm to a text file.
>
> also it not hard to dump a PDF document into a text file.

I wish I could find a converter that would generate one of the common
eBook formats (epub, mobi).

There do exist CHM readers on mobile platforms such as Android, but
they're much clumsier to work with than the rather more heavily used
eBook readers.

I have poked around for conversions; nothing particularly suitable has
emerged :-(.
--
select 'cbbrowne' || '@' || 'cbbrowne.com';
http://cbbrowne.com/info/internet.html
"MS  apparently now  has a  team dedicated  to tracking  problems with
Linux  and publicizing them.   I guess  eventually they'll  figure out
this back fires... ;)" -- William Burrow <aa126@DELETE.fan.nb.ca>

Re: Cognitive dissonance

From
Justin Graf
Date:
***SNIP***
> 2) Its also available in chm  windows help file format.  Which i find
> allot
> more useful
> http://www.postgresql.org/docs/manuals/
> you could print chm to a text file.
>
> ------I'll have to boot over to XP, ugh.  Will do.
There are linux chm readers

http://www.linux.com/news/software/applications/8209-chm-viewers-for-linux

and one for firefox
https://addons.mozilla.org/en-US/firefox/addon/3235/


All legitimate Magwerks Corporation quotations are sent in a .PDF file attachment with a unique ID number generated by
ourproprietary quotation system. Quotations received via any other form of communication will not be honored. 

CONFIDENTIALITY NOTICE: This e-mail, including attachments, may contain legally privileged, confidential or other
informationproprietary to Magwerks Corporation and is intended solely for the use of the individual to whom it
addresses.If the reader of this e-mail is not the intended recipient or authorized agent, the reader is hereby notified
thatany unauthorized viewing, dissemination, distribution or copying of this e-mail is strictly prohibited. If you have
receivedthis e-mail in error, please notify the sender by replying to this message and destroy all occurrences of this
e-mailimmediately. 
Thank you.

Attachment

Re: Cognitive dissonance

From
John R Pierce
Date:
Justin Graf wrote:
> There are linux chm readers
>
...


Note that even Microsoft deprecated CHM back in 2003 after it was
realized it was full of potential security exploits that couldn't
readily be abated.



Re: Cognitive dissonance

From
Josh Kupershmidt
Date:
On Tue, Jun 8, 2010 at 5:04 AM, John Gage <jsmgage@numericable.fr> wrote:
> I do suggest that a plain text file of the entire documentation be made part
> of the documentation armamentarium.

Not that I see a whole lot of utility in this endeavor, but it's
possible to do a decent PDF to plain text conversion. I tried some of
the online tools that do this and found <http://www.pdftextonline.com>
to do the best job with the Postgres manual. I didn't bother trying
any client-side applications which do the same job.

Attached is a short snippet of a text export of the PDF manual.

Josh

Attachment

Re: Cognitive dissonance

From
John Gage
Date:
1) On a list that howls with complaints when posts are in html, it is
surprising that there is resistance to the idea of documentation in
plain text.

2) Posters are correctly referred to the documentation as frequently
as possible.  In fact, very frequently.  The frequency might decrease
if the documentation were in plain text.  It is easier to search a
single plain text file than any other source, except perhaps the
database itself.

3) Postgres is getting pushed off the map at the low end by MySQL, now
owned by Oracle.    If Postgres ceased to exist, Ellison would be
thrilled.  I chose A2 Hosting (with whom I am very happy) for my
website because they support Postgres.  I'm writing cgi scripts in
perl.  I had to install the postgres driver for dbi.  It was not pre-
installed.  There are about four buttons for MySQL on the cPanel and
two farther over on the right for Postgres.

An anecdote.  I discovered the tsvector functionality a while back.  I
have used it to create indices for my text files and several other
tasks.  I recently was re-looking at my files and saw
"tsvector::text".  I had forgotten that the double colon is one way to
cast a type.  Double colon is not in the html index of the
documentation.  I found it by searching my plain text version of the
pdf file.  In my opinion, the html documentation is useful for reading
it like a novel or referencing it in these lists.


On Jun 8, 2010, at 9:56 PM, Josh Kupershmidt wrote:

> Not that I see a whole lot of utility in this endeavor


Re: Cognitive dissonance

From
Greg Smith
Date:
John Gage wrote:
> Posters are correctly referred to the documentation as frequently as
> possible.  In fact, very frequently.  The frequency might decrease if
> the documentation were in plain text.  It is easier to search a single
> plain text file than any other source, except perhaps the database
> itself.

In reality searches are being done on the web, which combines the HTML
version of the official documentation with blog posts, presentation
materials, the wiki, and similar other resources.  This is why I don't
actually care about a text version of the docs; I've just gotten used to
using Google to search the PostgreSQL documentation.  The occasional
time when I know I just want to search the manual instead, I can search
the PDF version.  Neither of those are great solutions, but they're good
enough that it's not worth fighting to build a text version over as I
see it.  I'd use it if it were around, but there's little motivation for
most of us to work on it.

> Postgres is getting pushed off the map at the low end by MySQL, now
> owned by Oracle.

The dynamics are much more complicated than that.  Big MySQL sites are
switching to NoSQL; medium sized MySQL sites are switching to PostgreSQL
to get rid of scaling and reliability issues (I personally have been
seeing a lot of this from Rails installs lately); small to medium size
Oracle shops are switching to PostgreSQL to lower licensing costs.

The idea that plain-text documentation for the database would be a
significant driver in any of these trends would be greatly exaggerating
the significance of a technical detail important to a pretty small
number of people.  On my personal list of "things that could be improved
in the documentation", good plain text format is there, but there's a
whole lot of things above it.

--
Greg Smith  2ndQuadrant US  Baltimore, MD
PostgreSQL Training, Services and Support
greg@2ndQuadrant.com   www.2ndQuadrant.us


Re: Cognitive dissonance

From
Brian Modra
Date:
On 09/06/2010, John Gage <jsmgage@numericable.fr> wrote:
> 1) On a list that howls with complaints when posts are in html, it is
> surprising that there is resistance to the idea of documentation in
> plain text.
>
> 2) Posters are correctly referred to the documentation as frequently
> as possible.  In fact, very frequently.  The frequency might decrease
> if the documentation were in plain text.  It is easier to search a
> single plain text file than any other source, except perhaps the
> database itself.
>
> 3) Postgres is getting pushed off the map at the low end by MySQL, now
> owned by Oracle.    If Postgres ceased to exist, Ellison would be
> thrilled.  I chose A2 Hosting (with whom I am very happy) for my
> website because they support Postgres.  I'm writing cgi scripts in
> perl.  I had to install the postgres driver for dbi.  It was not pre-
> installed.  There are about four buttons for MySQL on the cPanel and
> two farther over on the right for Postgres.
>
> An anecdote.  I discovered the tsvector functionality a while back.  I
> have used it to create indices for my text files and several other
> tasks.  I recently was re-looking at my files and saw
> "tsvector::text".  I had forgotten that the double colon is one way to
> cast a type.  Double colon is not in the html index of the
> documentation.  I found it by searching my plain text version of the
> pdf file.  In my opinion, the html documentation is useful for reading
> it like a novel or referencing it in these lists.
>
>
> On Jun 8, 2010, at 9:56 PM, Josh Kupershmidt wrote:
>
>> Not that I see a whole lot of utility in this endeavor

Personally I like to use html docs, and it would be good if the
documentation were downloadable from the postgresql website in other
formats, for convenience...

But, what I use is this, which works pretty well:

(e.g. to get the 8.1 dosc)

mkdir postgresql
cd postgresql
wget -r -nH -l 10 -k -np
http://www.postgresql.org/docs/8.1/interactive/index.html

... then after it all downloads:

open the file docs/8.1/interactive/index.html
in your web browser.

e.g.
links docs/8.1/interactive/index.html


HTML is "text", so you can search using grep e.g.
grep -r "ALTER TABLE .* ADD COLUMN" docs/8.1

>
>
> --
> Sent via pgsql-general mailing list (pgsql-general@postgresql.org)
> To make changes to your subscription:
> http://www.postgresql.org/mailpref/pgsql-general
>


--
Brian Modra   Land line: +27 23 5411 462
Mobile: +27 79 69 77 082
5 Jan Louw Str, Prince Albert, 6930
Postal: P.O. Box 2, Prince Albert 6930
South Africa
http://www.zwartberg.com/
Fax: +27865510467

Re: Cognitive dissonance

From
Torsten Zühlsdorff
Date:
Brian Modra schrieb:

> Personally I like to use html docs, and it would be good if the
> documentation were downloadable from the postgresql website in other
> formats, for convenience...
>
> But, what I use is this, which works pretty well:
>
> (e.g. to get the 8.1 dosc)
>
> mkdir postgresql
> cd postgresql
> wget -r -nH -l 10 -k -np
> http://www.postgresql.org/docs/8.1/interactive/index.html
>
> ... then after it all downloads:
>
> open the file docs/8.1/interactive/index.html
> in your web browser.
>
> e.g.
> links docs/8.1/interactive/index.html
>
>
> HTML is "text", so you can search using grep e.g.
> grep -r "ALTER TABLE .* ADD COLUMN" docs/8.1

Thats the way i do too. A huge pdf is often not very helpful. In my
personal case i programm often in a train, using my laptop. Searching a
PDF with more than 1.000 pages really hits my battery. With html-files i
could preselect the items to search.
Also it's possible to import the html-files in a postgres-db and using
fulltext-search. ;)

Greetings,
Torsten
--
http://www.dddbl.de - ein Datenbank-Layer, der die Arbeit mit 8
verschiedenen Datenbanksystemen abstrahiert,
Queries von Applikationen trennt und automatisch die Query-Ergebnisse
auswerten kann.

Re: Cognitive dissonance

From
Dave Coventry
Date:
My tupp'th:

Formatted text, whether PDF, HTML or (heaven forbid!) Word Documents,
is easier to read than unformatted plain text, and those of us without
the OP's very admirable proficiency in vi remain at the mercy of the
various readers and their associated search functions.

However, I sure that it's not too arduous a task to extract the text
in these documents and strip them of their formatting?

Or am I missing something?

Re: Cognitive dissonance

From
Dimitri Fontaine
Date:
Dave Coventry <dgcoventry@gmail.com> writes:
> Formatted text, whether PDF, HTML or (heaven forbid!) Word Documents,
> is easier to read than unformatted plain text, and those of us without
> the OP's very admirable proficiency in vi remain at the mercy of the
> various readers and their associated search functions.
>
> However, I sure that it's not too arduous a task to extract the text
> in these documents and strip them of their formatting?
>
> Or am I missing something?

Info documentation format. Text based, super user aware, easy to
browse and search, has an index.

You can even produce postgres.info today, it's just not optimised to be
very friendly, it's missing mainly convenient table support and
index.

Regards,
--
dim

Re: Cognitive dissonance

From
Alvaro Herrera
Date:
Excerpts from John Gage's message of mié jun 09 01:28:54 -0400 2010:

> I recently was re-looking at my files and saw
> "tsvector::text".  I had forgotten that the double colon is one way to
> cast a type.  Double colon is not in the html index of the
> documentation.

I just added an index entry for ::, thanks for pointing out that it was
missing.

If you notice other missing index entries, do not hesitate to point it
out in this mailing list or pgsql-docs.

--
Álvaro Herrera <alvherre@commandprompt.com>
The PostgreSQL Company - Command Prompt, Inc.
PostgreSQL Replication, Consulting, Custom Development, 24x7 support

Re: Cognitive dissonance

From
Lew
Date:
Brian Modra wrote:
> Personally I like to use html docs, and it would be good if the
> documentation were downloadable from the postgresql website in other
> formats, for convenience...

Good thing it is, then, albeit not in the most convenient format, i.e.,
DocBook.  But then, from there you can generate pretty much any format you
want, right?

--
Lew

Re: Cognitive dissonance

From
Peter Eisentraut
Date:
On tis, 2010-06-08 at 11:04 +0200, John Gage wrote:
>
> Yet, the only one file edition of the Postgres documentation is
> in...pdf format.  Huh?
>
> I know.  I know.  I have already brought this up.  And various ways
> of
> creating a one file text edition of the documentation have been
> proposed to me.  I know.
>
> But either I am a visitor from the Crab Nebula, or there is someone
> else out there who would like to have a text file of the entire
> documentation.

As I said back then, doing this is straightforward, but we kind of need
more than one user who asks for it before we make it part of a regular
service, which comes with maintenance costs.


Re: Cognitive dissonance

From
Alvaro Herrera
Date:
Excerpts from Peter Eisentraut's message of jue jun 10 02:50:14 -0400 2010:
> On tis, 2010-06-08 at 11:04 +0200, John Gage wrote:
> >
> > Yet, the only one file edition of the Postgres documentation is
> > in...pdf format.  Huh?
> >
> > I know.  I know.  I have already brought this up.  And various ways
> > of
> > creating a one file text edition of the documentation have been
> > proposed to me.  I know.
> >
> > But either I am a visitor from the Crab Nebula, or there is someone
> > else out there who would like to have a text file of the entire
> > documentation.
>
> As I said back then, doing this is straightforward, but we kind of need
> more than one user who asks for it before we make it part of a regular
> service, which comes with maintenance costs.

Hey, count me as another interested person in a single-file plain-text
doc output format.

--
Álvaro Herrera <alvherre@commandprompt.com>
The PostgreSQL Company - Command Prompt, Inc.
PostgreSQL Replication, Consulting, Custom Development, 24x7 support

Re: Cognitive dissonance

From
Tom Lane
Date:
Alvaro Herrera <alvherre@commandprompt.com> writes:
> Excerpts from Peter Eisentraut's message of jue jun 10 02:50:14 -0400 2010:
>> As I said back then, doing this is straightforward, but we kind of need
>> more than one user who asks for it before we make it part of a regular
>> service, which comes with maintenance costs.

> Hey, count me as another interested person in a single-file plain-text
> doc output format.

Well, there are two separate things here:

* providing a Makefile target to build plain-text output.

* shipping prebuilt plain text docs in standard distributions.

I am for #1, not so much for #2, mainly on the grounds of size.  But
given #1 it would be possible for packagers to make their own choices
about whether to include plain-text docs.

            regards, tom lane

Re: Cognitive dissonance

From
Leif Biberg Kristensen
Date:
On Thursday 10. June 2010 17.24.00 Tom Lane wrote:
> Alvaro Herrera <alvherre@commandprompt.com> writes:
> > Excerpts from Peter Eisentraut's message of jue jun 10 02:50:14 -0400
2010:
> >> As I said back then, doing this is straightforward, but we kind of need
> >> more than one user who asks for it before we make it part of a regular
> >> service, which comes with maintenance costs.
>
> > Hey, count me as another interested person in a single-file plain-text
> > doc output format.
>
> Well, there are two separate things here:
>
> * providing a Makefile target to build plain-text output.
>
> * shipping prebuilt plain text docs in standard distributions.
>
> I am for #1, not so much for #2, mainly on the grounds of size.  But
> given #1 it would be possible for packagers to make their own choices
> about whether to include plain-text docs.

Wouldn't it suffice to make it downloadable, like the pdf doc?

regards,
--
Leif Biberg Kristensen
http://solumslekt.org/blog/

Re: Cognitive dissonance

From
John Gage
Date:
Like all visitors from the Crab Nebula (except our leaders who are
genetically separate) I qualify as a novice when it comes to
Postgres.  What is more, the people (humans, that is) who need the
documentation the most are those who, well, need the documentation the
most.

Hence, if this were to be made available, it would be great if it was
novice speed.

Thanks everyone for even contemplating it.

John


>> Well, there are two separate things here:
>>
>> * providing a Makefile target to build plain-text output.
>>
>> * shipping prebuilt plain text docs in standard distributions.
>>
>> I am for #1, not so much for #2, mainly on the grounds of size.  But
>> given #1 it would be possible for packagers to make their own choices
>> about whether to include plain-text docs.
>
> Wouldn't it suffice to make it downloadable, like the pdf doc?
>


Re: Cognitive dissonance

From
Robert Gravsjö
Date:

Leif Biberg Kristensen skrev 2010-06-10 17.33:
> On Thursday 10. June 2010 17.24.00 Tom Lane wrote:
>> Alvaro Herrera<alvherre@commandprompt.com>  writes:
>>> Excerpts from Peter Eisentraut's message of jue jun 10 02:50:14 -0400
> 2010:
>>>> As I said back then, doing this is straightforward, but we kind of need
>>>> more than one user who asks for it before we make it part of a regular
>>>> service, which comes with maintenance costs.
>>
>>> Hey, count me as another interested person in a single-file plain-text
>>> doc output format.
>>
>> Well, there are two separate things here:
>>
>> * providing a Makefile target to build plain-text output.
>>
>> * shipping prebuilt plain text docs in standard distributions.
>>
>> I am for #1, not so much for #2, mainly on the grounds of size.  But
>> given #1 it would be possible for packagers to make their own choices
>> about whether to include plain-text docs.
>
> Wouldn't it suffice to make it downloadable, like the pdf doc?

And/or make the HTML version downloadable side by side with the PDF.

There are good reasons for wanting access to the complete document when
being offline. PDF is not such a bad format but it do have some
limitations as have been previously mentioned.

As for building the docs I don't think everyone, not even all
developers, has the tool chain installed (or even wants to).

Regards,
roppert

>
> regards,

Re: Cognitive dissonance

From
Bruce Momjian
Date:
Robert Gravsjö wrote:
> >> I am for #1, not so much for #2, mainly on the grounds of size.  But
> >> given #1 it would be possible for packagers to make their own choices
> >> about whether to include plain-text docs.
> >
> > Wouldn't it suffice to make it downloadable, like the pdf doc?
>
> And/or make the HTML version downloadable side by side with the PDF.

That might be easy to do.  We already build the HTML, and requiring
people to recursively use wget is not user-friendly.

--
  Bruce Momjian  <bruce@momjian.us>        http://momjian.us
  EnterpriseDB                             http://enterprisedb.com

  + None of us is going to be here forever. +

Re: Cognitive dissonance

From
John Gage
Date:
A one file html version would be a godsend.


On Jun 12, 2010, at 3:20 AM, Bruce Momjian wrote:

> Robert Gravsjö wrote:
>>>> I am for #1, not so much for #2, mainly on the grounds of size.
>>>> But
>>>> given #1 it would be possible for packagers to make their own
>>>> choices
>>>> about whether to include plain-text docs.
>>>
>>> Wouldn't it suffice to make it downloadable, like the pdf doc?
>>
>> And/or make the HTML version downloadable side by side with the PDF.
>
> That might be easy to do.  We already build the HTML, and requiring
> people to recursively use wget is not user-friendly.
>
> --
>  Bruce Momjian  <bruce@momjian.us>        http://momjian.us
>  EnterpriseDB                             http://enterprisedb.com
>
>  + None of us is going to be here forever. +
>
> --
> Sent via pgsql-general mailing list (pgsql-general@postgresql.org)
> To make changes to your subscription:
> http://www.postgresql.org/mailpref/pgsql-general


Re: Cognitive dissonance

From
Peter Eisentraut
Date:
On lör, 2010-06-12 at 11:18 +0200, John Gage wrote:
> A one file html version would be a godsend.

I've committed a build target for that now.  Use 'make postgres.html' in
doc/src/sgml/.



Re: Cognitive dissonance

From
Tom Lane
Date:
Peter Eisentraut <peter_e@gmx.net> writes:
> On lör, 2010-06-12 at 11:18 +0200, John Gage wrote:
>> A one file html version would be a godsend.

> I've committed a build target for that now.  Use 'make postgres.html' in
> doc/src/sgml/.

Huh, is that actually worth anything?  How many browsers will open it
without crashing, or will navigate the page with decent performance
if they do manage to open it?

(Not that I object to providing this Make target.  But I thought the
discussion was about plain-text output.)

            regards, tom lane

Re: Cognitive dissonance

From
John Gage
Date:
It was.  But if the compromise is single file html, that is a vast
improvement over the current system imho.

What I want is the thing that is maximally amenable to being searched
conveniently using all the tools at our disposal especially regular
expressons.  The point has been made that Google is the best system to
search for Postgres documentation/knowledge/etc.  Frankly, I don't
necessarily agree with that, particularly for the novice.  The
documentation is where it is at, and it is the documentation that is
referenced the most in these posts.

But there are no dichotomies here.  It is not either or.  It is a
balance between what is easiest to produce and maintain and what is
most productive to use.

And the background for my request is my respect for the extraordinary
power and elegance of postgres.

John


On Jun 12, 2010, at 3:10 PM, Tom Lane wrote:

> Peter Eisentraut <peter_e@gmx.net> writes:
>> On lör, 2010-06-12 at 11:18 +0200, John Gage wrote:
>>> A one file html version would be a godsend.
>
>> I've committed a build target for that now.  Use 'make
>> postgres.html' in
>> doc/src/sgml/.
>
> Huh, is that actually worth anything?  How many browsers will open it
> without crashing, or will navigate the page with decent performance
> if they do manage to open it?
>
> (Not that I object to providing this Make target.  But I thought the
> discussion was about plain-text output.)
>
>             regards, tom lane
>
> --
> Sent via pgsql-general mailing list (pgsql-general@postgresql.org)
> To make changes to your subscription:
> http://www.postgresql.org/mailpref/pgsql-general


Re: Cognitive dissonance

From
Stephen Frost
Date:
* Tom Lane (tgl@sss.pgh.pa.us) wrote:
> Peter Eisentraut <peter_e@gmx.net> writes:
> > I've committed a build target for that now.  Use 'make postgres.html' in
> > doc/src/sgml/.
>
> Huh, is that actually worth anything?  How many browsers will open it
> without crashing, or will navigate the page with decent performance
> if they do manage to open it?

If it works with links, that'd probably work well for the use case
described...

    Stephen

Attachment

Re: Cognitive dissonance

From
Peter Eisentraut
Date:
On lör, 2010-06-12 at 09:10 -0400, Tom Lane wrote:
> Peter Eisentraut <peter_e@gmx.net> writes:
> > On lör, 2010-06-12 at 11:18 +0200, John Gage wrote:
> >> A one file html version would be a godsend.
>
> > I've committed a build target for that now.  Use 'make postgres.html' in
> > doc/src/sgml/.
>
> Huh, is that actually worth anything?  How many browsers will open it
> without crashing, or will navigate the page with decent performance
> if they do manage to open it?

Text output is generated by going through HTML.  I haven't figured out
the best way to do the second step yet.  We use lynx for INSTALL and
HISTORY, but the results for this big file aren't very clean.

Browsers seem to handle the file OK, btw.


Re: Cognitive dissonance

From
Bruce Momjian
Date:
Peter Eisentraut wrote:
> On l?r, 2010-06-12 at 09:10 -0400, Tom Lane wrote:
> > Peter Eisentraut <peter_e@gmx.net> writes:
> > > On l?r, 2010-06-12 at 11:18 +0200, John Gage wrote:
> > >> A one file html version would be a godsend.
> >
> > > I've committed a build target for that now.  Use 'make postgres.html' in
> > > doc/src/sgml/.
> >
> > Huh, is that actually worth anything?  How many browsers will open it
> > without crashing, or will navigate the page with decent performance
> > if they do manage to open it?
>
> Text output is generated by going through HTML.  I haven't figured out
> the best way to do the second step yet.  We use lynx for INSTALL and
> HISTORY, but the results for this big file aren't very clean.
>
> Browsers seem to handle the file OK, btw.

Well, I tried lynx and the output looked fine to me, so I applied the
attached patch to allow single-page text output.  You can see the HTML
and text file results here:

    http://momjian.us/expire/

The new rule name is postgres.txt.  The file size are:

    7,789,730  postgres.html
    5,155,672  postgres.txt

--
  Bruce Momjian  <bruce@momjian.us>        http://momjian.us
  EnterpriseDB                             http://enterprisedb.com

  + None of us is going to be here forever. +
Index: doc/src/sgml/Makefile
===================================================================
RCS file: /cvsroot/pgsql/doc/src/sgml/Makefile,v
retrieving revision 1.145
diff -c -c -r1.145 Makefile
*** doc/src/sgml/Makefile    12 Jun 2010 15:42:44 -0000    1.145
--- doc/src/sgml/Makefile    12 Jun 2010 15:57:31 -0000
***************
*** 108,113 ****
--- 108,117 ----
  postgres.html: postgres.sgml $(ALLSGML) stylesheet.dsl
      $(JADE.html.call) -V nochunks -V rootchunk -V '(define %root-filename% #f)' -V '(define use-output-dir #f)' -i
include-index$< 

+ # single-page text
+ postgres.txt: postgres.html
+     $(LYNX) -force_html -dump -nolist -stdin $< > $@
+
  HTML.index: postgres.sgml $(ALMOSTALLSGML) stylesheet.dsl
      @$(MKDIR_P) html
      $(JADE.html.call) -V html-index $<

Re: Cognitive dissonance

From
John Gage
Date:
UFB!  This was definitely worth the visit from the Nebula.

Thanks very, very much.

Sensational.

Thanks again,

John Gage


On Jun 12, 2010, at 6:01 PM, Bruce Momjian wrote:

http://momjian.us/expire/

The new rule name is postgres.txt.  The file size are:

7,789,730  postgres.html
5,155,672  postgres.txt

--
 Bruce Momjian  <bruce@momjian.us>        http://momjian.us
 EnterpriseDB                             http://enterprisedb.com


Re: Cognitive dissonance

From
Bruce Momjian
Date:
John Gage wrote:
> UFB!  This was definitely worth the visit from the Nebula.
>
> Thanks very, very much.
>
> Sensational.
>
> Thanks again,
>
> John Gage

We still have to decide how to make these accessible from our web site.

--
  Bruce Momjian  <bruce@momjian.us>        http://momjian.us
  EnterpriseDB                             http://enterprisedb.com

  + None of us is going to be here forever. +

Re: Cognitive dissonance

From
Tim Landscheidt
Date:
Bruce Momjian <bruce@momjian.us> wrote:

> [...]
> + # single-page text
> + postgres.txt: postgres.html
> +     $(LYNX) -force_html -dump -nolist -stdin $< > $@
                                          ^^^^^^
> +
> [...]

Isn't that unnecessary/wrong as the filename is supplied on
the command line?

Tim

Re: Cognitive dissonance

From
Bruce Momjian
Date:
Tim Landscheidt wrote:
> Bruce Momjian <bruce@momjian.us> wrote:
>
> > [...]
> > + # single-page text
> > + postgres.txt: postgres.html
> > +     $(LYNX) -force_html -dump -nolist -stdin $< > $@
>                                           ^^^^^^
> > +
> > [...]
>
> Isn't that unnecessary/wrong as the filename is supplied on
> the command line?

Ah, good catch. I wasn't able to test that because my lynx version
doesn't support -stdin.  Thanks, and updated.

--
  Bruce Momjian  <bruce@momjian.us>        http://momjian.us
  EnterpriseDB                             http://enterprisedb.com

  + None of us is going to be here forever. +

Re: Cognitive dissonance

From
Chris Browne
Date:
tgl@sss.pgh.pa.us (Tom Lane) writes:
> Peter Eisentraut <peter_e@gmx.net> writes:
>> On lör, 2010-06-12 at 11:18 +0200, John Gage wrote:
>>> A one file html version would be a godsend.
>
>> I've committed a build target for that now.  Use 'make postgres.html' in
>> doc/src/sgml/.
>
> Huh, is that actually worth anything?  How many browsers will open it
> without crashing, or will navigate the page with decent performance
> if they do manage to open it?
>
> (Not that I object to providing this Make target.  But I thought the
> discussion was about plain-text output.)

I expect that having one HTML file would make it reasonably easy to
use lynx/links/w3m [some text-based HTML browser] to transform HTML
into plain text.

I should think that it would also enable using analagous tools to
transform HTML into eBook formats like ePub and mobi, which are the
frequently-preferable-formats on the emerging category of "electronic
book" appliances.

I have browsed the CHM form of the docs using a CHM reader on my
phone, and found that less than wonderful, which had a lot to do with
the reader not being particularly great.  A format that plays well
with a decent reader may turn out pretty happily.
--
select 'cbbrowne' || '@' || 'cbbrowne.com';
http://cbbrowne.com/info/internet.html
"MS  apparently now  has a  team dedicated  to tracking  problems with
Linux  and publicizing them.   I guess  eventually they'll  figure out
this back fires... ;)" -- William Burrow <aa126@DELETE.fan.nb.ca>