Thread: Questions about pg_dump file

Questions about pg_dump file

From
John Browne
Date:
I had a-couple of questions regarding the text formatting in the pg_dump file.

First, when dumping my database, I seem to be getting inconsistent
end-of-line characters.  Some lines end with the *NIX-style \n and
some end with Windows-style \r\n.  Now, I have created this database
over about a-year or so and I have tested various clients (psql,
pgadminIII, etc) during that time.  I'm suspecting they are coming
from one of the Windows clients, but shouldn't the \r\n end-of-line
characters be converted to \n when a dump is created?  Is this the
expected behavior?  I know the output of pg_dump could be piped
through perl or sed or something to strip/convert them.  I just didn't
know if there was an easier way.

Second question...   I have noticed that a pg_dump file keeps the text
formatting, ie, tabs, spaces, etc. for any user-defined functions.
The function looks like it should, and is readable, in the pg_dump
file, with all of it's formatting.  A view, however, ends up all on
the same line.  This is fine for simple views, but for more
complicated view definitions it can be difficult to read.  Is there a
way to get postgres/pg_dump to keep it's text formatting (tabs,
spaces, etc) for the view definitions like it does for functions?  Or
is this even feasable?

Thanks for the help

JB

Re: Questions about pg_dump file

From
Bruce Momjian
Date:
John Browne wrote:
> I had a-couple of questions regarding the text formatting in the pg_dump file.
>
> First, when dumping my database, I seem to be getting inconsistent
> end-of-line characters.  Some lines end with the *NIX-style \n and
> some end with Windows-style \r\n.  Now, I have created this database
> over about a-year or so and I have tested various clients (psql,
> pgadminIII, etc) during that time.  I'm suspecting they are coming
> from one of the Windows clients, but shouldn't the \r\n end-of-line
> characters be converted to \n when a dump is created?  Is this the
> expected behavior?  I know the output of pg_dump could be piped
> through perl or sed or something to strip/convert them.  I just didn't
> know if there was an easier way.

We should handle \n or \r\n just fine, though there is a fix in 7.4.5
for that.

> Second question...   I have noticed that a pg_dump file keeps the text
> formatting, ie, tabs, spaces, etc. for any user-defined functions.
> The function looks like it should, and is readable, in the pg_dump
> file, with all of it's formatting.  A view, however, ends up all on
> the same line.  This is fine for simple views, but for more
> complicated view definitions it can be difficult to read.  Is there a
> way to get postgres/pg_dump to keep it's text formatting (tabs,
> spaces, etc) for the view definitions like it does for functions?  Or
> is this even feasable?

This is improved in 8.0beta1.

--
  Bruce Momjian                        |  http://candle.pha.pa.us
  pgman@candle.pha.pa.us               |  (610) 359-1001
  +  If your life is a hard drive,     |  13 Roberts Road
  +  Christ can be your backup.        |  Newtown Square, Pennsylvania 19073

Re: Questions about pg_dump file

From
John Browne
Date:
On Wed, 25 Aug 2004 23:23:23 -0400 (EDT), Bruce Momjian
<pgman@candle.pha.pa.us> wrote:
> John Browne wrote:
> > I had a-couple of questions regarding the text formatting in the pg_dump file.
> >
> > First, when dumping my database, I seem to be getting inconsistent
> > end-of-line characters.  Some lines end with the *NIX-style \n and
> > some end with Windows-style \r\n.  Now, I have created this database
> > over about a-year or so and I have tested various clients (psql,
> > pgadminIII, etc) during that time.  I'm suspecting they are coming
> > from one of the Windows clients, but shouldn't the \r\n end-of-line
> > characters be converted to \n when a dump is created?  Is this the
> > expected behavior?  I know the output of pg_dump could be piped
> > through perl or sed or something to strip/convert them.  I just didn't
> > know if there was an easier way.
>
> We should handle \n or \r\n just fine, though there is a fix in 7.4.5
> for that.

Oh, pg_dump handles it just fine, but the problem is you end up with a
mixture of different EOL characters in the dump file.  Shouldn't
pg_dump create the dump file with consistent \n *NIX style EOL
characters throughout the entire file, instead of carrying over the
\r\n EOL characters from user-defined function definitions, views,
etc.?

The only reason I stumbled across the fact there were \r\n characters
in the dump file is because I started versioning my postgres schema in
my Subversion repository. :-)  Just thought it was odd the \r\n
characters were there..

Re: Questions about pg_dump file

From
Tom Lane
Date:
John Browne <jkbrowne@gmail.com> writes:
> Oh, pg_dump handles it just fine, but the problem is you end up with a
> mixture of different EOL characters in the dump file.  Shouldn't
> pg_dump create the dump file with consistent \n *NIX style EOL
> characters throughout the entire file, instead of carrying over the
> \r\n EOL characters from user-defined function definitions, views,
> etc.?

No, I don't think so.  The job of pg_dump is to reproduce what you have,
not to do cleanup for you.

            regards, tom lane

Re: Questions about pg_dump file

From
Bruce Momjian
Date:
John Browne wrote:
> On Wed, 25 Aug 2004 23:23:23 -0400 (EDT), Bruce Momjian
> <pgman@candle.pha.pa.us> wrote:
> > John Browne wrote:
> > > I had a-couple of questions regarding the text formatting in the pg_dump file.
> > >
> > > First, when dumping my database, I seem to be getting inconsistent
> > > end-of-line characters.  Some lines end with the *NIX-style \n and
> > > some end with Windows-style \r\n.  Now, I have created this database
> > > over about a-year or so and I have tested various clients (psql,
> > > pgadminIII, etc) during that time.  I'm suspecting they are coming
> > > from one of the Windows clients, but shouldn't the \r\n end-of-line
> > > characters be converted to \n when a dump is created?  Is this the
> > > expected behavior?  I know the output of pg_dump could be piped
> > > through perl or sed or something to strip/convert them.  I just didn't
> > > know if there was an easier way.
> >
> > We should handle \n or \r\n just fine, though there is a fix in 7.4.5
> > for that.
>
> Oh, pg_dump handles it just fine, but the problem is you end up with a
> mixture of different EOL characters in the dump file.  Shouldn't
> pg_dump create the dump file with consistent \n *NIX style EOL
> characters throughout the entire file, instead of carrying over the
> \r\n EOL characters from user-defined function definitions, views,
> etc.?

Yes, I see your point  If I create a function like this:

    test=> CREATE FUNCTION test () returns int4 AS 'SELECT 1;\r\nSELECT 2;'
    LANGUAGE 'SQL';

the dump has:

    --
    -- Name: test(); Type: FUNCTION; Schema: public; Owner: postgres
    --

    CREATE FUNCTION test() RETURNS integer
        AS $$SELECT 1;
    SELECT 2;$$
        LANGUAGE sql;

where there is \r after 'SELECT 1'.  While COPY is consistent in
escaping \r or \n that appear as data, it does not do anything to modify
strings used by function bodies.  This happens because it is a string
and we just report its exact value.  This does bring up the issue that
someone converting pg_dump output from one EOL format to another would
modify the function string.   I can't see a big downside to that,
though.

> The only reason I stumbled across the fact there were \r\n characters
> in the dump file is because I started versioning my postgres schema in
> my Subversion repository. :-)  Just thought it was odd the \r\n
> characters were there..

You suggested we add an option to COPY to output in various OS-specific
EOL formats.  Seems we would have to support MSWIN, Unix, and old-Mac.
That seems too much.  I think we are better off outputing in a native
format and allow external tools to change the EOL format.


--
  Bruce Momjian                        |  http://candle.pha.pa.us
  pgman@candle.pha.pa.us               |  (610) 359-1001
  +  If your life is a hard drive,     |  13 Roberts Road
  +  Christ can be your backup.        |  Newtown Square, Pennsylvania 19073