RE: 7.0.3 dumps aren't accessible via JDBC in 7.1 - Mailing list pgsql-bugs

From Rainer Mager
Subject RE: 7.0.3 dumps aren't accessible via JDBC in 7.1
Date
Msg-id NEBBJBCAFMMNIHGDLFKGKECIDKAA.rmager@vgkk.com
In response to Re: 7.0.3 dumps aren't accessible via JDBC in 7.1  (Peter T Mount <peter@retep.org.uk>)
List pgsql-bugs
Hi Peter and all,

    I may have described this poorly; let me try again.

1. We have a Unicode database that has a particular dash character in it
that gets dumped incorrectly. When dumped (from 7.0.x) the dash becomes the
character 0xAD but is not properly encoded in UTF-8 (at least my limited
knowledge of UTF-8 says so). My understanding is that all characters above
0x7F should be encoded as multi-byte sequences, but this particular character
is not encoded/escaped at all in the dump.

2. The given dump can be imported into 7.1 without any visible problem.
The character in question can be viewed via queries in psql. The only time
that there is a problem is when the character is accessed via JDBC. The byte
array returned from the backend to the JDBC driver is supposed to be in UTF-8
format but (I believe) is incorrectly formatted. The character in question
is still a lone 0xAD byte, which is not valid UTF-8. When the Java
UTF-8 to internal Unicode converter hits this character it dies and the
resulting string is truncated right before the character. I think there may
be a bug in Java, in that an InvalidEncoding exception isn't thrown, but
nonetheless the bytes aren't valid UTF-8.

3. My guess is that some part of Postgres' UTF-8 conversion routines is
wrong. I looked at the code but couldn't find the relevant parts.
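
To make points 1 and 2 concrete, here is a minimal Java sketch of the byte-level
claim. It is not taken from the JDBC driver or the backend. It assumes the
character in question is U+00AD (the code point that 0xAD denotes in Latin-1);
the report does not confirm this. The class and method names are hypothetical,
and it uses the current java.nio.charset API rather than anything from the JDK
of the time. It shows that U+00AD should appear in UTF-8 as the two bytes
0xC2 0xAD, and that a strict decoder rejects a lone 0xAD byte.

```java
import java.nio.ByteBuffer;
import java.nio.charset.CharacterCodingException;
import java.nio.charset.CodingErrorAction;
import java.nio.charset.StandardCharsets;

// Hypothetical helper class, for illustration only.
final class Utf8DashCheck {

    // Encode a string as UTF-8 and render the bytes as space-separated hex.
    static String utf8Hex(String s) {
        StringBuilder sb = new StringBuilder();
        for (byte b : s.getBytes(StandardCharsets.UTF_8)) {
            if (sb.length() > 0) {
                sb.append(' ');
            }
            sb.append(String.format("%02X", b & 0xFF));
        }
        return sb.toString();
    }

    // Return true only if the bytes form well-formed UTF-8.
    // REPORT makes the decoder throw instead of silently substituting.
    static boolean isValidUtf8(byte[] bytes) {
        try {
            StandardCharsets.UTF_8.newDecoder()
                    .onMalformedInput(CodingErrorAction.REPORT)
                    .onUnmappableCharacter(CodingErrorAction.REPORT)
                    .decode(ByteBuffer.wrap(bytes));
            return true;
        } catch (CharacterCodingException e) {
            return false;
        }
    }

    public static void main(String[] args) {
        // Assumption: the dash is U+00AD.
        System.out.println(utf8Hex("\u00AD"));   // prints "C2 AD"

        byte[] asDumped = {'a', (byte) 0xAD, 'b'};               // lone 0xAD
        byte[] asExpected = {'a', (byte) 0xC2, (byte) 0xAD, 'b'}; // proper UTF-8
        System.out.println(isValidUtf8(asDumped));    // prints "false"
        System.out.println(isValidUtf8(asExpected));  // prints "true"
    }
}
```

Running a check like isValidUtf8 over the raw dump file, or over the bytes
received from the backend, would show whether the data is already malformed
before the Java converter sees it.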


I hope this is clearer.


Thanks,

--Rainer


> -----Original Message-----
> From: Peter T Mount [mailto:peter@retep.org.uk]
> Sent: Friday, April 20, 2001 8:13 PM
> To: rmager@vgkk.com; pgsql-bugs@postgresql.org; pgsql-bugs@postgresql.org
> Cc: pgsql-bugs@postgresql.org
> Subject: Re: [BUGS] 7.0.3 dumps aren't accessible via JDBC in 7.1
>
>
> Hmmm, this sounds like either a backend issue, or something is
> misconfigured.
> Have you got unicode support enabled in the backend?
>
> Peter
>
> --
> Peter Mount peter@retep.org.uk
> PostgreSQL JDBC Driver: http://www.retep.org.uk/postgres/
> RetepPDF PDF library for Java: http://www.retep.org.uk/pdf/
