Home > mailing lists

Re: Upcoming PG re-releases - Mailing list pgsql-hackers

From	Gregory Maxwell
Subject	Re: Upcoming PG re-releases
Date	December 8, 2005 18:54:37
Msg-id	e692861c0512081454u560e6cc2h41f09293d4e5f2d1@mail.gmail.com Whole thread Raw
In response to	Re: Upcoming PG re-releases (Bruce Momjian <pgman@candle.pha.pa.us>)
Responses	Re: Upcoming PG re-releases
List	pgsql-hackers

Tree view

On 12/8/05, Bruce Momjian <pgman@candle.pha.pa.us> wrote:
> >       A script which identifies non-utf-8 characters and provides some
> >       context, line numbers, etc, will greatly speed up the process of
> >       remedying the situation.
>
> I think the best we can do is the "iconv -c with the diff" idea, which
> is already in the release notes.  I suppose we could merge the iconv and
> diff into a single command, but I don't see a portable way to output the
> iconv output to stdout., /dev/stdin not being portable.

No, what is needed for people who care about fixing their data is a
loadable strip_invalid_utf8() that works in older versions.. then just
select * from bar where foo != strip_invalid_utf8(foo);  The function
would be useful in general, for example, if you have an application
which doesn't already have much utf8 logic, you want to use a text
field, and stripping is the behaviour you want. For example, lots of
simple web applications.

pgsql-hackers by date:

From: Simon Riggs
Date: 08 December 2005, 18:47:10
Subject: Re: [PATCHES] Inherited Constraints

From: Gavin Sherry
Date: 08 December 2005, 19:08:41
Subject: Another encoding issue

Re: Upcoming PG re-releases - Mailing list pgsql-hackers

Previous

Next