Home > mailing lists

Re: Duplicate Values or Not?! - Mailing list pgsql-general

From	Greg Stark
Subject	Re: Duplicate Values or Not?!
Date	September 17, 2005 02:37:06
Msg-id	87r7boqmq5.fsf@stark.xeocode.com Whole thread Raw
In response to	Duplicate Values or Not?! (John Seberg <johnseberg@yahoo.com>)
Responses	Re: Duplicate Values or Not?!
List	pgsql-general

Tree view

John Seberg <johnseberg@yahoo.com> writes:

> I recently tried to CREATE a UNIQUE INDEX and could
> not, due to duplicate values:
>
> CREATE UNIQUE INDEX usr_login ON usr (login);
>
> To try to find the offending row(s), I then executed
> the following:
>
> SELECT count(*), login FROM usr GROUP BY login ORDER
> BY 1 DESC;
>
> The GROUP BY didn't group anything, indicating to me
> that there were no duplicate values. There were the
> same number of rows in this query as a simple SELECT
> count(*) FROM usr.
>
> This tells me that Postgresql is not using the same
> method for determining duplicates when GROUPING and
> INDEXing.

You might try running the GROUP BY query after doing:

set enable_hashagg = false;
select ...

With that false it would have to sort the results which should be exactly the
same code as the index is using. I think.

That doesn't really answer the rest of your questions. The short of it is that
setting the encoding doesn't magically make your data encoded in that
encoding. If your client sends it one encoding but claims it's unicode then
Postgres will happily store it in a UNICODE database and it'll be garbage.

Maybe someone else will have more specific advice on that front.

--
greg

pgsql-general by date:

From: Tom Lane
Date: 17 September 2005, 02:23:40
Subject: Re: pg_ctl reload breaks our client

From: Vikas
Date: 17 September 2005, 05:35:08
Subject: unsubscribe

Re: Duplicate Values or Not?! - Mailing list pgsql-general

Previous

Next