Re: [HACKERS] 'a' == 'a ' - Mailing list pgsql-general

From Greg Stark
Subject Re: [HACKERS] 'a' == 'a '
Date
Msg-id 87br1kaf12.fsf@stark.xeocode.com
Whole thread Raw
In response to Re: [HACKERS] 'a' == 'a '  (Tom Lane <tgl@sss.pgh.pa.us>)
Responses Re: [HACKERS] 'a' == 'a '  (Jan Wieck <JanWieck@Yahoo.com>)
List pgsql-general
Tom Lane <tgl@sss.pgh.pa.us> writes:

> Chris Travers <chris@travelamericas.com> writes:
> > If I understand the spec correctly, it seems to indicate that this is
> > specific to the locale/character set.
>
> The spec associates padding behavior with collations, which per spec are
> separate from the datatypes --- that is, you should be able to able to
> specify a collation for each string-type table column (whether char(N)
> or varchar(N)) and even for each literal string constant.  We do not
> currently have that capability, and accordingly fall back to binding
> PAD SPACE behavior to char(N) and NO PAD behavior to varchar(N).
>
> AFAICS this choice is allowed by the spec since the default collation is
> implementation-defined.

Does it even make sense for char(N) to not be space padded? I had the
impression char(N) was always N characters long, not more or less. I can't
picture any other character being used for padding, then you would need a more
flexible rtrim function.

And I can understand the collation order determining whether 'a' and 'a '
compare equal. But surely if you store 'a' in a varchar(N) you have to get 'a'
back out, not some other string! Does the spec really allow varchar to
actually be padded and not just compare ignoring trailing space?


(I can't believe anyone really wants varchar to be space padded. Space padding
always seemed like a legacy feature for databases with fixed record length
data types. Why would anyone want a string data type that can't represent all
strings?)

--
greg

pgsql-general by date:

Previous
From: CSN
Date:
Subject: NULL != text ?
Next
From: Tino Wildenhain
Date:
Subject: Re: [HACKERS] 'a' == 'a ' (Was: RE: [pgsql-advocacy]