Re: Reducing the overhead of NUMERIC data - Mailing list pgsql-hackers

From	Tom Lane
Subject	Re: Reducing the overhead of NUMERIC data
Date	November 4, 2005 14:54:20
Msg-id	16543.1131130444@sss.pgh.pa.us
In response to	Re: Reducing the overhead of NUMERIC data (mark@mark.mielke.cc)
Responses	Re: Reducing the overhead of NUMERIC data
List	pgsql-hackers

mark@mark.mielke.cc writes:
> I read "the backend is by and large an ASCII, null-terminated-string
> engine" with "we use UTF-8 [for varlena strings?]" as, a lot of the
> code assumes varlena strings are '\0' terminated, and an assumption
> on my part, that the varlena strings are not stored in the backend
> with a '\0' terminator, therefore, they require being copied out,
> terminated with a '\0', before they can be used?

There are places where we have to do that, the worst from a performance
viewpoint being in string comparison --- we have to null-terminate both
values before we can pass them to strcoll().
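
To illustrate the copy-then-compare cost described above, here is a minimal sketch in C. It is not the backend's actual code: the counted_str type and the to_cstring() and counted_strcoll() helpers are hypothetical names invented for this example, and plain malloc() stands in for whatever allocator the backend would use. The only real library calls are malloc(), free(), memcpy(), abort() and strcoll().

```c
#include <stdlib.h>
#include <string.h>

/*
 * Hypothetical length-counted string, standing in for a varlena value.
 * The bytes at "data" are NOT null-terminated.
 */
typedef struct
{
    size_t      len;
    const char *data;
} counted_str;

/* Copy a counted string into a freshly allocated, null-terminated buffer. */
static char *
to_cstring(const counted_str *s)
{
    char *buf = malloc(s->len + 1);

    if (buf == NULL)
        return NULL;
    memcpy(buf, s->data, s->len);
    buf[s->len] = '\0';
    return buf;
}

/*
 * Locale-aware comparison of two counted strings.  Both inputs must be
 * copied and terminated first, because strcoll() only accepts
 * null-terminated strings.  Returns <0, 0 or >0 like strcoll().
 */
static int
counted_strcoll(const counted_str *a, const counted_str *b)
{
    char *abuf = to_cstring(a);
    char *bbuf = to_cstring(b);
    int   result;

    if (abuf == NULL || bbuf == NULL)
    {
        /* Out of memory: this sketch simply gives up. */
        free(abuf);
        free(bbuf);
        abort();
    }

    result = strcoll(abuf, bbuf);

    free(abuf);
    free(bbuf);
    return result;
}
```

Every call pays for two allocations and two copies before any comparison happens, which is the overhead in question. Note also that any zero byte inside the data would cut the comparison short, since strcoll() stops at the first terminator it sees.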

One of the large jobs that would have to be done before we could even
contemplate using UCS2/UCS4 is getting rid of our dependence on strcoll(),
since its API only accepts null-terminated strings.

> How much effort (past discussions that I've missed from a decade ago? 
> hehe) has been put into determining whether a zero-copy architecture,
> or really, a minimum copy architecture, would address some of these
> bottlenecks? Am I dreaming? :-)

We've already done it in places, for instance the new implementation
of "virtual tuples" in TupleTableSlots eliminates a lot of copying
of pass-by-reference values.
        regards, tom lane
