Re: data model one large and many small columns - Mailing list pgsql-admin

From Campbell, Lance
Subject Re: data model one large and many small columns
Date
Msg-id B75CD08C73BD3543B97E4EF3964B7D70307662C8@CITESMBX1.ad.uillinois.edu
Whole thread Raw
In response to Re: data model one large and many small columns  (Tom Lane <tgl@sss.pgh.pa.us>)
List pgsql-admin
I had a hunch this was the case.  I just wanted to double check.

Lance

-----Original Message-----
From: Tom Lane [mailto:tgl@sss.pgh.pa.us]
Sent: Tuesday, July 26, 2016 4:18 PM
To: Campbell, Lance <lance@illinois.edu>
Cc: pgsql-admin@postgresql.org
Subject: Re: [ADMIN] data model one large and many small columns

"Campbell, Lance" <lance@illinois.edu> writes:
> Suppose you have a table that consists of a lot of small sized columns with one really large text column.  The text
columnrepresents an average sized web page.  When inserting or updating you will work with only one row at a time. 

> When selecting information you will either:
> A)     select all columns in one row for display or editing purposes.
> B)      Or you will select hundreds of rows but NOT the very large text column.

> Question: Is there any performance to be gained from PostgreSQL by
> storing the data as two tables versus one table?

No, probably not; TOAST will effectively do that for you by off-loading the large text values into the side table.

If you were to do an explicit join, there would be some use-cases --- mainly where you were selecting a LOT of rows ---
wherein theory you could get a smarter plan from the explicit join.  TOAST will effectively always fetch the text
valuesvia a nestloop-with-inner-indexscan plan, but maybe hashing or merging would be smarter.  Unfortunately, with two
explicittables, TOAST would still apply to the second table, which means that what you've really got under the hood is
athree-way join, with the intermediate table contributing nothing except overhead. 

So don't bother ...

            regards, tom lane


pgsql-admin by date:

Previous
From: Tom Lane
Date:
Subject: Re: data model one large and many small columns
Next
From: Stephen Frost
Date:
Subject: Re: Pg-Upgrade standbys via rsync... and avoid sending UNlogged data?