Re: libpq questions - Mailing list pgsql-general

From James Harper
Subject Re: libpq questions
Date
Msg-id AEC6C66638C05B468B556EA548C1A77DAF0689@trantor
Whole thread Raw
In response to libpq questions  ("James Harper" <james.harper@bendigoit.com.au>)
Responses Re: libpq questions
List pgsql-general
> On Tue, Jan 31, 2006 at 10:23:54PM +1100, James Harper wrote:
> > For the libpq interface:
> >
> > I need to be able to know if a column in a result from a query is
> > nullable or not. From reading the documentation it seems that I can
> > obtain the following information:
> > . scan all the rows in the result and see if there exists a null
value
> > for each column...
>
> Be careful what you infer from such a scan: not finding any NULLs
> doesn't necessarily mean a column isn't nullable, it just means the
> result set didn't contain any NULLs.

I understand that limitation, but haven't figured out if it matters in
my situation. The only time it might is if the client wants to infer a
schema as a result of a query, eg 'SELECT * FROM TableX WHERE 0 = 1'.

> > . backtrack the column to the source table (assuming a
non-calculated
> > field) and check the nullable status there
> >
> > Neither of the above is particularly cheap to do...
>
> If you know the table and column names then checking which columns
> have a NOT NULL constraint is a simple query against pg_attribute.

Yep, but does involve another round trip to the server to determine... I
guess it's the only way to do it though. Obviously it's only going to
work for fields which are returned directly from a table. What about
this situation:

CREATE TABLE TableX (f1 int NOT NULL, f2 int NULL)
INSERT INTO TableX VALUES (10, 10)
INSERT INTO TableX VALUES (20, 11)
INSERT INTO TableX VALUES (30, NULL)
INSERT INTO TableX VALUES (40, 12)

SELECT f1 + f2 AS f FROM TableX WHERE f1 < 30

Now, given the current where clause, the client isn't going to see any
NULL values, but a change in the where clause might suddenly make the
calculated f field nullable. Maybe this doesn't matter and the client
application should itself be aware of the database schema anyway...
maybe I'm just worrying too much :)

In the above example, does the database engine assign internally a
'nullability' flag? I guess it must do... because how would the
following be evaluated:

SELECT f1 + f2 AS f INTO TableY FROM TableX WHERE f1 < 30

Would the column f in the created table be nullable or not?

I guess I need to do some testing unless you know off the top of your
head?

>
> > Which leads me to my next question... If I executed a select against
a
> > table with a million rows, and the query returned all of the rows,
what
> > happens? Are all the rows read into memory on the client before
> > returning the result? Or are rows only fetched from the server as
> > required?
>
> libpq fetches all rows before returning any to the client; if you
> want to fetch rows in smaller chunks then use a cursor.  The
> developers' TODO list has an item to address that problem:
>
> * Allow statement results to be automatically batched to the client
>
> http://www.postgresql.org/docs/faqs.TODO.html
>

Hmmm... so a select statement with result set of a million rows is going
to stall for a while before the results are usefully available to the
client, and is then going to use a significant amount of memory on the
client...

Is this a limitation of libpq or of the underlying database engine?

Are there any alternative (but native - eg not ODBC) interfaces to
postgresql?

Thanks for the response!

James


pgsql-general by date:

Previous
From: nboutelier@hotmail.com
Date:
Subject: Re: Can't get the field = ANY(array) clause to work...
Next
From: "rlee0001"
Date:
Subject: Re: 8.0.3 regexp_replace()...