Thread: Re: [HACKERS] PQescapeIdentifier
Christopher Kings-Lynne wrote: > TODO item done for 8.2: > > * Add PQescapeIdentifier() to libpq > > Someone probably needs to check this :) Updated patch applied. Thanks. -- Bruce Momjian bruce@momjian.us EnterpriseDB http://www.enterprisedb.com + If your life is a hard drive, Christ can be your backup. + Index: doc/src/sgml/libpq.sgml =================================================================== RCS file: /cvsroot/pgsql/doc/src/sgml/libpq.sgml,v retrieving revision 1.211 diff -c -c -r1.211 libpq.sgml *** doc/src/sgml/libpq.sgml 23 May 2006 22:13:19 -0000 1.211 --- doc/src/sgml/libpq.sgml 26 Jun 2006 23:54:12 -0000 *************** *** 2279,2284 **** --- 2279,2347 ---- </para> </sect2> + <sect2 id="libpq-exec-escape-identifier"> + <title>Escaping Identifier for Inclusion in SQL Commands</title> + + <indexterm zone="libpq-exec-escape-identifier"><primary>PQescapeIdentifier</></> + <indexterm zone="libpq-exec-escape-identifier"><primary>escaping strings</></> + + <para> + <function>PQescapeIdentifier</function> escapes a string for use + as an identifier name within an SQL command. For example; table names, + column names, view names and user names are all identifiers. + Double quotes (") must be escaped to prevent them from being interpreted + specially by the SQL parser. <function>PQescapeIdentifier</> performs this + operation. + </para> + + <tip> + <para> + It is especially important to do proper escaping when handling strings that + were received from an untrustworthy source. Otherwise there is a security + risk: you are vulnerable to <quote>SQL injection</> attacks wherein unwanted + SQL commands are fed to your database. + </para> + </tip> + + <para> + Note that it is still necessary to do escaping of identifiers when + using functions that support parameterized queries such as <function>PQexecParams</> or + its sibling routines. Only literal values are automatically escaped + using these functions, not identifiers. + + <synopsis> + size_t PQescapeIdentifier (char *to, const char *from, size_t length); + </synopsis> + </para> + + <para> + The parameter <parameter>from</> points to the first character of the string + that is to be escaped, and the <parameter>length</> parameter gives the + number of characters in this string. A terminating zero byte is not + required, and should not be counted in <parameter>length</>. (If + a terminating zero byte is found before <parameter>length</> bytes are + processed, <function>PQescapeIdentifier</> stops at the zero; the behavior + is thus rather like <function>strncpy</>.) + <parameter>to</> shall point to a + buffer that is able to hold at least one more character than twice + the value of <parameter>length</>, otherwise the behavior is + undefined. A call to <function>PQescapeIdentifier</> writes an escaped + version of the <parameter>from</> string to the <parameter>to</> + buffer, replacing special characters so that they cannot cause any + harm, and adding a terminating zero byte. The double quotes that + may surround <productname>PostgreSQL</> identifiers are not + included in the result string; they should be provided in the SQL + command that the result is inserted into. + </para> + <para> + <function>PQescapeIdentifier</> returns the number of characters written + to <parameter>to</>, not including the terminating zero byte. + </para> + <para> + Behavior is undefined if the <parameter>to</> and <parameter>from</> + strings overlap. + </para> + </sect2> <sect2 id="libpq-exec-escape-bytea"> <title>Escaping Binary Strings for Inclusion in SQL Commands</title> Index: src/interfaces/libpq/exports.txt =================================================================== RCS file: /cvsroot/pgsql/src/interfaces/libpq/exports.txt,v retrieving revision 1.11 diff -c -c -r1.11 exports.txt *** src/interfaces/libpq/exports.txt 28 May 2006 22:42:05 -0000 1.11 --- src/interfaces/libpq/exports.txt 26 Jun 2006 23:54:20 -0000 *************** *** 130,132 **** --- 130,134 ---- PQencryptPassword 128 PQisthreadsafe 129 enlargePQExpBuffer 130 + PQescapeIdentifier 131 + Index: src/interfaces/libpq/fe-exec.c =================================================================== RCS file: /cvsroot/pgsql/src/interfaces/libpq/fe-exec.c,v retrieving revision 1.186 diff -c -c -r1.186 fe-exec.c *** src/interfaces/libpq/fe-exec.c 28 May 2006 21:13:54 -0000 1.186 --- src/interfaces/libpq/fe-exec.c 26 Jun 2006 23:54:21 -0000 *************** *** 2516,2521 **** --- 2516,2557 ---- } /* + * Escaping arbitrary strings to get valid SQL identifier strings. + * + * Replaces " with "". + * + * length is the length of the source string. (Note: if a terminating NUL + * is encountered sooner, PQescapeIdentifier stops short of "length"; the behavior + * is thus rather like strncpy.) + * + * For safety the buffer at "to" must be at least 2*length + 1 bytes long. + * A terminating NUL character is added to the output string, whether the + * input is NUL-terminated or not. + * + * Returns the actual length of the output (not counting the terminating NUL). + */ + size_t + PQescapeIdentifier(char *to, const char *from, size_t length) + { + const char *source = from; + char *target = to; + size_t remaining = length; + + while (remaining > 0 && *source != '\0') + { + if (*source == '"') + *target++ = *source; + *target++ = *source++; + remaining--; + } + + /* Write the terminating NUL character. */ + *target = '\0'; + + return target - to; + } + + /* * PQescapeBytea - converts from binary string to the * minimal encoding necessary to include the string in an SQL * INSERT statement with a bytea type column as the target. Index: src/interfaces/libpq/libpq-fe.h =================================================================== RCS file: /cvsroot/pgsql/src/interfaces/libpq/libpq-fe.h,v retrieving revision 1.129 diff -c -c -r1.129 libpq-fe.h *** src/interfaces/libpq/libpq-fe.h 23 May 2006 22:13:19 -0000 1.129 --- src/interfaces/libpq/libpq-fe.h 26 Jun 2006 23:54:21 -0000 *************** *** 436,441 **** --- 436,443 ---- size_t *to_length); extern unsigned char *PQunescapeBytea(const unsigned char *strtext, size_t *retbuflen); + extern size_t PQescapeIdentifier(char *to, const char *from, size_t length); + /* These forms are deprecated! */ extern size_t PQescapeString(char *to, const char *from, size_t length); extern unsigned char *PQescapeBytea(const unsigned char *from, size_t from_length,
Bruce Momjian <bruce@momjian.us> writes: >> * Add PQescapeIdentifier() to libpq > Updated patch applied. Thanks. Have either of you inquired into the encoding-safety of this code? It certainly looks like no consideration was given for that. regards, tom lane
Tom Lane wrote: > Bruce Momjian <bruce@momjian.us> writes: > >> * Add PQescapeIdentifier() to libpq > > > Updated patch applied. Thanks. > > Have either of you inquired into the encoding-safety of this code? > It certainly looks like no consideration was given for that. I thought of that but I assume we were not accepting user-supplied identifiers for this --- that this was only for application use. Am I wrong? -- Bruce Momjian bruce@momjian.us EnterpriseDB http://www.enterprisedb.com + If your life is a hard drive, Christ can be your backup. +
Bruce Momjian <bruce@momjian.us> writes: > Tom Lane wrote: >> Have either of you inquired into the encoding-safety of this code? >> It certainly looks like no consideration was given for that. > I thought of that but I assume we were not accepting user-supplied > identifiers for this --- that this was only for application use. Am I > wrong? By definition, an escaping routine is not supposed to trust the data it is handed. We *will* be seeing a CVE report if this function has got any escaping vulnerability. If you insist on a practical example, I can certainly imagine someone thinking it'd be cool to allow searches on a user-selected column, and implementing that by passing the user-given column name straight into the query with only PQescapeIdentifier for safety. regards, tom lane
Tom Lane wrote: > Bruce Momjian <bruce@momjian.us> writes: > > Tom Lane wrote: > >> Have either of you inquired into the encoding-safety of this code? > >> It certainly looks like no consideration was given for that. > > > I thought of that but I assume we were not accepting user-supplied > > identifiers for this --- that this was only for application use. Am I > > wrong? > > By definition, an escaping routine is not supposed to trust the data it > is handed. We *will* be seeing a CVE report if this function has got > any escaping vulnerability. > > If you insist on a practical example, I can certainly imagine someone > thinking it'd be cool to allow searches on a user-selected column, and > implementing that by passing the user-given column name straight into > the query with only PQescapeIdentifier for safety. OK, does someone want to fix it, or should I revert it? -- Bruce Momjian bruce@momjian.us EnterpriseDB http://www.enterprisedb.com + If your life is a hard drive, Christ can be your backup. +
Christopher Kings-Lynne <chriskl@familyhealth.com.au> writes: > Yes, phpPgAdmin sure would. I imagine this would be a nightmare to > address properly, so perhaps we should remove the function :( Well, it's fixable, cf PQescapeStringConn, but we should fix it *before* it gets into the field not after. regards, tom lane
>> I thought of that but I assume we were not accepting user-supplied >> identifiers for this --- that this was only for application use. Am I >> wrong? Well, yes the plan was to accept user-supplied identifiers... > If you insist on a practical example, I can certainly imagine someone > thinking it'd be cool to allow searches on a user-selected column, and > implementing that by passing the user-given column name straight into > the query with only PQescapeIdentifier for safety. Yes, phpPgAdmin sure would. I imagine this would be a nightmare to address properly, so perhaps we should remove the function :(
Hang on a second. Has someone considered the encoding issues this might suffer from, same as PQescapeString? I remember we discussed it briefly and I mentioned it's outta my league to prove one way or the other... Bruce Momjian wrote: > Christopher Kings-Lynne wrote: >> TODO item done for 8.2: >> >> * Add PQescapeIdentifier() to libpq >> >> Someone probably needs to check this :) > > Updated patch applied. Thanks. > > > > ------------------------------------------------------------------------ > > Index: doc/src/sgml/libpq.sgml > =================================================================== > RCS file: /cvsroot/pgsql/doc/src/sgml/libpq.sgml,v > retrieving revision 1.211 > diff -c -c -r1.211 libpq.sgml > *** doc/src/sgml/libpq.sgml 23 May 2006 22:13:19 -0000 1.211 > --- doc/src/sgml/libpq.sgml 26 Jun 2006 23:54:12 -0000 > *************** > *** 2279,2284 **** > --- 2279,2347 ---- > </para> > </sect2> > > + <sect2 id="libpq-exec-escape-identifier"> > + <title>Escaping Identifier for Inclusion in SQL Commands</title> > + > + <indexterm zone="libpq-exec-escape-identifier"><primary>PQescapeIdentifier</></> > + <indexterm zone="libpq-exec-escape-identifier"><primary>escaping strings</></> > + > + <para> > + <function>PQescapeIdentifier</function> escapes a string for use > + as an identifier name within an SQL command. For example; table names, > + column names, view names and user names are all identifiers. > + Double quotes (") must be escaped to prevent them from being interpreted > + specially by the SQL parser. <function>PQescapeIdentifier</> performs this > + operation. > + </para> > + > + <tip> > + <para> > + It is especially important to do proper escaping when handling strings that > + were received from an untrustworthy source. Otherwise there is a security > + risk: you are vulnerable to <quote>SQL injection</> attacks wherein unwanted > + SQL commands are fed to your database. > + </para> > + </tip> > + > + <para> > + Note that it is still necessary to do escaping of identifiers when > + using functions that support parameterized queries such as <function>PQexecParams</> or > + its sibling routines. Only literal values are automatically escaped > + using these functions, not identifiers. > + > + <synopsis> > + size_t PQescapeIdentifier (char *to, const char *from, size_t length); > + </synopsis> > + </para> > + > + <para> > + The parameter <parameter>from</> points to the first character of the string > + that is to be escaped, and the <parameter>length</> parameter gives the > + number of characters in this string. A terminating zero byte is not > + required, and should not be counted in <parameter>length</>. (If > + a terminating zero byte is found before <parameter>length</> bytes are > + processed, <function>PQescapeIdentifier</> stops at the zero; the behavior > + is thus rather like <function>strncpy</>.) > + <parameter>to</> shall point to a > + buffer that is able to hold at least one more character than twice > + the value of <parameter>length</>, otherwise the behavior is > + undefined. A call to <function>PQescapeIdentifier</> writes an escaped > + version of the <parameter>from</> string to the <parameter>to</> > + buffer, replacing special characters so that they cannot cause any > + harm, and adding a terminating zero byte. The double quotes that > + may surround <productname>PostgreSQL</> identifiers are not > + included in the result string; they should be provided in the SQL > + command that the result is inserted into. > + </para> > + <para> > + <function>PQescapeIdentifier</> returns the number of characters written > + to <parameter>to</>, not including the terminating zero byte. > + </para> > + <para> > + Behavior is undefined if the <parameter>to</> and <parameter>from</> > + strings overlap. > + </para> > + </sect2> > > <sect2 id="libpq-exec-escape-bytea"> > <title>Escaping Binary Strings for Inclusion in SQL Commands</title> > Index: src/interfaces/libpq/exports.txt > =================================================================== > RCS file: /cvsroot/pgsql/src/interfaces/libpq/exports.txt,v > retrieving revision 1.11 > diff -c -c -r1.11 exports.txt > *** src/interfaces/libpq/exports.txt 28 May 2006 22:42:05 -0000 1.11 > --- src/interfaces/libpq/exports.txt 26 Jun 2006 23:54:20 -0000 > *************** > *** 130,132 **** > --- 130,134 ---- > PQencryptPassword 128 > PQisthreadsafe 129 > enlargePQExpBuffer 130 > + PQescapeIdentifier 131 > + > Index: src/interfaces/libpq/fe-exec.c > =================================================================== > RCS file: /cvsroot/pgsql/src/interfaces/libpq/fe-exec.c,v > retrieving revision 1.186 > diff -c -c -r1.186 fe-exec.c > *** src/interfaces/libpq/fe-exec.c 28 May 2006 21:13:54 -0000 1.186 > --- src/interfaces/libpq/fe-exec.c 26 Jun 2006 23:54:21 -0000 > *************** > *** 2516,2521 **** > --- 2516,2557 ---- > } > > /* > + * Escaping arbitrary strings to get valid SQL identifier strings. > + * > + * Replaces " with "". > + * > + * length is the length of the source string. (Note: if a terminating NUL > + * is encountered sooner, PQescapeIdentifier stops short of "length"; the behavior > + * is thus rather like strncpy.) > + * > + * For safety the buffer at "to" must be at least 2*length + 1 bytes long. > + * A terminating NUL character is added to the output string, whether the > + * input is NUL-terminated or not. > + * > + * Returns the actual length of the output (not counting the terminating NUL). > + */ > + size_t > + PQescapeIdentifier(char *to, const char *from, size_t length) > + { > + const char *source = from; > + char *target = to; > + size_t remaining = length; > + > + while (remaining > 0 && *source != '\0') > + { > + if (*source == '"') > + *target++ = *source; > + *target++ = *source++; > + remaining--; > + } > + > + /* Write the terminating NUL character. */ > + *target = '\0'; > + > + return target - to; > + } > + > + /* > * PQescapeBytea - converts from binary string to the > * minimal encoding necessary to include the string in an SQL > * INSERT statement with a bytea type column as the target. > Index: src/interfaces/libpq/libpq-fe.h > =================================================================== > RCS file: /cvsroot/pgsql/src/interfaces/libpq/libpq-fe.h,v > retrieving revision 1.129 > diff -c -c -r1.129 libpq-fe.h > *** src/interfaces/libpq/libpq-fe.h 23 May 2006 22:13:19 -0000 1.129 > --- src/interfaces/libpq/libpq-fe.h 26 Jun 2006 23:54:21 -0000 > *************** > *** 436,441 **** > --- 436,443 ---- > size_t *to_length); > extern unsigned char *PQunescapeBytea(const unsigned char *strtext, > size_t *retbuflen); > + extern size_t PQescapeIdentifier(char *to, const char *from, size_t length); > + > /* These forms are deprecated! */ > extern size_t PQescapeString(char *to, const char *from, size_t length); > extern unsigned char *PQescapeBytea(const unsigned char *from, size_t from_length,
Christopher Kings-Lynne <chriskl@familyhealth.com.au> writes: > Hang on a second. Has someone considered the encoding issues this might > suffer from, same as PQescapeString? That was the point I raised when I saw the commit. My advice is we shouldn't have PQescapeIdentifier at all. PQescapeIdentifierConn would be the thing to define, parallel to PQescapeStringConn. regards, tom lane
Tom Lane wrote: > Christopher Kings-Lynne <chriskl@familyhealth.com.au> writes: > > Hang on a second. Has someone considered the encoding issues this might > > suffer from, same as PQescapeString? > > That was the point I raised when I saw the commit. > > My advice is we shouldn't have PQescapeIdentifier at all. > PQescapeIdentifierConn would be the thing to define, > parallel to PQescapeStringConn. Patch reverted, TODO updated to: o Add PQescapeIdentifierConn() -- Bruce Momjian bruce@momjian.us EnterpriseDB http://www.enterprisedb.com + If your life is a hard drive, Christ can be your backup. +