Re: Doing better at HINTing an appropriate column within errorMissingColumn() - Mailing list pgsql-hackers

From	Robert Haas
Subject	Re: Doing better at HINTing an appropriate column within errorMissingColumn()
Date	November 19, 2014 13:43:59
Msg-id	CA+Tgmoam4EsCzo=mhK6PfgNV88BSEFR5ykueb+XEyJP7c7e_kA@mail.gmail.com
In response to	Re: Doing better at HINTing an appropriate column within errorMissingColumn() (Tom Lane <tgl@sss.pgh.pa.us>)
Responses	Re: Doing better at HINTing an appropriate column within errorMissingColumn()
List	pgsql-hackers

On Tue, Nov 18, 2014 at 8:03 PM, Tom Lane <tgl@sss.pgh.pa.us> wrote:
> Peter Geoghegan <pg@heroku.com> writes:
>> On Tue, Nov 18, 2014 at 3:29 PM, Robert Haas <robertmhaas@gmail.com> wrote:
>>> On Mon, Nov 17, 2014 at 3:04 PM, Peter Geoghegan <pg@heroku.com> wrote:
>>>> postgres=# select qty from orderlines ;
>>>> ERROR:  42703: column "qty" does not exist
>>>> HINT:  Perhaps you meant to reference the column "orderlines"."quantity".
>
>>> I don't buy this example, because it would give you the same hint if
>>> you told it you wanted to access a column called ant, or uay, or tit.
>>> And that's clearly ridiculous.  The reason why quantity looks like a
>>> reasonable suggestion for qty is because it's a conventional
>>> abbreviation, but an extremely high percentage of comparable cases
>>> won't be.
>
>> I maintain that omission of part of the correct spelling should be
>> weighed less.
>
> I would say that omission of the first letter should completely disqualify
> suggestions based on this heuristic; but it might make sense to weight
> omissions less after the first letter.

I think we would be well-advised not to start inventing our own
approximate matching algorithm.  Peter's suggestion boils down to a
guess that the default cost parameters for Levenshtein suck, and your
suggestion boils down to a guess that we can fix the problems with
Peter's suggestion by bolting another heuristic on top of it - and
possibly running Levenshtein twice with different sets of cost
parameters.  Ugh.
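
To make the objection quoted above concrete, here is a minimal sketch of
Levenshtein distance with configurable per-operation costs.  It is an
illustration only, not the code from the patch under discussion: the
function name weighted_levenshtein and its parameter names are
assumptions made for this example.

```python
def weighted_levenshtein(source, target, ins_cost=1, del_cost=1, sub_cost=1):
    """Cost of editing `source` into `target` (hypothetical helper).

    Classic dynamic-programming Levenshtein distance, keeping only one
    previous row, with separate costs for insert, delete and substitute.
    """
    # Row 0: building the first j characters of target from an empty source.
    prev = [j * ins_cost for j in range(len(target) + 1)]
    for i, s_ch in enumerate(source, start=1):
        # Column 0: deleting the first i characters of source.
        curr = [i * del_cost]
        for j, t_ch in enumerate(target, start=1):
            curr.append(min(
                prev[j] + del_cost,       # drop s_ch from source
                curr[j - 1] + ins_cost,   # insert t_ch into source
                prev[j - 1] + (0 if s_ch == t_ch else sub_cost),  # match or substitute
            ))
        prev = curr
    return prev[-1]


# The misspellings mentioned in the thread, scored against "quantity"
# using default unit costs.
for typo in ("qty", "ant", "uay", "tit"):
    print(typo, weighted_levenshtein(typo, "quantity"))
```

With unit costs all four inputs score 5, because each is a subsequence of
"quantity" and needs exactly five insertions.  A plain distance therefore
cannot tell "qty" apart from "ant", "uay" or "tit".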

-- 
Robert Haas
EnterpriseDB: http://www.enterprisedb.com
The Enterprise PostgreSQL Company
