Home > mailing lists

Re: Doing better at HINTing an appropriate column within errorMissingColumn() - Mailing list pgsql-hackers

From	Peter Geoghegan
Subject	Re: Doing better at HINTing an appropriate column within errorMissingColumn()
Date	November 19, 2014 20:34:07
Msg-id	CAM3SWZRy8i98pvVw-YcQxSgT8ZoXBmx_SL3p2VEse1vC4jjQ_w@mail.gmail.com Whole thread Raw
In response to	Re: Doing better at HINTing an appropriate column within errorMissingColumn() (Robert Haas <robertmhaas@gmail.com>)
Responses	Re: Doing better at HINTing an appropriate column within errorMissingColumn() (Robert Haas <robertmhaas@gmail.com>)
List	pgsql-hackers

Tree view

On Wed, Nov 19, 2014 at 5:43 AM, Robert Haas <robertmhaas@gmail.com> wrote:
> I think we would be well-advised not to start inventing our own
> approximate matching algorithm.  Peter's suggestion boils down to a
> guess that the default cost parameters for Levenshtein suck, and your
> suggestion boils down to a guess that we can fix the problems with
> Peter's suggestion by bolting another heuristic on top of it - and
> possibly running Levenshtein twice with different sets of cost
> parameters.  Ugh.

I agree.

While I am perfectly comfortable with the fact that we are guessing
here, my guesses are based on what I observed to work well with real
schemas, and simulated errors that I thought were representative of
human error. Obviously it's possible that another scheme will do
better sometimes, including for example a scheme that picks a match
entirely at random. But on average, I think that what I have here will
do better than anything else proposed so far.

-- 
Peter Geoghegan

pgsql-hackers by date:

From: Robert Haas
Date: 19 November 2014, 20:32:02
Subject: Re: pg_test_fsync file descriptor leak

From: Robert Haas
Date: 19 November 2014, 20:34:56
Subject: Re: proposal: plpgsql - Assert statement

Re: Doing better at HINTing an appropriate column within errorMissingColumn() - Mailing list pgsql-hackers

Previous

Next