FW: Simple join optimized badly? - Mailing list pgsql-performance

From H.J. Sanders
Subject FW: Simple join optimized badly?
Date
Msg-id GMEAJIDMNDKCIHMMHDFLEEOHEGAA.hjs@rmax.nl
Whole thread Raw
Responses Re: FW: Simple join optimized badly?
List pgsql-performance


 Hello.

 Simply jumping on the bandwagon, just my 2 cents:

 why not just like in some other (commercial) databases:

 a statement to say: use index ............

 I know this is against all though but if even the big ones can not resist
 the pressure of their users, why not?

 Henk Sanders

> > -----Oorspronkelijk bericht-----
> > Van: pgsql-performance-owner@postgresql.org
> > [mailto:pgsql-performance-owner@postgresql.org]Namens Bucky Jordan
> > Verzonden: woensdag 11 oktober 2006 16:27
> > Aan: Tom Lane; Brian Herlihy
> > CC: Postgresql Performance
> > Onderwerp: Re: [PERFORM] Simple join optimized badly?
> >
> >
> > > Brian Herlihy <btherl@yahoo.com.au> writes:
> > > > What would it take for hints to be added to postgres?
> > >
> > > A *whole lot* more thought and effort than has been expended on the
> > > subject to date.
> > >
> > > Personally I have no use for the idea of "force the planner to do
> > > exactly X given a query of exactly Y".  You don't have exactly Y
> > > today, tomorrow, and the day after (if you do, you don't need a
> > > hint mechanism at all, you need a mysql-style query cache).
> > > IMHO most of the planner mistakes we see that could be fixed via
> > > hinting are really statistical estimation errors, and so the right
> > > level to be fixing them at is hints about how to estimate the number
> > > of rows produced for given conditions.  Mind you that's still a plenty
> > > hard problem, but you could at least hope that a hint of that form
> > > would be useful for more than one query.
> > >
> >
> > Do I understand correctly that you're suggesting it might not be a bad
> > idea to allow users to provide statistics?
> >
> > Is this along the lines of "I'm loading a big table and touching every
> > row of data, so I may as well collect some stats along the way" and "I
> > know my data contains these statistical properties, but the analyzer
> > wasn't able to figure that out (or maybe can't figure it out efficiently
> > enough)"?
> >
> > While it seems like this would require more knowledge from the user
> > (e.g. more about their data, how the planner works, and how it uses
> > statistics) this would actually be helpful/required for those who really
> > care about performance. I guess it's the difference between a tool
> > advanced users can get long term benefit from, or a quick fix that will
> > probably come back to bite you. I've been pleased with Postgres'
> > thoughtful design; recently I've been doing some work with MySQL, and
> > can't say I feel the same way.
> >
> > Also, I'm guessing this has already come up at some point, but what
> > about allowing PG to do some stat collection during queries? If you're
> > touching a lot of data (such as an import process) wouldn't it be more
> > efficient (and perhaps more accurate) to collect stats then, rather than
> > having to re-scan? It would be nice to be able to turn this on/off on a
> > per query basis, seeing as it could have pretty negative impacts on OLTP
> > performance...
> >
> > - Bucky
> >
> > ---------------------------(end of broadcast)---------------------------
> > TIP 4: Have you searched our list archives?
> >
> >                http://archives.postgresql.org
> >

pgsql-performance by date:

Previous
From: Markus Schaber
Date:
Subject: Re: Scrub one large table against another (vmstat output)
Next
From: Mark Kirkwood
Date:
Subject: Re: FW: Simple join optimized badly?