Home > mailing lists

Re: Using quicksort for every external sort run - Mailing list pgsql-hackers

From	Greg Stark
Subject	Re: Using quicksort for every external sort run
Date	January 30, 2016 07:27:26
Msg-id	CAM-w4HORqMqS1R3XSW_7gS6=9RXe=xovj5SWwXnPXJR2Mt7niQ@mail.gmail.com Whole thread Raw
In response to	Re: Using quicksort for every external sort run (Robert Haas <robertmhaas@gmail.com>)
Responses	Re: Using quicksort for every external sort run
List	pgsql-hackers

Tree view

<p dir="ltr"><br /> On 29 Jan 2016 11:58 pm, "Robert Haas" <<a
href="mailto:robertmhaas@gmail.com">robertmhaas@gmail.com</a>>wrote:<br /> > It<br /> > seems pretty easy to
constructcases where this technique regresses,<br /> > and a large percentage of those cases are precisely those
where<br/> > replacement selection would have produced a single run, avoiding the<br /> > merge step altogether. 
<pdir="ltr">Now that avoiding the merge phase altogether didn't necessarily represent any actual advantage.<p
dir="ltr">Wedon't find out we've avoided the merge phase until the entire run has been spiked to disk. Then we need to
readit back in from disk to serve up those tuples.<p dir="ltr">If we have tapes to merge but can do then in a single
passwe do that lazily and merge as needed when we serve up the tuples. I doubt there's any speed difference in reading
twosequential streams with our buffering over one especially in the midst of a quiet doing other i/o. And N extra
comparisonsis less than the quicksort advantage.<p dir="ltr">If we could somehow predict that it'll be a single output
runthat would be a huge advantage. But having to spill all the tuples and then find out isn't really helpful.

pgsql-hackers by date:

From: Peter Geoghegan
Date: 30 January 2016, 07:25:42
Subject: Re: Using quicksort for every external sort run

From: Greg Stark
Date: 30 January 2016, 07:29:19
Subject: Re: Using quicksort for every external sort run

Re: Using quicksort for every external sort run - Mailing list pgsql-hackers

Previous

Next