Thread: Parallel GiST build on Cube

Parallel GiST build on Cube

From
Shyam Saladi
Date:
Hello --

I regularly build GiST indexes on large databases. Recently, as the size of the database has ballooned (30 million rows), so has the build time for an index on a column of points in 3- and 8-dimensional space (0-volume cubes).

Is anyone working on (or has anyone already implemented) a parallel GiST build on Cube? If not, I'd like to try contributing this. Does anyone familiar with the backend of Cube have pointers? Any input would be great.

Thanks,
Shyam

--
NSF Graduate Research Fellow - Clemons Lab
Biochemistry and Molecular Biophysics
California Institute of Technology

Re: Parallel GiST build on Cube

From
Darafei "Komяpa" Praliaskouski
Date:
Hello,

I know of two things for GiST that can help:
 - Fast sorting GiST build, a commitfest entry by Andrey Borodin; not parallel, but faster - https://www.postgresql.org/message-id/flat/1A36620E-CAD8-4267-9067-FB31385E7C0D%40yandex-team.ru
 - Fast sorting GiST build by Nikita Glukhov; reuses the btree code, so it is also parallel - https://github.com/postgres/postgres/compare/master...glukhovn:gist_btree_build

Both are very much wanted for PostGIS, which also indexes everything by 2- to 4-dimensional cubes, and both require improvements in core infrastructure and in the opclass.
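
To illustrate the general idea behind a sorting build, here is a minimal Python sketch. It assumes the sort key is a Z-order (Morton) curve over quantized coordinates, which is one common choice; the patches linked above may order tuples differently. The function names `morton_key`, `quantize`, and `sort_for_bulk_load` are hypothetical and are not part of PostgreSQL or Cube.

```python
def morton_key(coords, bits=16):
    """Interleave the bits of non-negative integer coordinates into one Z-order key."""
    key = 0
    ndim = len(coords)
    for bit in range(bits):
        for dim, value in enumerate(coords):
            # Bit `bit` of dimension `dim` lands at position bit * ndim + dim.
            key |= ((value >> bit) & 1) << (bit * ndim + dim)
    return key


def quantize(point, lo, hi, bits=16):
    """Map float coordinates in [lo, hi] onto the integer grid [0, 2**bits - 1]."""
    scale = (1 << bits) - 1
    return [min(scale, max(0, int((x - lo) / (hi - lo) * scale))) for x in point]


def sort_for_bulk_load(points, lo, hi, bits=16):
    """Return the points ordered by Z-order key (assumed ordering, see above)."""
    return sorted(points, key=lambda p: morton_key(quantize(p, lo, hi, bits), bits))
```

With rows in such an order, neighbouring tuples tend to be close in space, so leaf pages can be filled sequentially rather than by inserting one tuple at a time. The sort itself is the step that can reuse existing parallel sorting code.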




On Mon, Apr 27, 2020 at 8:57 PM Shyam Saladi <saladi@caltech.edu> wrote:

--
Darafei Praliaskouski