I am willing to participate in GSoC to speed up the build of the gist index in postgis, which is based on postgresql.
And I need to know *everything* about the GiST API.
To do so I need to acquire the necessary theory and concepts to start this journey.
I do not have a computer science background, I have little knowledge about machines and I code in Python (for scientific computation, data science and ml).
So I am asking what I should learn to complete in an efficient way this task at hand: speeding up the build of gist index, if possible how much time is needed to accomplish each task.
The main thing you have to know IMO is the concept of "extensibility of indexes" in PostgreSQL. More specifically about GiST, you can have a look here: