Re: GSoC project : K-medoids clustering in Madlib - Mailing list pgsql-hackers

From Atri Sharma
Subject Re: GSoC project : K-medoids clustering in Madlib
Date
Msg-id CAOeZVieJZoskgNoM2SzUDq+abGTob8hM3mVMdtZcM7boCWpMyA@mail.gmail.com
Whole thread Raw
In response to GSoC project : K-medoids clustering in Madlib  (viod <viod.len@gmail.com>)
Responses Re: GSoC project : K-medoids clustering in Madlib  (Tom Lane <tgl@sss.pgh.pa.us>)
List pgsql-hackers
I suggested a couple of algorithms to be implemented in MADLib(apart
from K Medoids). You could pick some(or all) of them, which would
require 3 months to be completed.

As for more information on index, you can refer

http://wiki.postgresql.org/wiki/What's_new_in_PostgreSQL_9.1

along with the postgres wiki. The wiki is the standard for anything postgres.

pg_trgm used KNN, but I believe it uses its own implementation of the
algorithm. The idea I proposed aims at writing an implementation in
the MADlib so that any client program can use the algorithm(s) in
their code directly, using MADlib functions.

Regards,

Atri

On 3/26/13, viod <viod.len@gmail.com> wrote:
> Hello!
>
> I'm an IT student, and I would like to apply for the 2013 GSoC.
> I've been looking at this mailing list for a while now, and I saw a
> suggestion for GSoC that particularly interested me: implementing the
> K-medoids clustering in Madlib, as it is supposed to be more efficient than
> the K-means algorithm.
>
> I didn't know about these algorithms before, but I have documented myself,
> and it looks quite interesting to me, and even more as I currently have
> lessons (but very very simplified unfortunately).
>
> I've got a few questions:
> Won't this be a quite short project? I can't get an idea of how long it
> would take me to implement this algorithm in a way that would be usable by
> postgresql, but 3 months looks long for this task, doesn't it?
>
> Someone on the IRC channel (can't remember who, sorry) told me it was used
> in the KNN index. I guess this is used by pg_trgm, but are there other
> modules using it currently?
> And could you please give me some links explaining the internals of this
> index? I've been through several articles presenting of it, but none very
> satisfying.
>
> Thanks a lot in advance!
>


-- 
Regards,

Atri
*l'apprenant*



pgsql-hackers by date:

Previous
From: Simon Riggs
Date:
Subject: Re: Assertion failure when promoting node by deleting recovery.conf and restart node
Next
From: Alvaro Herrera
Date:
Subject: Re: sql_drop Event Triggerg