Home > mailing lists

Re: GSoC project: K-medoids clustering in Madlib - Mailing list pgsql-students

From	Maxence AHLOUCHE
Subject	Re: GSoC project: K-medoids clustering in Madlib
Date	April 21, 2013 21:21:22
Msg-id	CAJeaomWVrTvP5O3oYePCnmTnMQYkz2_JkC_8MgrrDf6ui+z1uA@mail.gmail.com Whole thread
In response to	Re: GSoC project: K-medoids clustering in Madlib (Atri Sharma <atri.jiit@gmail.com>)
List	pgsql-students

Tree view

2013/4/21 Atri Sharma <atri.jiit@gmail.com>

Interesting! Good work!

Could you draw up a summary, giving your findings about the performance of different algorithms,and which one should be implemented,or both(k means++ vs k medoids).

Regards,

Atri

From the few articles I've already read, I've found that K-medoids clustering usually goes faster on standard datasets such as the ones I generate). But I'll look for more detailed information during the week, and report what I'll have found here!

By the way, have you got any idea of other forms of datasets that could be useful to test?

2013/4/21 <hellerstein@cs.berkeley.edu>

Very cool!

May I suggest generating a visualization in a web toolkit? Perhaps the new vega library would be simplest (http://trifacta.github.io/vega/) or the more popular but lower-level D3.js?

More generally, a project to connect MADlib outputs to vega vis specifications seems like it would be enormously useful!

Joe

I'll give it a look during my holidays, in a week! It would indeed be nice if one just had to open a webpage to test my work!

Considering your other idea, aren't MADlib outputs PostgreSQL/GreenPlum outputs? If so, only a database connector is required, which probably already exists (I may be wrong, I had never heard of D3.js or Vega before, and I don't know well the MADlib project yet).

--
Maxence Ahlouche
06 06 66 97 00
93 avenue Paul DOUMER
24100 Bergerac

pgsql-students by date:

From: Atri Sharma
Date: 21 April 2013, 18:03:36
Subject: Re: GSoC project: K-medoids clustering in Madlib

From: Thom Brown
Date: 23 April 2013, 18:09:53
Subject: Student GSoC applications

Re: GSoC project: K-medoids clustering in Madlib - Mailing list pgsql-students

Previous

Next