Re: GSoC project: K-medoids clustering in Madlib

From: Atri Sharma <atri(dot)jiit(at)gmail(dot)com>
To: Maxence AHLOUCHE <maxence(dot)ahlouche(at)gmail(dot)com>
Cc: "Iyer, Rahul" <Rahul(dot)Iyer(at)emc(dot)com>, "pgsql-students(at)postgresql(dot)org" <pgsql-students(at)postgresql(dot)org>, "devel(at)madlib(dot)net" <devel(at)madlib(dot)net>, "Philip, Sujit" <Sujit(dot)Philip(at)emc(dot)com>
Subject: Re: GSoC project: K-medoids clustering in Madlib
Date: 2013-04-20 13:00:31
Message-ID: 2DAD673D-4C46-4039-9D83-AE0D5C16764C@gmail.com
Lists: pgsql-students

On 20-Apr-2013, at 16:11, Maxence AHLOUCHE <maxence(dot)ahlouche(at)gmail(dot)com> wrote:

> Oops, forgot to attach the output!
>
>
> 2013/4/20 Maxence AHLOUCHE <maxence(dot)ahlouche(at)gmail(dot)com>
>> Hi all!
>>
>> I've had a bit of fun with the k-means clustering, and have made a small script to visualize the result of the classification.
>> However, I couldn't work out how to assign a cluster to each point from the output of the algorithm. Could someone point me in the right direction, please?
>>
>> My script is written in Python 3 and uses py-postgresql (http://python.projects.pgfoundry.org/) as its PostgreSQL interface. It also requires Pillow (a PIL fork), which you can find here: https://pypi.python.org/pypi/Pillow/2.0.0.
>>
>> Before the first run, you may want to change the settings (at the top of the file) to connect to your PostgreSQL server.
>> The script will create a table in your database, populate it with random groups of points, and then call the k-means algorithm on it. Finally, it will generate a PNG image, displaying the points and the centroids.
>>
>> For a first run, use something like this:
>> ./k-means_test.py --regen -o clustered_data.png
>>
>> You can call "./k-means_test.py -h" for a list of available options.
>>
>> Attached are my script and an example of its output.
>>
>> By the way, I'll have a lot of work next week, as I have several exams coming up and a big project to do (about empirical orthogonal functions), so I'll probably be inactive for a few days! After that I'll be on holiday, so I will be able to focus on MADlib and GSoC :)
>>
>> Regards,
>> Maxence
>>
>>
>>

Very interesting! The results look encouraging, although this is in Python :)

Good work!
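
On the question quoted above about assigning a cluster to each point: one common approach is to give each point the index of its nearest centroid. The following is only a minimal sketch, assuming Euclidean distance and assuming the centroids have already been read back from the database as a list of coordinate tuples. The function name assign_clusters and the sample data are hypothetical and do not reflect the actual output format of the k-means routine.

```python
import math


def assign_clusters(points, centroids):
    """Return, for each point, the index of its nearest centroid.

    Assumption: points and centroids are sequences of equal-length
    coordinate tuples, and "nearest" means smallest Euclidean distance.
    """
    assignments = []
    for point in points:
        # Distance from this point to every centroid.
        distances = [math.dist(point, centroid) for centroid in centroids]
        # Index of the smallest distance is the cluster id.
        assignments.append(distances.index(min(distances)))
    return assignments


if __name__ == "__main__":
    # Hypothetical sample data: two well-separated groups.
    points = [(0.0, 0.0), (1.0, 1.0), (9.0, 9.0), (10.0, 10.0)]
    centroids = [(0.5, 0.5), (9.5, 9.5)]
    print(assign_clusters(points, centroids))  # prints [0, 0, 1, 1]
```

The resulting indices could then be used to pick a color per point when drawing the PNG. Note that math.dist requires Python 3.8 or later.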

Regards,

Atri
