Implementation of Various Clustering Techniques

Please use this identifier to cite or link to this item: http://www.ir.juit.ac.in:8080/jspui/jspui/handle/123456789/9545

Title:	Implementation of Various Clustering Techniques
Authors:	Mahajan, Saumya Kumar, Pardeep [Guided by]
Keywords:	Data Mining Algorithm DBSCAN algorithm K-Means
Issue Date:	2014
Publisher:	Jaypee University of Information Technology, Solan, H.P.
Abstract:	Clustering is one of the important streams in data mining useful for discovering groups and identifying interesting distributions in the underlying data. This project aims in analyzing and comparing the partitional and density based clustering algorithms namely K-Means and DBSCAN. The comparison is done based on the extent to which each of these algorithms identify the clusters and their pros and cons. K-Means is a partitional clustering technique that helps to identify k clusters from a given set of n data points in d-dimensional space. It starts with k random centers and refines it at each step arriving to k clusters. DBSCAN discovers clusters of arbitrary shape relying on a density based notion of clusters. Given eps as the input parameter, unlike k-means clustering, it tries to find out all possible clusters by classifying each point as core, border or noise. DBSCAN can be expensive as computation of nearest neighbors requires computing all pair wise proximities. Our implementation would provide a comparative study of K-Means against DBSCAN algorithm.
URI:	http://ir.juit.ac.in:8080/jspui/jspui/handle/123456789/9545
Appears in Collections:	B.Tech. Project Reports

Files in This Item:

File	Description	Size	Format
Implementation of Various Clustering Techniques.pdf		526.75 kB	Adobe PDF	View/Open