Please use this identifier to cite or link to this item:
Title: Implementation of Various Clustering Techniques
Authors: Mahajan, Saumya
Kumar, Pardeep [Guided by]
Keywords: Data Mining
DBSCAN algorithm
Issue Date: 2014
Publisher: Jaypee University of Information Technology, Solan, H.P.
Abstract: Clustering is one of the important streams in data mining useful for discovering groups and identifying interesting distributions in the underlying data. This project aims in analyzing and comparing the partitional and density based clustering algorithms namely K-Means and DBSCAN. The comparison is done based on the extent to which each of these algorithms identify the clusters and their pros and cons. K-Means is a partitional clustering technique that helps to identify k clusters from a given set of n data points in d-dimensional space. It starts with k random centers and refines it at each step arriving to k clusters. DBSCAN discovers clusters of arbitrary shape relying on a density based notion of clusters. Given eps as the input parameter, unlike k-means clustering, it tries to find out all possible clusters by classifying each point as core, border or noise. DBSCAN can be expensive as computation of nearest neighbors requires computing all pair wise proximities. Our implementation would provide a comparative study of K-Means against DBSCAN algorithm.
Appears in Collections:B.Tech. Project Reports

Files in This Item:
File Description SizeFormat 
Implementation of Various Clustering Techniques.pdf526.75 kBAdobe PDFView/Open

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.