Parallel Evaluation of Large Scale Hierarchical Clustering Results
Date
Authors
Advisor
Publisher
Polytechnic University of Puerto Rico
Item Type
Article
- Total Views Total Views1
- Total Downloads Total Downloads2
Abstract
Abstract ⎯ Data clustering refers to the automatic grouping of object based on their similarity, i.e., similar objects should be in the same group and dissimilar objects should be in different groups. In particular, for hierarchical clustering algorithms there is also the notion of a hierarchy in which the objects and the cluster fit. Clustering is a fundamental task in data mining, machine learning, information retrieval, bioinformatics, and image analysis, among others. It is important to evaluate the result of clustering algorithms. However most evaluations approaches are geared towards nonhierarchical clustering approaches; this research explores how to use traditional validity measures to evaluate and assess hierarchical clustering results.
Key Terms ⎯ Clustering, Data Clustering, Hierarchical Clustering, and Validity Measures.
Description
Design Project Article for the Graduate Programs at Polytechnic University of Puerto Rico
Keywords
Citation
Cruz Rodríguez, D. (2012). Parallel evaluation of large scale hierarchical clustering results [Unpublished manuscript]. Graduate School, Polytechnic University of Puerto Rico.