Title | Fast clustering algorithm based on reference and density |
Authors | Ma, Shuai Wang, Teng-Jiao Tang, Shi-Wei Yang, Dong-Qing Gao, Jun |
Affiliation | Dept. of Comp. Sci. and Technol., Peking Univ., Beijing 100871, China Natl. Lab. on Machine Perception, Peking Univ., Beijing 100871, China |
Issue Date | 2003 |
Publisher | ruan jian xue baojournal of software |
Citation | Ruan Jian Xue Bao/Journal of Software.2003,14,(6),1089-1095. |
Abstract | The efficiency of data mining algorithms is strongly needed with data becoming larger and larger. Density-Based clustering analysis is one kind of clustering analysis methods that can discover clusters with arbitrary shape and is insensitive to noise data. A new kind of clustering algorithm that is called CURD (clustering using references and density) is presented. The creativity of CURD is capturing the shape and extent of a cluster by references, and then analyzes the data based on the references. CURD keeps the ability of density based clustering method's good features, and it can reach high efficiency because of its linear time complexity, so it can be used in mining very large databases. Both theory analysis and experimental results confirm that CURD can discover clusters with arbitrary shape and is insensitive to noise data. In the meanwhile, its executing efficiency is much higher than traditional DBSCAN algorithm based on R-tree. |
URI | http://hdl.handle.net/20.500.11897/294081 |
ISSN | 10009825 |
Indexed | EI |
Appears in Collections: | 信息科学技术学院 机器感知与智能教育部重点实验室 |