Mohamed Abubaker

Work place: Dept. of Computer Engineering, Islamic University of Gaza, Gaza, Palestine

E-mail: mabubaker@hotmail.com

Website:

Research Interests: Computer systems and computational processes, Artificial Intelligence, Computer Vision, Data Structures and Algorithms

Biography

Mohamed Abubaker has received his B.Sc. degree in Computer Engineering in 2005 from Yarmouk University (YU) in Jordan. In 2011, he has completed his M.Sc. in Computer engineering from the Islamic University of Gaza in Palestine. In 2004 he worked as a researcher and developer at ENIC telecom Lille 1 in France for implementing and verifying the Packet-E-Model. His research interests include multi-agent systems where he developed the YU soccer game simulator which simulates an intelligent collaborative robot in soccer game. He focuses on artificial intelligence, computer vision and mobile computing.

Author Articles
Efficient Data Clustering Algorithms: Improvements over Kmeans

By Mohamed Abubaker Wesam Ashour

DOI: https://doi.org/10.5815/ijisa.2013.03.04, Pub. Date: 8 Feb. 2013

This paper presents a new approach to overcome one of the most known disadvantages of the well-known Kmeans clustering algorithm. The problems of classical Kmeans are such as the problem of random initialization of prototypes and the requirement of predefined number of clusters in the dataset. Randomly initialized prototypes can often yield results to converge to local rather than global optimum. A better result of Kmeans may be obtained by running it many times to get satisfactory results. The proposed algorithms are based on a new novel definition of densities of data points which is based on the k-nearest neighbor method. By this definition we detect noise and outliers which affect Kmeans strongly, and obtained good initial prototypes from one run with automatic determination of K number of clusters. This algorithm is referred to as Efficient Initialization of Kmeans (EI-Kmeans). Still Kmeans algorithm used to cluster data with convex shapes, similar sizes, and densities. Thus we develop a new clustering algorithm called Efficient Data Clustering Algorithm (EDCA) that uses our new definition of densities of data points. The results show that the proposed algorithms improve the data clustering by Kmeans. EDCA is able to detect clusters with different non-convex shapes, different sizes and densities.

[...] Read more.
Other Articles