Rahul Chowdhury

Work place: School of Computing Science and Engineering VIT University, Vellore-632014 Tamil Nadu, India

E-mail: chowdhuryrahul5@gmail.com

Website:

Research Interests: Computer systems and computational processes, Computer Architecture and Organization, Data Mining, Database Management System, Data Structures and Algorithms

Biography

Rahul Chowdhury is a student, pursuing a Bachelors of Technology in Computer Science and Engineering at VIT University, Vellore, Tamil Nadu, India. He has written papers in text mining, rough sets and big data. Competitive Coding amazes him. He likes to work on real-time systems and machine learning based design models. He has been a part of various hackathons and working on application designs is his forte.

Author Articles
MMeMeR: An Algorithm for Clustering Heterogeneous Data using Rough Set Theory

By B.K. Tripathy Akarsh Goyal Rahul Chowdhury Patra Anupam Sourav

DOI: https://doi.org/10.5815/ijisa.2017.08.03, Pub. Date: 8 Aug. 2017

In recent times enumerable number of clustering algorithms have been developed whose main function is to make sets of objects having almost the same features. But due to the presence of categorical data values, these algorithms face a challenge in their implementation. Also some algorithms which are able to take care of categorical data are not able to process uncertainty in the values and so have stability issues. Thus handling categorical data along with uncertainty has been made necessary owing to such difficulties. So, in 2007 MMR algorithm was developed which was based on basic rough set theory. MMeR was proposed in 2009 which surpassed the results of MMR in taking care of categorical data and it could also handle heterogeneous values as well. SDR and SSDR were postulated in 2011 which were able to handle hybrid data. These two showed more accuracy when compared to MMR and MMeR. In this paper, we further make improvements and conceptualize an algorithm, which we call MMeMeR or Min-Mean-Mean-Roughness. It takes care of uncertainty and also handles heterogeneous data. Standard data sets have been used to gauge its effectiveness over the other methods.

[...] Read more.
Other Articles