Work place: Department of Computer Applications, National Institute of Technology, Tiruchirappalli - 620015, India
E-mail: npgopalan@nitt.edu
Website:
Research Interests: Computational Science and Engineering, Image Compression, Image Manipulation, Distributed Computing, Image Processing, Data Mining, Cellular Automata
Biography
N.P.Gopalan: Professor of Computer Applications Department, National Institute of Technology, Tiruchirappalli, TamilNadu, India. Done PhD from IISC Bangalore. Interested in Data mining, Web Technology, Distributed Computing and Theoretical Computer Science.
DOI: https://doi.org/10.5815/ijitcs.2015.04.08, Pub. Date: 8 Mar. 2015
The data generated and processed by modern computing systems burgeon rapidly. MapReduce is an important programming model for large scale data intensive applications. Hadoop is a popular open source implementation of MapReduce and Google File System (GFS). The scalability and fault-tolerance feature of Hadoop makes it as a standard for BigData processing. Hadoop uses Hadoop Distributed File System (HDFS) for storing data. Data reliability and fault-tolerance is achieved through replication in HDFS. In this paper, a new technique called Delay Scheduling Based Replication Algorithm (DSBRA) is proposed to identify and replicate (dereplicate) the popular (unpopular) files/blocks in HDFS based on the information collected from the scheduler. Experimental results show that, the proposed method achieves 13% and 7% improvements in response time and locality over existing algorithms respectively.
[...] Read more.Subscribe to receive issue release notifications and newsletters from MECS Press journals