Optimizing Hadoop Block Placement Policy and Cluster Blocks Distribution

Main Authors: Nchimbi Edward Pius, Liu Qin, Fion Yang, Zhu Hong Ming
Format: Article
Bahasa: eng
Terbitan: , 2013
Subjects:
Online Access: https://zenodo.org/record/1087934
ctrlnum 1087934
fullrecord <?xml version="1.0"?> <dc schemaLocation="http://www.openarchives.org/OAI/2.0/oai_dc/ http://www.openarchives.org/OAI/2.0/oai_dc.xsd"><creator>Nchimbi Edward Pius</creator><creator>Liu Qin</creator><creator>Fion Yang</creator><creator>Zhu Hong Ming</creator><date>2013-08-01</date><description>The current Hadoop block placement policy do not fairly and evenly distributes replicas of blocks written to datanodes in a Hadoop cluster. This paper presents a new solution that helps to keep the cluster in a balanced state while an HDFS client is writing data to a file in Hadoop cluster. The solution had been implemented, and test had been conducted to evaluate its contribution to Hadoop distributed file system. It has been found that, the solution has lowered global execution time taken by Hadoop balancer to 22 percent. It also has been found that, Hadoop balancer respectively over replicate 1.75 and 3.3 percent of all re-distributed blocks in the modified and original Hadoop clusters. The feature that keeps the cluster in a balanced state works as a core part to Hadoop system and not just as a utility like traditional balancer. This is one of the significant achievements and uniqueness of the solution developed during the course of this research work.</description><identifier>https://zenodo.org/record/1087934</identifier><identifier>10.5281/zenodo.1087934</identifier><identifier>oai:zenodo.org:1087934</identifier><language>eng</language><relation>doi:10.5281/zenodo.1087933</relation><relation>url:https://zenodo.org/communities/waset</relation><rights>info:eu-repo/semantics/openAccess</rights><rights>https://creativecommons.org/licenses/by/4.0/legalcode</rights><subject>Balancer</subject><subject>Datanode</subject><subject>Distributed file system</subject><subject>Hadoop</subject><subject>Replicas.</subject><title>Optimizing Hadoop Block Placement Policy and Cluster Blocks Distribution</title><type>Journal:Article</type><type>Journal:Article</type><recordID>1087934</recordID></dc>
language eng
format Journal:Article
Journal
author Nchimbi Edward Pius
Liu Qin
Fion Yang
Zhu Hong Ming
title Optimizing Hadoop Block Placement Policy and Cluster Blocks Distribution
publishDate 2013
topic Balancer
Datanode
Distributed file system
Hadoop
Replicas
url https://zenodo.org/record/1087934
contents The current Hadoop block placement policy do not fairly and evenly distributes replicas of blocks written to datanodes in a Hadoop cluster. This paper presents a new solution that helps to keep the cluster in a balanced state while an HDFS client is writing data to a file in Hadoop cluster. The solution had been implemented, and test had been conducted to evaluate its contribution to Hadoop distributed file system. It has been found that, the solution has lowered global execution time taken by Hadoop balancer to 22 percent. It also has been found that, Hadoop balancer respectively over replicate 1.75 and 3.3 percent of all re-distributed blocks in the modified and original Hadoop clusters. The feature that keeps the cluster in a balanced state works as a core part to Hadoop system and not just as a utility like traditional balancer. This is one of the significant achievements and uniqueness of the solution developed during the course of this research work.
id IOS16997.1087934
institution DEFAULT
institution_type library:public
library
library DEFAULT
collection DEFAULT
city DEFAULT
province DEFAULT
repoId IOS16997
first_indexed 2022-06-06T05:09:41Z
last_indexed 2022-06-06T05:09:41Z
recordtype dc
merged_child_boolean 1
_version_ 1739486077513629696
score 17.608967