Improvised Distributions framework of Hadoop: A review
Author (s)
Baydaa Hassan Husain & Subhi R. M. Zeebaree
Abstract
HADOOP is an open-source virtualization technology that allows the distributed processing of large data sets across standardized server clusters. With two modules, HADOOP Distributed File System (HDFS) and MapReduce framework, it is designed to scale single servers to thousands of computers, providing local computation and storage. Over a decade after HADOOP emerged on the forefront as an open system for Big Data analysis. Its growth has prompted several improvisations for particular data processing needs, based on the type of processing conditions at various periods of computation. This paper, through reviewing several kinds of research provides the basic HADOOP system structure and the description of the MapReduce, HDFS Efficiency. Explaining how the HADOOP framework can overcome the “5Vs” challenges in Big Data. However, in addition to the many benefits of the HADOOP system, like fault tolerance, reliability, high availability, scalable, decreases execution time, reduces latency, improve the security issues, improving the quality of data analysis, better scheduling model, and cost-efficiently. On the other hand, there were some barriers and challenges regarding adjusting data regularly, security issues, and load balancing. Finally, the certainly benefit and challenges of the HADOOP system have been represented paving the way for the future research to find solutions to these challenges.
Keywords: HADOOP, HDFS, MapReduce, Big Data.
Title: | Improvised Distributions framework of Hadoop: A review |
---|---|
Author: | Baydaa Hassan Husain & Subhi R. M. Zeebaree |
Journal Name: | International Journal of Science and Business |
Website: | ijsab.com |
ISSN: | ISSN 2520-4750 (Online), ISSN 2521-3040 (Print) |
DOI: | https://doi.org/10.5281/zenodo.4461761 |
Media: | Online |
Volume: | 5 |
Issue: | 2 |
Acceptance Date: | 20/01/2021 |
Date of Publication: | 25/01/2021 |
PDF URL: | https://ijsab.com/wp-content/uploads/668.pdf |
Free download: | Available |
Page: | 31-41 |
First Page: | 31 |
Last Page: | 41 |
Paper Type: | Literature Review |
Current Status: | Published |
Cite This Article:
Baydaa Hassan Husain & Subhi R. M. Zeebare (2021). Improvised Distributions framework of Hadoop: A review. International Journal of Science and Business, 5(2), 31-41. doi: https://doi.org/10.5281/zenodo.4461761
Retrieved from https://ijsab.com/wp-content/uploads/668.pdf
About Author (s)
Baydaa Hassan Husain, ISE Department, Erbil Polytechnic University, Erbil – Kurdistan Region – Iraq, Baydaa.mei20@epu.edu.iq.
Subhi R. M. Zeebaree (corresponding author), Information Technology Department, Duhok Polytechnic University, Duhok – Kurdistan Region – Iraq, subhi.rafeeq@dpu.edu.krd.
DOI: https://doi.org/10.5281/zenodo.4461761