New and Existing Approaches Reviewing of Big Data Analysis with Hadoop Tools


Watheq Ghanim Mutasher
Abbas Fadhil Aljuboori


Nearly everyone is connected through social media platforms such as Facebook, Twitter, LinkedIn, and Instagram, which generate quantities of data that traditional applications are inadequate to process. Social media are regarded as an important platform for sharing information, opinions, and knowledge among many subscribers. These characteristics tie Big Data to many issues, such as data collection, storage, movement, updating, reviewing, posting, scanning, visualization, and data protection. To deal with these problems, there is a need for an adequate system that not only prepares the data but also provides meaningful analysis that can be put to use in difficult situations relevant to business, decision-making, health, social media, science, telecommunications, the environment, and other fields. A reading of previous studies shows that different analyses, such as real-time sentiment analysis, have been carried out with Hadoop and its various tools. However, dealing with Big Data remains a challenging task, and such analysis is therefore efficiently possible only through the Hadoop ecosystem. The purpose of this paper is to review the literature on the analysis of social media big data using the Hadoop framework, in order to identify most of the analysis tools available under the Hadoop umbrella and their orientations, as well as their difficulties and the modern methods they use to overcome the challenges of big data in offline and real-time processing. Real-time analytics accelerates decision-making and provides access to business metrics and reporting. A comparison between Hadoop and Spark is also presented.



Article Details

How to Cite
Mutasher WG, Aljuboori AF. New and Existing Approaches Reviewing of Big Data Analysis with Hadoop Tools. Baghdad Sci.J [Internet]. 2022 Aug. 1 [cited 2023 Dec. 8];19(4):0887. Available from:

