Comparative Analysis of Apache Sqoop and Apache Spark for Efficient Data Transfer Between Relational Databases and Hadoop Distributed File System (HDFS)

Author(s)	Sainath Muvva
Country	USA
Abstract	As adoption of big data technologies such as Hadoop grows, many companies are overhauling their data infrastructure. A crucial part of this transition is transferring both transactional and analytical data from traditional relational database management systems (RDBMS) into the new ecosystem, which enables advanced data processing and deeper analytical insight. This paper examines the tools available for importing data from relational databases into the Hadoop Distributed File System (HDFS), in particular Apache Sqoop and Apache Spark. It describes the underlying mechanisms of these tools and highlights the key distinctions between them.
Keywords	HDFS, Sqoop, Spark, SQL Loaders
Published In	Volume 2, Issue 4, July-August 2020
Published On	2020-08-25
DOI	https://doi.org/10.36948/ijfmr.2020.v02i04.25444
Short DOI	https://doi.org/g82h92

International Journal For Multidisciplinary Research