International Journal For Multidisciplinary Research

E-ISSN: 2582-2160     Impact Factor: 9.24

A Widely Indexed Open Access Peer Reviewed Multidisciplinary Bi-monthly Scholarly International Journal

Call for Paper Volume 7, Issue 1 (January-February 2025) Submit your research before last 3 days of February to publish your research paper in the issue of January-February.

Comparative Analysis of Apache Sqoop and Apache Spark for Efficient Data Transfer Between Relational Databases and Hadoop Distributed File System (HDFS)

Author(s) Sainath Muvva
Country USA
Abstract With the growing adoption of big data technologies like Hadoop, many companies are overhauling their data infrastructure. A crucial aspect of this transition is the ability to transfer both transactional and analytical data from traditional relational database management systems (RDBMS) into the new ecosystem. This migration enables advanced data processing and facilitates deeper analytical insights. This paper focuses on exploring the various tools available for importing data from relational databases into the Hadoop Distributed File System (HDFS). It delves into the underlying mechanisms of these tools and highlights the key distinctions between them.
Keywords HDFS, Sqoop, Spark, SQL Loaders
Published In Volume 2, Issue 4, July-August 2020
Published On 2020-08-25
Cite This Comparative Analysis of Apache Sqoop and Apache Spark for Efficient Data Transfer Between Relational Databases and Hadoop Distributed File System (HDFS) - Sainath Muvva - IJFMR Volume 2, Issue 4, July-August 2020. DOI 10.36948/ijfmr.2020.v02i04.25444
DOI https://doi.org/10.36948/ijfmr.2020.v02i04.25444
Short DOI https://doi.org/g82h92

Share this