Scaling up Data Mining Algorithms for Big Data

Pankras K. Kandengukila; Daudi Mashauri

doi:10.36948/ijfmr.2025.v07i01.34838

Author(s)	Pankras K. Kandengukila, Daudi Mashauri
Country	Tanzania
Abstract	Rapid advances in science and technology, together with the continual replacement of digital equipment, have ushered in today's era of big data. Automatically discovering and extracting hidden knowledge, in the form of patterns, from such data is known as data mining. However, the big data era poses a series of challenges for data mining techniques, including excessive processing time, insufficient memory capacity and excessive power consumption. The aim of this paper is to study how data mining algorithms can be scaled up for big data, using Random Forest and Naïve Bayes. The background and applications of data mining, big data and cloud computing are briefly introduced, together with the basic principles of Random Forest, Naïve Bayes and the MapReduce model in cloud computing. The feasibility of parallelizing Random Forest and Naïve Bayes is then studied. Two parallel algorithms based on MapReduce, one for Random Forest and one for Naïve Bayes, are developed and implemented on the Hadoop platform. Finally, the parallel algorithms are validated experimentally, and their execution efficiency is analyzed using results on data sets of different sizes and different numbers of clusters. The results show that the proposed methods perform well and can be applied to big data processing.
Keywords	Data Mining, Big data, Cloud Computing, Random Forest, Naïve Bayes
Field	Computer > Data / Information
Published In	Volume 7, Issue 1, January-February 2025
Published On	2025-01-19
DOI	https://doi.org/10.36948/ijfmr.2025.v07i01.34838
Short DOI	https://doi.org/g82gwt
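
The abstract describes parallelizing Naïve Bayes with MapReduce but gives no implementation details on this page. The following is only a minimal single-machine sketch of the general idea, not the authors' Hadoop implementation: each data split is mapped to partial counts, and the partial counts are reduced by summation. The function names (`map_partition`, `reduce_counts`, `predict`), the toy records and the use of Laplace smoothing are all illustrative assumptions.

```python
from collections import Counter
from functools import reduce
import math

# Illustrative sketch only: names and data are hypothetical, not from the paper.

def map_partition(records):
    """Map step: count class labels and (feature, value, label) triples in one split."""
    counts = Counter()
    for features, label in records:
        counts[("class", label)] += 1
        for i, value in enumerate(features):
            counts[("feat", i, value, label)] += 1
    return counts

def reduce_counts(a, b):
    """Reduce step: merge two partial count tables by summing per key."""
    return a + b

def predict(counts, features, labels, n_values):
    """Pick the label with the highest log-posterior (Laplace smoothing assumed)."""
    total = sum(counts[("class", c)] for c in labels)
    best_label, best_score = None, -math.inf
    for c in labels:
        n_c = counts[("class", c)]
        score = math.log(n_c / total)
        for i, value in enumerate(features):
            score += math.log((counts[("feat", i, value, c)] + 1) / (n_c + n_values[i]))
        if score > best_score:
            best_label, best_score = c, score
    return best_label

# Two toy splits standing in for blocks of a distributed data set.
splits = [
    [(("sunny", "hot"), "no"), (("rainy", "mild"), "yes")],
    [(("sunny", "mild"), "no"), (("overcast", "hot"), "yes"), (("rainy", "hot"), "yes")],
]
model = reduce(reduce_counts, map(map_partition, splits))
labels = ["no", "yes"]
n_values = [3, 2]  # distinct values per feature in the toy data
print(predict(model, ("sunny", "hot"), labels, n_values))
```

Because the counts are additive, the merged model is identical to one trained on the unsplit data, which is what makes Naïve Bayes training a natural fit for the map and reduce phases.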

E-ISSN 2582-2160

All research papers published on this website are licensed under Creative Commons Attribution-ShareAlike 4.0 International License, and all rights belong to their respective authors/researchers.

International Journal For Multidisciplinary Research

A Widely Indexed Open Access Peer Reviewed Multidisciplinary Bi-monthly Scholarly International Journal
