
International Journal For Multidisciplinary Research
E-ISSN: 2582-2160
•
Impact Factor: 9.24
A Widely Indexed Open Access Peer Reviewed Multidisciplinary Bi-monthly Scholarly International Journal
Home
Research Paper
Submit Research Paper
Publication Guidelines
Publication Charges
Upload Documents
Track Status / Pay Fees / Download Publication Certi.
Editors & Reviewers
View All
Join as a Reviewer
Reviewer Referral Program
Get Membership Certificate
Current Issue
Publication Archive
Conference
Publishing Conf. with IJFMR
Upcoming Conference(s) ↓
WSMCDD-2025
GSMCDD-2025
Conferences Published ↓
RBS:RH-COVID-19 (2023)
ICMRS'23
PIPRDA-2023
Contact Us
Plagiarism is checked by the leading plagiarism checker
Call for Paper
Volume 7 Issue 1
January-February 2025
Indexing Partners



















The Future of AI in Production: Leveraging Kubernetes for Large Language Model Deployment
Author(s) | Shikhar Srivastava, Harsh Srivastava, Ayushi Jaymani, Palak Singh |
---|---|
Country | India |
Abstract | Deploying Large Language Models (LLMs) at scale presents significant challenges in resource allocation, cost-efficiency, latency, multi-cloud compatibility, and system reliability. This paper introduces a transformative approach leveraging Docker’s lightweight containerization and Kubernetes’ robust orchestration to redefine LLM deployment. Our proposed architecture ensures seamless scalability, optimal resource utilization, and multi-cloud flexibility, while addressing ethical, environmental, and security concerns. Through compelling case studies, we demonstrate how these technologies revolutionize AI workflows, delivering unmatched performance, cost savings, and operational excellence for large-scale LLM production systems. |
Keywords | Large Language Models, Docker, Kubernetes, Containerization, Orchestration, Multi-cloud Deployment, AI Workflows, Cloud Computing, Resource Management, Cost Efficiency, Security Best Practices, Ethical Considerations. |
Field | Engineering |
Published In | Volume 7, Issue 1, January-February 2025 |
Published On | 2025-01-29 |
Cite This | The Future of AI in Production: Leveraging Kubernetes for Large Language Model Deployment - Shikhar Srivastava, Harsh Srivastava, Ayushi Jaymani, Palak Singh - IJFMR Volume 7, Issue 1, January-February 2025. DOI 10.36948/ijfmr.2025.v07i01.36056 |
DOI | https://doi.org/10.36948/ijfmr.2025.v07i01.36056 |
Short DOI | https://doi.org/g834d3 |
Share this

E-ISSN 2582-2160

CrossRef DOI is assigned to each research paper published in our journal.
IJFMR DOI prefix is
10.36948/ijfmr
Downloads
All research papers published on this website are licensed under Creative Commons Attribution-ShareAlike 4.0 International License, and all rights belong to their respective authors/researchers.
