Open Access

The Convergence of Artificial Intelligence and Cloud-Native Orchestration: A Comprehensive Analysis Of AI-Driven Devops, Mlops, And Automated Incident Management for Agile Excellence

PDF

Suhana Tabrez ¹

⁴ Department of Software Engineering, Stanford University, United States of America

Abstract

The rapid evolution of cloud-native computing and microservices architectures has introduced unprecedented complexity into the software development lifecycle, necessitating a paradigm shift from traditional manual operations to automated, intelligent systems. This research provides an exhaustive exploration of the integration of Artificial Intelligence (AI) and Machine Learning (ML) within the DevOps and Site Reliability Engineering (SRE) domains. By synthesizing foundational principles of AI-driven continuous testing, proactive auto-scaling, and automated incident management, the study delineates a framework for achieving agile excellence. We examine the transition from DevOps to Machine Learning Operations (MLOps), identifying the architectural requirements for maintaining distributed edge and container-based services. Furthermore, the research investigates the prioritization of security challenges using multi-criteria decision-making models and evaluates the efficacy of memory leak and deadlock detection in distributed systems. Through a systematic analysis of current research trends, this article highlights the critical role of observability and quality-aware research in the container age. The findings suggest that the integration of ensemble models for predictive scaling and AI-based threat detection significantly enhances software quality and operational reliability. This article concludes with a roadmap for future research, emphasizing the need for unified full-stack environments and agile network access control to mitigate the inherent risks of modern cloud-hosted applications.

How to Cite

Suhana Tabrez. (2025). The Convergence of Artificial Intelligence and Cloud-Native Orchestration: A Comprehensive Analysis Of AI-Driven Devops, Mlops, And Automated Incident Management for Agile Excellence. Frontiers in Emerging Multidisciplinary Sciences, 2(11), 6–9. Retrieved from https://irjernet.com/index.php/fems/article/view/311

⬇ Endnote/Zotero/Mendeley (RIS) ⬇ BibTeX

References

Alnafessah A, Gias AU, Wang R, Zhu L, Casale G, Filieri A. Quality-Aware DevOps Research: Where Do We Stand? IEEE Access. 2021;9:44476–44489.

Diekmann C, Naab J, Korsten A, Carle G. Agile Network Access Control in the Container Age. IEEE Transactions on Network and Service Management. 2019;16(1):41–55.

Goyal Deepika. AI-Driven DevOps for Agile Excellence with Machine Learning. Insights2Techinfo. 2024.

Koskinen Eric, Herlihy Maurice. Dreadlocks: Efficient Deadlock Detection. SPAA. 2008.

Kreuzberger D, Kuhl N, Hirschi S. Machine Learning Operations (MLOps): Overview, Definition, and Architecture. IEEE Access. 2023;11:31866–31879.

Li Z, Zhang Y, Liu Y. Towards a full-stack devops environment (platform-as-a-service) for cloud-hosted applications. Tsinghua Science and Technology. 2017;22(01):1–9.

Perez JE, Gonzalez-Prieto A, Diaz J, Lopez-Fernandez D, Garcia-Martin J, Yague A. DevOps Research-Based Teaching Using Qualitative Research and Inter-Coder Agreement. IEEE Transactions on Software Engineering. 2022;48(9):3378–3393.

Rafi S, Yu W, MA Akbar, Alsanad A, Gumaei A. Prioritization Based Taxonomy of DevOps Security Challenges Using PROMETHEE. IEEE Access. 2020;8:105426–105446.

Samir M, Wassif KT, Makady SH. Proactive Auto-Scaling Approach of Production Applications Using an Ensemble Model. IEEE Access. 2023;11:25008–25019.

Shabrin Roohi S, Devi Prasad B, Prabu D, Pallavi RS, Revathi P. Memory Leak Detection in Distributed System. World Academy of Science, Engineering and Technology. 2006;16.

Skelton M. Joined-Up Thinking. ITNOW. 2016;58(1):40–41.

Usman M, Ferlin S, Brunstrom A, Taheri J. A Survey on Observability of Distributed Edge & Container-Based Microservices. IEEE Access. 2022;10:86904–86919.

Vadde Bharath Chandra, Munagandla Vamshi Bharath. Integrating AI-Driven Continuous Testing in DevOps for Enhanced Software Quality. Journal of Artificial Intelligence in Medicine. 2023;14(1):1-9.

S. R. Varanasi, "A Survey on Automated Incident Management Practices in Site Reliability Engineering for Cloud-Native Environments," 2025 International Conference on Electronics and Computing, Communication Networking Automation Technologies (ICEC2NT), Pune, India, 2025, pp. 1-7, doi: 10.1109/ICEC2NT65402.2025.11380120.