Home
World Journal of Advanced Research and Reviews
International Journal with High Impact Factor for fast publication of Research and Review articles

Main navigation

  • Home
    • Journal Information
    • Editorial Board Members
    • Reviewer Panel
    • Abstracting and Indexing
    • Journal Policies
    • Our CrossMark Policy
    • Publication Ethics
    • Issue in Progress
    • Current Issue
    • Past Issues
    • Instructions for Authors
    • Article processing fee
    • Track Manuscript Status
    • Get Publication Certificate
    • Join Editorial Board
    • Join Reviewer Panel
  • Contact us
  • Downloads

eISSN: 2581-9615 || CODEN: WJARAI || Impact Factor 8.2 ||  CrossRef DOI

Research and review articles are invited for publication in March 2026 (Volume 29, Issue 3) Submit manuscript

The role of AI/ML in improving system reliability of large-scale distributed systems

Breadcrumb

  • Home
  • The role of AI/ML in improving system reliability of large-scale distributed systems

Aravind Sekar *

Twilio Inc., USA.

Review Article

World Journal of Advanced Research and Reviews, 2025, 26(01), 1007-1020

Article DOI: 10.30574/wjarr.2025.26.1.1064

DOI url: https://doi.org/10.30574/wjarr.2025.26.1.1064

Received on 23 February 2025; revised on 07 April 2025; accepted on 09 April 2025

This article explores the transformative role of artificial intelligence and machine learning in enhancing system reliability across large-scale distributed systems. The article examines how AI/ML technologies are revolutionizing reliability engineering through predictive capacity management, autonomous monitoring, advanced anomaly detection, and integrated security approaches. The article demonstrates that properly implemented AI/ML solutions significantly reduce incident frequency and resolution times while optimizing resource utilization and decreasing operational costs. We present a comprehensive theoretical framework for AI-enhanced reliability and analyze real-world applications across multiple domains. The article evaluates both technical implementations and their quantifiable business impacts, showing typical operational cost reductions and engineer toil reductions in mature deployments. The article addresses critical challenges including data quality constraints, model explainability issues, and human-AI collaboration complexities while exploring promising future directions in reinforcement learning, real-time inference, and self-improving frameworks. This article provides reliability engineers, system architects, and organizational leaders with actionable insights for implementing AI/ML approaches that enhance distributed system resilience in increasingly complex technological environments. 

Aiops; System Reliability; Distributed Systems; Predictive Remediation; Autonomous Recovery

https://wjarr.com/sites/default/files/fulltext_pdf/WJARR-2025-1064.pdf

Preview Article PDF

Aravind Sekar. The role of AI/ML in improving system reliability of large-scale distributed systems. World Journal of Advanced Research and Reviews, 2025, 26(1), 1007-1020. Article DOI: https://doi.org/10.30574/wjarr.2025.26.1.1064

Copyright © Author(s). All rights reserved. This article is published under the terms of the Creative Commons Attribution 4.0 International License (CC BY 4.0), which permits use, sharing, adaptation, distribution, and reproduction in any medium or format, as long as appropriate credit is given to the original author(s) and source, a link to the license is provided, and any changes made are indicated.


All statements, opinions, and data contained in this publication are solely those of the individual author(s) and contributor(s). The journal, editors, reviewers, and publisher disclaim any responsibility or liability for the content, including accuracy, completeness, or any consequences arising from its use.

Get Certificates

Get Publication Certificate

Download LoA

Check Corssref DOI details

Issue details

Issue Cover Page

Editorial Board

Table of content

Copyright © 2026 World Journal of Advanced Research and Reviews - All rights reserved

Developed & Designed by VS Infosolution