Department of Computer Science and Engineering-Data Science, Guru Nanak Institutions Technical Campus, Hyderabad, India.
World Journal of Advanced Research and Reviews, 2025, 26(03), 864-883
Article DOI: 10.30574/wjarr.2025.26.3.2197
Received on 25 April 2025; revised on 05 June 2025; accepted on 07 June 2025
This study presents a scalable and efficient solution for advanced anomaly detection in network traffic using Azure Databricks and machine learning techniques. Modern networks generate massive volumes of traffic data, making manual detection of anomalies or cyber threats challenging. Traditional tools, such as RDBMS and Hadoop, are slow and not designed for real-time security monitoring. To address these challenges, the proposed system utilizes Azure Databricks, a unified cloud platform for big data processing and machine learning. Network traffic logs were cleaned and transformed using PySpark to extract features, such as IP addresses, session duration, data transfer, and packet counts. K-means clustering was then applied to group similar traffic patterns and identify anomalies without the need for labeled data. Model performance was evaluated using the Silhouette Score to ensure meaningful and well-separated clusters. The objective of this study is to provide a comprehensive overview of recent advancements in abnormality detection, focusing on emerging technologies and potential future opportunities. All stages, from data ingestion to anomaly detection, were executed within a single databricks notebook, thus requiring a minimal setup. The system performs efficiently even on low-cost Azure plans, making it accessible to small teams, students, and researchers. This solution enables real-time threat detection, automatic scaling, and quick incident response, offering a faster, smarter, and more cost-effective alternative to traditional network security methods.
Network Traffic; Anomaly Detection; Azure Databricks; K-Means Clustering; Silhouette Score
Preview Article PDF
Sai Yathin Manugula, Dheeraj Varma Kalidindi, Sindhu Sri Gogikari and Srinivas Rao Billakanti. Anomaly detection in network traffic using azure machine learning and log analytics. World Journal of Advanced Research and Reviews, 2025, 26(3), 864-883. Article DOI: https://doi.org/10.30574/wjarr.2025.26.3.2197