Downloads

Keywords:

highly loaded web applications; auto-scaling; machine learning; load forecasting; anomaly detection; Kubernetes; Sidecar; event-driven

Integration of machine learning into high-load web application architectures

Authors

Serhii Savchenko1
Senior Full Stack Developer, IT Development (Web Dev) New York, NY 1

Abstract

The article analyzes the feasibility of implementing machine learning in high-load web application architectures.  An architectural technique for integrating machine learning (ML) modules into microservice high-load Kubernetes-based web applications is considered. The basis is the "Sidecar + event-driven" pattern, which allows each service to be supplemented with load forecasting on LSTM models and anomaly detection through auto-encoders. The ML Predictor, Decision Manager, and Actuator components are embedded as separate containers that communicate via the Kafka event bus. The methodological basis of the work, which made it possible to broadly consider the features of the machine learning implementation process in high-load web application architectures, was based on the results of other studies. The information presented in the article is of considerable interest to architects of distributed systems and DevOps engineers specializing in building fault-tolerant, scalable high-traffic web platforms that need to integrate machine learning models without performance degradation and while maintaining SLA requirements. In addition, the materials of the article will be useful to researchers in the field of MLOps and data engineers involved in optimizing data pipelines and ensuring low latency of inference under extreme loads, as well as graduate students developing methods for adaptive resource balancing in heterogeneous computing environments.

Article Details

Published

2025-06-12

Section

Articles

License

Copyright (c) 2025 International Journal of Engineering and Computer Science Creative Commons License

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.

How to Cite

Integration of machine learning into high-load web application architectures. (2025). International Journal of Engineering and Computer Science, 14(06), 27340-27345. https://doi.org/10.18535/ijecs.v14i06.5157