Self-Healing Subscription-Based Cloud Infrastructure: A Technical Review

Authors

  • Himanshu Pandey

DOI:

https://doi.org/10.22399/ijcesen.5209

Keywords:

Cloud Computing, Self-Healing Systems, Service Level Agreements, API Management, Anomaly Detection

Abstract

Enterprise cloud environments struggle with maintaining Service Level Agreements. Configuration errors happen frequently. Security threats emerge constantly. Infrastructure failures disrupt operations without warning. Manual intervention cannot keep pace with these challenges. A novel framework tackles these problems head-on. It uses automated API-level controls combined with machine learning detection. The system has an API Gatekeeper at its core. This component selectively disables specific application programming interfaces when problems occur. The isolation is granular. It prevents damage from spreading while keeping other services running. A Compliance Engine watches resource utilization patterns around the clock. Machine learning algorithms build baseline behavior profiles. They spot deviations that signal potential threats or performance problems. When anomalies show up, the framework toggles off problematic APIs right away. Automated remediation kicks in to fix root causes. No human intervention needed. After remediation succeeds and compliance is verified, the system brings everything back online. It re-enables the disabled APIs. Response times drop dramatically compared to manual processes. The framework keeps SLA compliance consistent through proactive detection and automated containment. Organizations get better service reliability. Operational overhead goes down. Security posture gets stronger. The API toggle mechanism marks real progress in cloud service management. It enables precise control at the function level instead of shutting down entire services.

References

1. Shih, et al., "CLOUD SERVICE FRAMEWORK," US 11,709,723 В2, 2023. Available: https://patentimages.storage.googleapis.com/2c/29/84/25dfa3bf9d21b7/US11709723.pdf

2. Peter Mell and Timothy Grance, "The NIST Definition of Cloud Computing," National Institute of Standards and Technology, 2011. Available: https://nvlpubs.nist.gov/nistpubs/legacy/sp/nistspecialpublication800-145.pdf

3. Michael Armbrust, et al., "A view of cloud computing," ACM Digital Library, 2010. Available: https://dl.acm.org/doi/10.1145/1721654.1721672

4. Qi Zhang, et al. "Cloud computing: state-of-the-art and research challenges," Journal of Internet Services and Applications, 2010. Available: https://link.springer.com/article/10.1007/s13174-010-0007-6

5. Varun Chandola, et al., "Anomaly detection: A survey," ACM Digital Library, 2009. Available: https://dl.acm.org/doi/abs/10.1145/1541880.1541882

6. Theophilus Benson, et al., "Network Traffic Characteristics of Data Centers in the Wild," ACM SIGCOMM Conference on Internet Measurement, Melbourne, Australia, 2010. Available: https://conferences.sigcomm.org/imc/2010/papers/p267.pdf

7. Andrei-Daniel Tudosi, et al., "Secure network architecture based on distributed firewalls," ResearchGate, 2022. Available: https://www.researchgate.net/publication/361079361_Secure_network_architecture_based_on_distributed_firewalls

8. Zhenhuan Gong, et al., "PRESS: PRedictive Elastic ReSource Scaling for cloud systems," IEEE Xplore, 2011. Available: https://ieeexplore.ieee.org/document/5691343

9. Mohammad Aazam, et al., "Fog Computing and Smart Gateway Based Communication for Cloud of Things," IEEE Xplore, 2014. Available: https://ieeexplore.ieee.org/document/6984239

10. Samuel Kounev, et al., "Autonomic QoS-Aware resource management in grid computing using online performance models," ACM Digital Library, 2007. Available: https://dl.acm.org/doi/10.5555/1345263.1345325

Downloads

Published

2026-05-02

How to Cite

Himanshu Pandey. (2026). Self-Healing Subscription-Based Cloud Infrastructure: A Technical Review. International Journal of Computational and Experimental Science and Engineering, 12(2). https://doi.org/10.22399/ijcesen.5209

Issue

Section

Review Article