Retrieval-Augmented Enterprise Analytics with Privacy-Aware Cloud Data Pipelines
DOI:
https://doi.org/10.22399/ijcesen.5133Keywords:
Retrieval-augmented analytics, privacy-aware data pipelines, enterprise analytics, cloud-native architectures, data governanceAbstract
The rapid adoption of cloud-native data platforms has intensified the need for enterprise analytics systems that are both contextually intelligent and compliant with strict privacy and governance requirements. Traditional analytics pipelines often struggle to deliver meaningful insights from heterogeneous enterprise data while simultaneously safeguarding sensitive information. This study investigates a retrieval-augmented enterprise analytics architecture integrated with privacy-aware cloud data pipelines to address these challenges. A design-science–oriented methodology is employed to evaluate analytical quality, system efficiency, and governance effectiveness across multiple pipeline configurations. The results demonstrate that retrieval augmentation significantly improves contextual accuracy, relevance, and explainability of analytics outputs, while privacy-aware mechanisms reduce policy violations and unauthorized data exposure without severely impacting system performance. Sensitivity and interaction analyses further reveal that balanced tuning of retrieval depth and privacy thresholds is critical for maximizing overall system effectiveness. The findings highlight that embedding retrieval intelligence and privacy controls as first-class architectural components enables scalable, trustworthy, and regulation-ready enterprise analytics in modern cloud environments.
References
1. Balogun, E. D., Ogunsola, K. O., & Samuel, A. D. E. B. A. N. J. I. (2021). A cloud-based data warehousing framework for real-time business intelligence and decision-making optimization. International Journal of Business Intelligence Frameworks, 6(4), 121-134.
2. Beeyani, G. (2025). From conceptualization to customer delight: A tri-dimensional framework for menu innovation, operational excellence, and presentation refinement designing the future of dining. Journal of Innovative Science, 1(2), 64–72.
3. Boadi-Mensah, J. (2022). A strategic analysis of non-profit animal welfare organizations: Lessons from the Winnipeg Pet Rescue Shelter. African Journal of Biological Sciences, 4(4), 947–960.
4. Boukraa, D., Bala, M., & Rizzi, S. (2024). Metadata management in data lake environments: a survey. Journal of Library Metadata, 24(4), 215-274.
5. Bukhari, T. T., Oladimeji, O., Etim, E. D., & Ajayi, J. O. (2024). Cloud-native business intelligence transformation: Migrating legacy systems to modern analytics stacks for scalable decision-making. International Journal of Scientific Research in Humanities and Social Sciences, 1(2), 744-762.
6. Cherukuri, R., & Yarram, V. K. (2024). From Intelligent Automation to Agentic AI: Engineering the Next Generation of Enterprise Systems. International Journal of Emerging Research in Engineering and Technology, 5(4), 142-152.
7. Eboseremen, B. O., Ogedengbe, A. O., Obuse, E., Oladimeji, O., Ajayi, J. O., Akindemowo, A. O., ... & Erigha, E. D. (2022). Secure data integration in multi-tenant cloud environments: Architecture for financial services providers. Journal of Frontiers in Multidisciplinary Research, 3(1), 579-592.
8. Georgiadis, G., & Poels, G. (2021). Enterprise architecture management as a solution for addressing general data protection regulation requirements in a big data context: a systematic mapping study. Information Systems and e-Business Management, 19(1), 313-362.
9. Harika, A., Bhavani, P., Sriteja, P., Tajuddin, S., & Harsha, S. S. (2023, December). Optimizing scalability and resilience: Strategies for aligning DevOps and cloud-native approaches. In 2023 3rd International Conference on Innovative Mechanisms for Industry Applications (ICIMIA) (pp. 1161-1167). IEEE.
10. Herath, H. M. S. S., Herath, H. M. K. K. M. B., Madhusanka, B. G. D. A., & Guruge, L. G. P. K. (2024). Data protection challenges in the processing of sensitive data. In Data Protection: The Wake of AI and Machine Learning (pp. 155-179). Cham: Springer Nature Switzerland.
11. Ieva, S., Loconte, D., Loseto, G., Ruta, M., Scioscia, F., Marche, D., & Notarnicola, M. (2024). A retrieval-augmented generation approach for data-driven energy infrastructure digital twins. Smart Cities, 7(6), 3095-3120.
12. Joksimović, S., Marshall, R., Rakotoarivelo, T., Ladjal, D., Zhan, C., & Pardo, A. (2021). Privacy-driven learning analytics. In Manage your own learning analytics: Implement a Rasch modelling approach (pp. 1-22). Cham: Springer International Publishing.
13. Joshi, D. (2024). Data governance maturity and its impact on analytical value creation: A cross-industry analysis. Sarcouncil Journal of Economics and Business Management, 3(7), 18–25.
14. Mbah, G. O. (2024). Data privacy in the era of AI: Navigating regulatory landscapes for global businesses. Int. J. Sci. Res. Anal, 13(2), 2396-2405.
15. Nambiar, A., & Mundra, D. (2022). An overview of data warehouse and data lake in modern enterprise data management. Big data and cognitive computing, 6(4), 132.
16. Olayinka, O. H. (2019). Leveraging predictive analytics and machine learning for strategic business decision-making and competitive advantage. International Journal of Computer Applications Technology and Research, 8(12), 473-486.
17. Oluoha, O. M., Odeshina, A. B. I. S. O. L. A., Reis, O. L. U. W. A. T. O. S. I. N., Okpeke, F. R. I. D. A. Y., Attipoe, V. E. R. L. I. N. D. A., & Orieno, O. (2023). A privacy-first framework for data protection and compliance assurance in digital ecosystems. Iconic Research and Engineering Journals, 7(4), 620-646.
18. Piras, L., Al-Obeidallah, M. G., Praitano, A., Tsohou, A., Mouratidis, H., Gallego-Nicasio Crespo, B., ... & Zorzino, G. G. (2019, August). DEFeND architecture: a privacy by design platform for GDPR compliance. In International conference on trust and privacy in digital business (pp. 78-93). Cham: Springer International Publishing.
19. Prabhune, A., Stotzka, R., Sakharkar, V., Hesser, J., & Gertz, M. (2018). MetaStore: an adaptive metadata management framework for heterogeneous metadata models. Distributed and parallel databases, 36(1), 153-194.
20. Prabhune, S., & Berndt, D. J. (2024). Deploying large language models with retrieval augmented generation. arXiv preprint arXiv:2411.11895.
21. Rai, C. (2025). Blending classical French technique with global flavors: A model for contemporary pastry innovation. Journal of Innovation Science, 1(2), 30–38.
22. Shaham, S., Hajisafi, A., Quan, M. K., Nguyen, D. C., Krishnamachari, B., Peris, C., ... & Pathirana, P. N. (2023). Holistic survey of privacy and fairness in machine learning. arXiv preprint arXiv:2307.15838.
23. Solano, M. C., & Cruz, J. C. (2024). Integrating analytics in enterprise systems: A systematic literature review of impacts and innovations. Administrative Sciences, 14(7), 138.
24. Szmeja, P., Fornés-Leal, A., Lacalle, I., Palau, C. E., Ganzha, M., Pawłowski, W., ... & Schabbink, J. (2023). ASSIST-IoT: A modular implementation of a reference architecture for the next generation Internet of Things. Electronics, 12(4), 854.
25. Udoh, O. R. (2024). Enhancing Internal Audit Efficiency For Effective Risk Management and Corporate Governance Frameworks. International Journal of Research Publication and Reviews, 5(12), 3646-3659.
26. Unhelkar, B., & Arntzen, A. A. (2020). A framework for intelligent collaborative enterprise systems. Concepts, opportunities and challenges. Scandinavian Journal of Information Systems, 32(2), 6.
27. Yu, H., Gan, A., Zhang, K., Tong, S., Liu, Q., & Liu, Z. (2024, August). Evaluation of retrieval-augmented generation: A survey. In CCF Conference on Big Data (pp. 102-120). Singapore: Springer Nature Singapore.
28. Zhang, X., Zhang, B., Zhang, C., & Wei, L. (2024, December). Enhanced Privacy Policy Comprehension via Pre-trained and Retrieval-Augmented Models. In 2024 IEEE 23rd International Conference on Trust, Security and Privacy in Computing and Communications (TrustCom) (pp. 574-581). IEEE.
29. Zhao, S., Yang, Y., Wang, Z., He, Z., Qiu, L. K., & Qiu, L. (2024). Retrieval augmented generation (rag) and beyond: A comprehensive survey on how to make your llms use external data more wisely. arXiv preprint arXiv:2409.14924.
30. Zhong, H., Yang, D., Shi, S., Wei, L., & Wang, Y. (2024). From data to insights: the application and challenges of knowledge graphs in intelligent audit. Journal of Cloud Computing, 13(1), 114.
31. Zhou, Y., Liu, Y., Li, X., Jin, J., Qian, H., Liu, Z., ... & Yu, P. S. (2024). Trustworthiness in retrieval-augmented generation systems: A survey. arXiv preprint arXiv:2409.10102.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2026 International Journal of Computational and Experimental Science and Engineering

This work is licensed under a Creative Commons Attribution 4.0 International License.