AWS Step Functions for Serverless Deep Learning Model Training Pipelines
Dr. Javier Morales
Associate Professor, Department of Computer Engineering and Cloud Systems, University of Madrid, Spain
DOI: 10.63665/ijmlaidse-y1f1a005
View / Download Full Article (PDF)Abstract
This work may look into the future at how to make the functionality work on more than one cloud. That will free organizations from vendor lock-in and allow them to take advantage of price differences between Google Cloud, AWS, and Microsoft Azure. If schedulers using reinforcement learning are included, the system could automatically choose and allocate Spot Instances depending on system load and the probability of instance interruptions. In addition, developing training algorithms that tolerate interruptions and reduce checkpointing overhead can further improve efficiency. The overall machine learning workflow can also be enhanced if automated pipelines support other AutoML tasks such as model deployment, monitoring, and continuous data integration. Such integrated automation can significantly improve resource utilization while maintaining cost-efficient machine learning operations in large-scale cloud infrastructures.
Keywords
Serverless Computing, AWS Step Functions, Deep Learning Pipelines, Machine Learning Operations (MLOps), Model Training Automation, Cloud Orchestration, AWS Lambda, SageMaker, Event-Driven Architecture, Cost-Efficient ML Workflows
1. Introduction
Payroll management is a critical organizational function that involves calculating employee compensation, bonuses, deductions, and benefits. As organizations grow in scale and complexity, payroll systems become increasingly vulnerable to fraudulent activities such as ghost employees, inflated work hours, manipulated compensation structures, and unauthorized financial adjustments. Conventional fraud detection systems rely heavily on static rules and manual audits, which are insufficient in identifying complex and evolving fraud schemes.
The integration of Artificial Intelligence (AI) and Machine Learning (ML) technologies introduces adaptive mechanisms capable of learning patterns from historical payroll data and identifying anomalies in real time. These intelligent systems enhance transparency, minimize human bias, and reduce financial losses. By leveraging predictive analytics, classification models, clustering techniques, and neural networks, organizations can implement proactive detection strategies rather than reactive correction mechanisms.
This study aims to present an in-depth overview of AI-based advanced models for payroll fraud detection, evaluating their methodologies, implementation frameworks, and practical benefits within enterprise environments.
References
[1] Spillner, J., Mateos, C., & Monge, D. A. (2020). FaaS and Serverless Computing: From Research to Practice. Journal of Systems and Software, 170, 110708.
[2] AWS. (2024). AWS Step Functions Developer Guide. Retrieved from https://docs.aws.amazon.com/step-functions/
[3] AWS. (2024). Amazon SageMaker Developer Guide. Retrieved from https://docs.aws.amazon.com/sagemaker/
[4] Brescia, G. (2020). Serverless Machine Learning Pipelines with AWS Step Functions. AWS Machine Learning Blog.
[5] Peng, B., & Hosseini, M. (2020). Serverless Machine Learning Inference with AWS Lambda. ACM SIGOPS Operating Systems Review, 54(1), 23–29.
[6] Baldini, I., Castro, P., Chang, K., et al. (2017). Serverless Computing: Current Trends and Open Problems. IBM Research Report.
[7] Ramaswamy, R. (2021). Building Scalable MLOps Pipelines Using AWS Step Functions and SageMaker. Medium Article.
[8] Yu, M., & Liu, J. (2020). Design and Implementation of a Serverless Workflow for Model Training and Deployment. IEEE Cloud Computing, 7(2), 57–66.
[9] Chowdhury, F., & Nguyen, D. (2021). MLOps in Serverless Cloud Architecture: Benefits and Pitfalls. International Journal of Cloud Applications and Computing, 11(3), 15–32.
[10] AWS. (2024). Lambda Limits. Retrieved from https://docs.aws.amazon.com/lambda/latest/dg/gettingstarted-limits.html
[11] Shankar, S., & Bharadwaj, A. (2023). CI/CD for Machine Learning with Step Functions and SageMaker. AWS Architecture Blog.
[12] Amdahl, R., et al. (2022). Scaling Machine Learning Training Pipelines Using SageMaker and AWS CDK. AWS DevOps Blog.
[13] Fouladi, S., Shankar, S., Sreekanth, V., et al. (2019). Weld: Rethinking the Interface Between Data-Intensive Applications. USENIX OSDI.
[14] Dean, J., & Ghemawat, S. (2008). MapReduce: Simplified Data Processing on Large Clusters. Communications of the ACM, 51(1), 107–113.