-
View all jobs
We are seeking a highly skilled Lead DevOps Engineer to join our innovative team and play a key role in building and supporting cutting-edge cloud infrastructure.
This role involves leveraging expertise in AWS, Azure, Kubernetes, and CI/CD pipelines to create scalable, reliable platforms that support AI/ML initiatives for groundbreaking drug discovery solutions. You will collaborate with a diverse group of technical experts to bridge the gap between data science and engineering, driving advancements in healthcare technology.
Responsibilities
This role involves leveraging expertise in AWS, Azure, Kubernetes, and CI/CD pipelines to create scalable, reliable platforms that support AI/ML initiatives for groundbreaking drug discovery solutions. You will collaborate with a diverse group of technical experts to bridge the gap between data science and engineering, driving advancements in healthcare technology.
Responsibilities
- Design and deploy large-scale, secure production infrastructure in AWS to support AI/ML workflows
- Collaborate with data science teams to create advanced data science environments, enabling scalable model development
- Manage containerized workloads, including the deployment and optimization of Kubernetes clusters for performance and cost-effectiveness
- Build and maintain robust CI/CD pipelines to streamline the integration and deployment of ML models and data workflows
- Develop automation scripts using tools such as PowerShell and Bash to optimize system management and reduce manual processes
- Collaborate with cross-functional teams to transform ML pipelines into reliable production systems
- Monitor and troubleshoot cloud-based systems, ensuring high availability and scalability
- Document infrastructure design, configuration, and operational workflows to foster collaboration and knowledge sharing
- Lead ongoing efforts to enhance deployment, monitoring, and operational best practices
- Implement comprehensive monitoring and alerting solutions to address system health and reliability
- Mentor other team members, sharing best practices in cloud infrastructure and DevOps techniques
- Proven experience of 5+ years in managing infrastructure and deploying applications in cloud environments like AWS and Azure
- Expertise in container orchestration tools like Kubernetes and Docker, including the ability to manage and optimize clusters
- Proficiency in designing and optimizing CI/CD pipelines using Azure DevOps, Jenkins, or similar tools
- Skills in scripting languages such as PowerShell, Bash, or Python with a focus on automation and deployment
- Understanding of monitoring and logging systems such as Prometheus, Grafana, Datadog, or CloudWatch
- Background in managing secure, scalable production systems for machine learning or data science applications
- Familiarity with version control and collaboration tools such as Git and GitHub
- Insights into performance tuning, cost optimization, and troubleshooting in cloud environments
- Flexibility to use modern DevOps tools and practices to address evolving technical challenges
- Knowledge of infrastructure as code frameworks like Terraform or CloudFormation
- Experience with advanced monitoring solutions or APM tools such as New Relic or Dynatrace
- Familiarity with AI/ML frameworks like TensorFlow, PyTorch, or SciKit-Learn
- Understanding of healthcare-focused regulatory compliance in cloud environments
- Skills in data processing tools like Apache Airflow or similar platforms
- International projects with top brands
- Work with global teams of highly skilled, diverse peers
- Healthcare benefits
- Employee financial programs
- Paid time off and sick leave
- Upskilling, reskilling and certification courses
- Unlimited access to the LinkedIn Learning library and 22,000+ courses
- Global career opportunities
- Volunteer and community involvement opportunities
- EPAM Employee Groups
- Award-winning culture recognized by Glassdoor, Newsweek and LinkedIn
Key Skills
Ranked by relevance
cloud
devops
kubernetes
cicd
aws
powershell
bash
infrastructure as code
high availability
machine learning
prometheus
tensorflow
terraform
jenkins
grafana
datadog
pytorch
python
docker
apache
git
Related Jobs
3 roles aligned with this opportunity
View Job Details
Related
DevOps Engineer (AWS)
2026-05-27
Full-time
Associate
Argentina
Software Development
Engineering
View Job Details
Related
DevOps Engineer
2026-05-27
Full-time
Associate
Argentina
Software Development
Engineering
View Job Details
Related
Lead DevOps Engineer (Azure)
2026-05-16
Full-time
Mid-Senior
Turkey
Software Development
Engineering
Login to Apply
- Posted
- Jun 10, 2025
- Type
- Full-time
- Level
- Mid-Senior
- Location
- Ukraine
- Company
- EPAM Systems
Industries
Software Development
IT Services
IT Consulting
Oil
Gas
Mining
Categories
Engineering
Information Technology
Business Development
Related Jobs
3 roles aligned with this opportunity
View Job Details
Related
DevOps Engineer (AWS)
2026-05-27
Full-time
Associate
Argentina
Software Development
Engineering
View Job Details
Related
DevOps Engineer
2026-05-27
Full-time
Associate
Argentina
Software Development
Engineering
View Job Details
Related
Lead DevOps Engineer (Azure)
2026-05-16
Full-time
Mid-Senior
Turkey
Software Development
Engineering