Track This Job
Add this job to your tracking list to:
- Monitor application status and updates
- Change status (Applied, Interview, Offer, etc.)
- Add personal notes and comments
- Set reminders for follow-ups
- Track your entire application journey
Save This Job
Add this job to your saved collection to:
- Access easily from your saved jobs dashboard
- Review job details later without searching again
- Compare with other saved opportunities
- Keep a collection of interesting positions
- Receive notifications about saved jobs before they expire
AI-Powered Job Summary
Get a concise overview of key job requirements, responsibilities, and qualifications in seconds.
Pro Tip: Use this feature to quickly decide if a job matches your skills before reading the full description.
We are looking for a Lead DevOps Engineer to strengthen reliability and scale across our Ads Organization data infrastructure. You will modernize systems, solve complex incidents, and tune CI/CD while aligning with stakeholders on resilient platform changes. Apply to help keep our data platforms fast, stable, and observable
Responsibilities
- Lead and streamline data processing operations using Airflow/MWAA, Spark, and Flink
- Design and operate cloud infrastructure with AWS, Kubernetes, and Terraform
- Collaborate with stakeholders to capture requirements and communicate infrastructure updates
- Execute upgrades, run maintenance, and troubleshoot data platforms using Datadog for monitoring and performance insights
- Improve CI/CD pipelines with Spinnaker and Jenkins to deliver applications consistently and efficiently
Requirements
- Proven DevOps engineering experience of 5 years or more
- Demonstrated leadership or people management experience of 1 year or more
- Strong background with Amazon Web Services (AWS) for deploying and operating cloud infrastructure
- Hands-on experience with Apache Airflow for workflow orchestration and scheduling
- Advanced knowledge of Kubernetes for operating and scaling containerized workloads
- Skilled use of Terraform for infrastructure automation and configuration management
- Proficient English at B2 (Upper-Intermediate) or higher for clear collaboration and reporting
Nice to have
- Experience with Apache Flink for real-time stream processing
- Familiarity with Apache NiFi for automating and managing data flows
- Knowledge of Databricks for advanced analytics and machine learning initiatives
- Experience with Datadog for infrastructure monitoring and incident resolution
We offer
- International projects with top brands
- Work with global teams of highly skilled, diverse peers
- Healthcare benefits
- Employee financial programs
- Paid time off and sick leave
- Upskilling, reskilling and certification courses
- Unlimited access to the LinkedIn Learning library and 22,000+ courses
- Global career opportunities
- Volunteer and community involvement opportunities
- EPAM Employee Groups
- Award-winning culture recognized by Glassdoor, Newsweek and LinkedIn
Key Skills
Ranked by relevanceReady to apply?
Join EPAM Systems and take your career to the next level!
Application takes less than 5 minutes

