Track This Job
Add this job to your tracking list to:
- Monitor application status and updates
- Change status (Applied, Interview, Offer, etc.)
- Add personal notes and comments
- Set reminders for follow-ups
- Track your entire application journey
Save This Job
Add this job to your saved collection to:
- Access easily from your saved jobs dashboard
- Review job details later without searching again
- Compare with other saved opportunities
- Keep a collection of interesting positions
- Receive notifications about saved jobs before they expire
AI-Powered Job Summary
Get a concise overview of key job requirements, responsibilities, and qualifications in seconds.
Pro Tip: Use this feature to quickly decide if a job matches your skills before reading the full description.
We are seeking a Lead DevOps Engineer to boost scalability and uptime for our Ads Organization data infrastructure. You will lead upgrades and maintenance, resolve technical blockers, and refine CI/CD with Spinnaker and Jenkins while keeping stakeholders aligned. Apply today to help deliver robust, data-driven platforms
Responsibilities
- Own and optimize data processing operations leveraging Airflow/MWAA, Spark, and Flink
- Build and maintain cloud infrastructure using AWS, Kubernetes, and Terraform
- Partner with stakeholders to gather requirements and share updates on infrastructure changes
- Handle upgrades, maintenance, and troubleshooting for data platforms, using Datadog to monitor systems and analyze performance
- Optimize CI/CD pipelines with Spinnaker and Jenkins to improve delivery efficiency and consistency
Requirements
- Minimum 5 years of professional experience in a DevOps engineering position
- At least 1 year of experience leading a team or taking on people management duties
- Strong background in Amazon Web Services (AWS) for cloud infrastructure deployment and management
- Hands-on experience with Apache Airflow for scheduling and orchestrating data workflows
- Advanced knowledge of Kubernetes for managing and scaling containerized services
- Skilled in Terraform for infrastructure provisioning, automation, and configuration
- English communication proficiency at B2 (Upper-Intermediate) or higher for collaboration and reporting
Nice to have
- Experience using Apache Flink for processing real-time streams
- Familiarity with Apache NiFi to automate and control data flows
- Knowledge of Databricks for advanced analytics and machine learning projects
- Experience with Datadog for monitoring infrastructure and resolving operational issues
We offer
- International projects with top brands
- Work with global teams of highly skilled, diverse peers
- Healthcare benefits
- Employee financial programs
- Paid time off and sick leave
- Upskilling, reskilling and certification courses
- Unlimited access to the LinkedIn Learning library and 22,000+ courses
- Global career opportunities
- Volunteer and community involvement opportunities
- EPAM Employee Groups
- Award-winning culture recognized by Glassdoor, Newsweek and LinkedIn
Key Skills
Ranked by relevanceReady to apply?
Join EPAM Systems and take your career to the next level!
Application takes less than 5 minutes

