Track This Job
Add this job to your tracking list to:
- Monitor application status and updates
- Change status (Applied, Interview, Offer, etc.)
- Add personal notes and comments
- Set reminders for follow-ups
- Track your entire application journey
Save This Job
Add this job to your saved collection to:
- Access easily from your saved jobs dashboard
- Review job details later without searching again
- Compare with other saved opportunities
- Keep a collection of interesting positions
- Receive notifications about saved jobs before they expire
AI-Powered Job Summary
Get a concise overview of key job requirements, responsibilities, and qualifications in seconds.
Pro Tip: Use this feature to quickly decide if a job matches your skills before reading the full description.
We are searching for a Lead DevOps Engineer to improve the reliability and scalability of our Ads Organization data infrastructure. You will troubleshoot production challenges, deliver upgrades and maintenance, and optimize CI/CD workflows while coordinating with stakeholders. Apply to help us ship stable systems with strong observability
Responsibilities
- Oversee and optimize data processing operations using Airflow/MWAA, Spark, and Flink
- Create and sustain cloud infrastructure with AWS, Kubernetes, and Terraform
- Engage stakeholders to collect requirements and communicate infrastructure change progress
- Plan and execute upgrades, conduct maintenance, and troubleshoot data platforms while using Datadog for monitoring and performance insights
- Refine CI/CD delivery by enhancing Spinnaker and Jenkins pipelines for consistent releases
Requirements
- Proven track record of 5+ years in DevOps engineering roles
- Demonstrated experience of 1+ year in leadership or team management responsibilities
- Deep expertise in Amazon Web Services (AWS) for deploying, managing, and operating cloud infrastructure
- Hands-on experience with Apache Airflow for orchestrating and scheduling data workflows
- Advanced knowledge of Kubernetes to manage and scale containerized applications
- Strong proficiency in Terraform for infrastructure automation and configuration
- English level B2 (Upper-Intermediate) or higher with strong communication skills for collaboration and reporting
Nice to have
- Experience with Apache Flink for real-time data stream processing
- Familiarity with Apache NiFi for automating and controlling data flows
- Knowledge of Databricks for advanced analytics and machine learning efforts
- Experience with Datadog for monitoring infrastructure and driving issue resolution
We offer
- International projects with top brands
- Work with global teams of highly skilled, diverse peers
- Healthcare benefits
- Employee financial programs
- Paid time off and sick leave
- Upskilling, reskilling and certification courses
- Unlimited access to the LinkedIn Learning library and 22,000+ courses
- Global career opportunities
- Volunteer and community involvement opportunities
- EPAM Employee Groups
- Award-winning culture recognized by Glassdoor, Newsweek and LinkedIn
Key Skills
Ranked by relevanceReady to apply?
Join EPAM Systems and take your career to the next level!
Application takes less than 5 minutes

