Track This Job
Add this job to your tracking list to:
- Monitor application status and updates
- Change status (Applied, Interview, Offer, etc.)
- Add personal notes and comments
- Set reminders for follow-ups
- Track your entire application journey
Save This Job
Add this job to your saved collection to:
- Access easily from your saved jobs dashboard
- Review job details later without searching again
- Compare with other saved opportunities
- Keep a collection of interesting positions
- Receive notifications about saved jobs before they expire
AI-Powered Job Summary
Get a concise overview of key job requirements, responsibilities, and qualifications in seconds.
Pro Tip: Use this feature to quickly decide if a job matches your skills before reading the full description.
Responsibilities
- Design, develop, and maintain data pipelines and data infrastructure using Apache Spark, Databricks, and other data engineering tools
- Collaborate with the Data Science team to provide high-quality data solutions that support business decisions and drive growth
- Optimize and tune data pipelines for performance, scalability, and reliability
- Ensure data quality and integrity throughout the data pipeline, from data ingestion to data consumption
- Implement and maintain CI/CD pipelines for data engineering workflows
- Develop and maintain documentation for data engineering processes and data infrastructure
- Provide technical leadership and mentorship to junior data engineers
- 5+ years of experience in Data Software Engineering, showcasing your expertise in designing and developing complex data solutions
- 1+ years of experience in a leadership role, demonstrating your ability to lead a team of engineers
- Expertise in Apache Spark along with Spark streaming & Spark SQL, showcasing your ability to design and develop complex data pipelines
- Fluency working with AWS landscape, including EC2, S3, and EMR, among others
- Good hands-on experience with Databricks and delta-lake, enabling you to manage large-scale data infrastructures
- Good understanding & hands-on experience with CI/CD, enabling you to automate software development processes
- Rich working experience with Github, demonstrating your proficiency in version control and collaboration tools
- Familiarity with Presto, Superset, Starburst, or Exasol, demonstrating your broader knowledge of data engineering tools and technologies
- International projects with top brands
- Work with global teams of highly skilled, diverse peers
- Healthcare benefits
- Employee financial programs
- Paid time off and sick leave
- Upskilling, reskilling and certification courses
- Unlimited access to the LinkedIn Learning library and 22,000+ courses
- Global career opportunities
- Volunteer and community involvement opportunities
- EPAM Employee Groups
- Award-winning culture recognized by Glassdoor, Newsweek and LinkedIn