Track This Job
Add this job to your tracking list to:
- Monitor application status and updates
- Change status (Applied, Interview, Offer, etc.)
- Add personal notes and comments
- Set reminders for follow-ups
- Track your entire application journey
Save This Job
Add this job to your saved collection to:
- Access easily from your saved jobs dashboard
- Review job details later without searching again
- Compare with other saved opportunities
- Keep a collection of interesting positions
- Receive notifications about saved jobs before they expire
AI-Powered Job Summary
Get a concise overview of key job requirements, responsibilities, and qualifications in seconds.
Pro Tip: Use this feature to quickly decide if a job matches your skills before reading the full description.
Responsibilities
- Collaborate with data scientists and other technical staff to design and develop scalable, reliable, and efficient data pipelines
- Integrate new data sources into the existing technology stack
- Write clean, maintainable, and well-documented code
- Ensure the quality and accuracy of data by implementing data validation and monitoring processes
- Deploy, manage, and monitor machine learning models in production
- Work with cross-functional teams to identify and resolve data-related issues
- Contribute to the development of best practices and standards for data engineering
- Implement efficient data storage and retrieval mechanisms
- Maintain and optimize existing data pipelines for maximum performance and scalability
- Provide technical guidance and mentorship to junior team members
- 3+ years of experience in Data Software Engineering or related roles
- Expertise in Apache Spark and Databricks for building scalable and high-performance data pipelines
- Deep knowledge of Microsoft Azure for developing and deploying cloud-based solutions
- Strong proficiency in Python for data analysis, modeling, and manipulation
- Experience in forecasting models and statistical models for predicting future trends
- Familiarity with Git for version control and collaboration
- Knowledge of MLOps for deploying, managing, and monitoring machine learning models in production
- Ability to work with complex data structures and large datasets
- Excellent communication skills in spoken and written English, at an upper-intermediate level or higher
- Experience with Panda for data manipulation and analysis
- Knowledge of SQL and relational tables
- Understanding of ML models and their practical applications
- International projects with top brands
- Work with global teams of highly skilled, diverse peers
- Healthcare benefits
- Employee financial programs
- Paid time off and sick leave
- Upskilling, reskilling and certification courses
- Unlimited access to the LinkedIn Learning library and 22,000+ courses
- Global career opportunities
- Volunteer and community involvement opportunities
- EPAM Employee Groups
- Award-winning culture recognized by Glassdoor, Newsweek and LinkedIn