-
View all jobs
We are seeking a Senior Site Reliability Engineer to join our team and contribute to building and maintaining reliable, scalable, and efficient systems.
As part of the Reliability Tooling team, you will write and review code, influence technical decisions, and mentor engineers in your squad. You will act as a trusted resource, leveraging your expertise in SRE principles and best practices to enhance system reliability and support team growth.
Feel free to work remotely from anywhere across Latvia or connect with colleagues at our Riga office.
Responsibilities
About EPAM
EPAM is a leading global provider of digital platform engineering and development services. For over 30 years, our team has helped leading brands navigate the waves of digital transformation, building solutions that help them stay competitive through constant market disruption.
With offices in 55+ countries, EPAM has grown in Latvia to over 888+ talented innovators in 3 years. We foster creativity and unconventional ways of doing things, welcoming like-minded professionals to join us.
As part of the Reliability Tooling team, you will write and review code, influence technical decisions, and mentor engineers in your squad. You will act as a trusted resource, leveraging your expertise in SRE principles and best practices to enhance system reliability and support team growth.
Feel free to work remotely from anywhere across Latvia or connect with colleagues at our Riga office.
Responsibilities
- Develop tools that enable the SRE team to quickly identify, troubleshoot, and resolve infrastructure, platform, and application issues
- Apply Chaos Engineering methodologies to test your solutions in real-world scenarios and improve system resilience
- Implement and manage modern cloud technologies using Infrastructure as Code (IaC), self-healing, and automated security patterns
- Build telemetry, alerts, and response mechanisms to minimize Mean Time to Recovery (MTTR)
- Collaborate with teams to deliver technical excellence and ensure alignment across projects
- Provide guidance on best practices and create tools to promote the adoption of service reliability principles, including sustainable incident response and blameless postmortems
- Identify opportunities to improve reliability, operational efficiency, and overall system performance
- Write code to enhance scalability, performance, maintainability, and system security
- Foster a culture of team participation in creating thoughtful, high-quality software solutions
- Mentor team members in SRE-related technical and operational responsibilities
- Bachelor’s degree in Computer Science, Electrical & Computer Engineering, Mathematics, or equivalent experience
- At least 3 years of experience in SRE, DevOps, systems engineering, software engineering, or similar roles
- Expertise in working with cloud environments such as AWS, Azure, or GCP
- Hands-on experience managing enterprise production environments with containers like Docker, Kubernetes, LXC, AWS ECS, or EKS
- Proficiency in one or more programming languages such as Python, Go, or Rust
- Strong written and verbal communication skills to effectively collaborate with technical and non-technical teams
- A passion for leveraging technology to solve challenges and a commitment to continuous learning
- Fluent English communication skills (written and spoken) at a B2 level or higher
- Engineering Heritage: Best-in-class experts sharing a culture of engineering excellence and tackling complex engineering challenges for over 30 years.
- Advanced Tech Stack: Innovative projects where you can apply or enhance your expertise in Cloud, Data, AI, and other emerging technologies.
- World-Class Clients: Work closely with 295+ of the Forbes Global 2000 on creating disruptive solutions that make a global impact.
- Professional Growth: Exceptional support for career development with comprehensive resources for upskilling or reskilling in pioneering practices.
- GenAI Community: Strong AI competencies with 600+ experts across 55+ locations driving GenAI-enabled transformation journeys.
- Entrepreneurial Culture: If you're passionate and dedicated to improving business transformation, we provide the support you need to bring your ideas to life.
- Hybrid Setup: The flexibility to work from any location in Latvia, whether it's your home or our office in Riga.
- Other Benefits: Additional vacation and trust days, private health insurance, Employee Stock Purchase Plan and more.
About EPAM
EPAM is a leading global provider of digital platform engineering and development services. For over 30 years, our team has helped leading brands navigate the waves of digital transformation, building solutions that help them stay competitive through constant market disruption.
With offices in 55+ countries, EPAM has grown in Latvia to over 888+ talented innovators in 3 years. We foster creativity and unconventional ways of doing things, welcoming like-minded professionals to join us.
Key Skills
Ranked by relevance
cloud
aws
ai
infrastructure as code
incident response
kubernetes
python
docker
devops
ecs
Related Jobs
3 roles aligned with this opportunity
View Job Details
Related
DevOps Engineer
2026-05-27
Full-time
Associate
Argentina
Software Development
Engineering
View Job Details
Related
DevOps Engineer (AWS)
2026-05-27
Full-time
Associate
Argentina
Software Development
Engineering
View Job Details
Related
Full-stack .NET Software Engineer (React/Angular)
2026-05-27
Full-time
Associate
Ukraine
Software Development
Information Technology
Login to Apply
- Posted
- Jul 22, 2025
- Type
- Full-time
- Level
- Mid-Senior
- Location
- Latvia
- Company
- EPAM Systems
Industries
Software Development
IT Services
IT Consulting
Categories
Engineering
Information Technology
Business Development
Related Jobs
3 roles aligned with this opportunity
View Job Details
Related
DevOps Engineer
2026-05-27
Full-time
Associate
Argentina
Software Development
Engineering
View Job Details
Related
DevOps Engineer (AWS)
2026-05-27
Full-time
Associate
Argentina
Software Development
Engineering
View Job Details
Related
Full-stack .NET Software Engineer (React/Angular)
2026-05-27
Full-time
Associate
Ukraine
Software Development
Information Technology