-
View all jobs
We are seeking a skilled Senior Site Reliability Engineer (SRE) tasked with ensuring the reliability, performance, and availability of our systems and applications.
The successful candidate will have substantial expertise in configuring and managing AppDynamics to monitor application performance, customize rules, and generate insightful dashboards for proactive system health monitoring.
Responsibilities
The successful candidate will have substantial expertise in configuring and managing AppDynamics to monitor application performance, customize rules, and generate insightful dashboards for proactive system health monitoring.
Responsibilities
- Design and implement monitoring solutions using AppDynamics
- Configure custom rules, alerts, and health rules in AppDynamics
- Develop and maintain AppDynamics dashboards for system performance and usage insights
- Collaborate with application and infrastructure teams to integrate AppDynamics
- Conduct root cause analysis of incidents using AppDynamics and other observability tools
- Standardize best practices for monitoring, alerting, and capacity planning
- Automate tasks and refine processes to boost system reliability and operational efficiency
- Participate in an on-call rotation to support and resolve production system incidents
- Proven experience as an SRE focused on system reliability and performance
- Hands-on expertise in AppDynamics management including setting up health rules, alerts, and dashboards
- Strong understanding of APM concepts like tracing, metrics, and logs
- Proficiency in monitoring frameworks and best practices
- Strong scripting or programming skills in Python, Shell, or similar for automation
- Familiarity with cloud platforms like Azure and container environments such as Kubernetes or Docker
- Experience in incident management and post-incident analysis
- Strong communication skills and collaborative ability with cross-functional teams
- Experience with other observability tools such as Dynatrace or New Relic
- Familiarity with CI/CD pipelines and DevOps practices
- Knowledge of ITIL processes and frameworks
- Cloud certification like Azure Administrator
- Work on a flexible schedule remotely or from any of our comfortable offices or coworking spaces in Ukraine
- Receive the necessary equipment to perform your work tasks
- Change projects and technology stacks within EPAM
- Gain experience in various business domains (Insurance, E-commerce, Healthcare, Finance, Travelling, Media, Artificial Intelligence, and more)
- Consider relocation options in over 30 countries worldwide
- Participate in volunteer, charity programs and communities (both technical and interest-based)
- You can plan your individual career path together with your manager.
- Receive regular feedback from colleagues
- Improve your English for free with certified teachers (Speaking Clubs, client interview preparation courses, etc.)
- Get the opportunity to undergo free training and certification in AWS, GCP, or Azure Clouds
- Use the internal E-learn training program (18,200+ specialized training and mentoring programs)
- Access corporate accounts on LinkedIn Learning, Get Abstract and other partner resources
- Study at EPAM Solution Architecture School with the instructors who are practicing architects
- Develop as a leader, join Delivery Management, Resource Management, Leadership Essentials school and more
- Participate in internal communities (500+ meetups, technical discussions, brainstorming sessions, online events and conferences annually)
- Vacation and sick leave (including a sick leave without a medical certificate)
- A wide range of Voluntary Medical Insurance programs providing both medical treatment and various preventive options (including sports activities)
- Medical insurance for family members at corporate rates
- Company support during significant life events (childbirth or adoption, marriage, etc.)
- Support for psychological comfort: discounts on services from mental health specialists or coaches, thematic training
- E-kids program - a free programming language training program for EPAMers' children
Key Skills
Ranked by relevance
c
ai
ha
cloud
lan
das
unity
aci
ui
artificial intelligence
kubernetes
python
docker
devops
nist
itil
aws
gcp
esp
ids
oop
nat
pan
Related Jobs
3 roles aligned with this opportunity
View Job Details
Related
Lead Generative AI Data Scientist
2026-05-24
Full-time
Mid-Senior
Ukraine
Software Development
Business Development
View Job Details
Related
Full-stack .NET Software Engineer (React/Angular)
2026-05-27
Full-time
Associate
Ukraine
Software Development
Information Technology
View Job Details
Related
DevOps Engineer
2026-05-27
Full-time
Associate
Argentina
Software Development
Engineering
Login to Apply
- Posted
- Dec 07, 2024
- Type
- Full-time
- Level
- Mid-Senior
- Location
- Ukraine
- Company
- EPAM Systems
Industries
Software Development
IT Services
IT Consulting
Categories
Engineering
Information Technology
Related Jobs
3 roles aligned with this opportunity
View Job Details
Related
Lead Generative AI Data Scientist
2026-05-24
Full-time
Mid-Senior
Ukraine
Software Development
Business Development
View Job Details
Related
Full-stack .NET Software Engineer (React/Angular)
2026-05-27
Full-time
Associate
Ukraine
Software Development
Information Technology
View Job Details
Related
DevOps Engineer
2026-05-27
Full-time
Associate
Argentina
Software Development
Engineering