Track This Job
Add this job to your tracking list to:
- Monitor application status and updates
- Change status (Applied, Interview, Offer, etc.)
- Add personal notes and comments
- Set reminders for follow-ups
- Track your entire application journey
Save This Job
Add this job to your saved collection to:
- Access easily from your saved jobs dashboard
- Review job details later without searching again
- Compare with other saved opportunities
- Keep a collection of interesting positions
- Receive notifications about saved jobs before they expire
AI-Powered Job Summary
Get a concise overview of key job requirements, responsibilities, and qualifications in seconds.
Pro Tip: Use this feature to quickly decide if a job matches your skills before reading the full description.
Signify Technology is looking for an SRE to join a fast-growing team.
About the Role:
We are seeking a highly skilled Senior Site Reliability Engineer (SRE) to join a dynamic Platform Tribe. In this role, you will be responsible for ensuring the stability, performance, and reliability of our platform while proactively addressing challenges within our cloud infrastructure.
Key Responsibilities:
Monitoring & Issue Management:
- Oversee day-to-day alerts, system health checks, and timely issue escalation.
- Provide On-Call support for critical SaaS events, ensuring minimal disruption to services.
- Document incident details and remediation steps to maintain a knowledge base.
Cloud Infrastructure & Deployment:
- Develop and maintain proactive monitoring tools within the EKS/K8s ecosystem to detect and resolve issues swiftly.
- Deploy applications to EKS/K8s clusters using Terraform, Helm, and Flux.
- Implement infrastructure health checks and automation scripts to address known challenges.
- Maintain and improve deployment code to enhance operational efficiency.
Experience:
- Maintaining high-traffic platforms
- Extensive experience with Kubernetes for deployment, scaling, and troubleshooting.
- Proficiency in AWS, Terraform, Docker, and CI/CD pipelines.
- Familiarity with monitoring tools such as DataDog, Prometheus, Grafana, and logging solutions like Elasticsearch, Logstash, Kibana (ELK Stack), or AWS CloudWatch.
Technical Skills:
- Proficiency in at least one scripting language (e.g., Python, NodeJS, Go).
What We Offer:
- Quarterly Bonuses: Performance-based bonuses driven by a transparent evaluation process.
- Flexible Work Schedule: Designed to provide work-life balance and flexibility.
- Remote Work Option: Enjoy the flexibility of working remotely, based on your preference.
- Comprehensive Medical Coverage: Health insurance for you and your significant other.
- Life Event Support: Financial assistance during major life events.
- Paid Time Off: Unlimited paid vacation and sick leave for your well-being.
- Professional Development: Reimbursement for courses and training to support your career growth.
Language requirements:
- Ukrainian language required
If you're passionate about maintaining a high-traffic platform and enjoy working in a fast-paced, collaborative environment, please apply with a Detailed CV