As a Site Reliability Engineer, you will play a key role in ensuring the stability, performance, and scalability of complex cloud systems that power high-traffic digital platforms. You'll collaborate with cross-functional engineering teams to design and operate resilient infrastructure, enhance observability, and streamline automation across environments. This role is ideal for a technically driven problem-solver who thrives in a remote-first, innovation-focused environment. You'll help strengthen cloud governance, optimize operations, and contribute to major architectural initiatives that enable global scalability and reliability.
Accountabilities:
- Design, operate, and optimize AWS-based infrastructure using Terraform, Helm, and Kubernetes to ensure scalability and high availability
- Strengthen observability through effective monitoring, logging, and alerting systems to improve incident detection and resolution times
- Automate key workflows and reduce manual tasks to enhance engineering productivity and operational consistency
- Partner with software and cloud engineering teams to improve the resilience and performance of services under heavy workloads
- Participate in building the next-generation architecture supporting regional expansion and data residency requirements
- Contribute to on-call rotations, manage incidents calmly, and document learnings to prevent recurrence
- Experiment with and adopt AI tools to streamline workflows and increase efficiency in reliability operations
Requirements:
- Minimum of 4 years of experience in cloud engineering, systems administration, or site reliability engineering
- Strong proficiency with AWS (or other major cloud platforms such as GCP or Azure) and infrastructure-as-code tools
- Hands-on experience with Kubernetes, serverless technologies, and automation frameworks
- Proficiency in a programming language such as Python, Go, or TypeScript
- Solid understanding of observability practices and tools like Grafana, Datadog, Prometheus, and Sentry
- Excellent analytical and problem-solving skills with a passion for improving performance and reliability
- Strong communication and documentation abilities to share knowledge across teams
- Eagerness to explore and apply AI technologies to improve operational processes
- Ability to work independently in a remote-first environment and collaborate effectively across regions
Benefits:
- Competitive salary packages, with equity and performance-based bonuses
- Transparent and equitable compensation philosophy focused on impact and growth
- Comprehensive healthcare and wellness benefits
- Fully remote work model offering flexibility and autonomy
- Inclusive, collaborative, and global work environment that values diversity and innovation
- Professional development opportunities and access to leading-edge tools and technologies
- Opportunity to influence large-scale infrastructure initiatives with measurable business impact
When you apply, your profile goes through our AI-powered screening process designed to identify top talent efficiently and fairly.
🔍 Our AI evaluates your CV and LinkedIn profile thoroughly, analyzing your skills, experience, and achievements.
📊 It compares your profile to the job's core requirements and past success factors to determine your match score.
🎯 Based on this analysis, we automatically shortlist the 3 candidates with the highest match to the role.
🧠 When necessary, our human team may perform an additional manual review to ensure no strong profile is missed.
The process is transparent, skills-based, and free of bias, focusing solely on your fit for the role.
Once the shortlist is completed, we share it directly with the company that owns the job opening. The final decision and next steps (such as interviews or additional assessments) are then made by their internal hiring team.
Thank you for your interest!