📍 Location: Australia (Remote)
🕒 Employment Type: Full-Time
💼 Level: Mid-Level to Senior
We are seeking a Site Reliability Engineer (SRE) to ensure the reliability, scalability, and performance of our systems and applications. This role is ideal for someone who combines software engineering skills with a deep understanding of infrastructure and operations.
🎯 Key Responsibilities
- Design, build, and maintain highly available and scalable systems
- Implement monitoring, alerting, and incident response strategies
- Automate operational processes to improve system reliability and efficiency
- Collaborate with development and DevOps teams to deploy and manage services
- Analyze system performance and troubleshoot issues proactively
- Participate in capacity planning, disaster recovery, and performance optimization
- Contribute to documentation, best practices, and reliability standards
Requirements
- Bachelor’s degree in Computer Science, Engineering, or a related field (or equivalent experience)
- 3–6 years of experience in site reliability, DevOps, or systems engineering
- Strong knowledge of Linux/Unix systems and networking
- Experience with cloud platforms (AWS, Azure, GCP) and infrastructure-as-code tools (Terraform, CloudFormation)
- Proficiency in scripting or programming (Python, Bash, Go, etc.)
- Familiarity with CI/CD pipelines, monitoring tools (Prometheus, Grafana), and container orchestration (Kubernetes)
- Excellent problem-solving, communication, and collaboration skills
- Based in Australia with full working rights
Preferred Qualifications
- Experience with distributed systems and microservices
- Knowledge of security best practices and compliance frameworks
- Familiarity with incident management tools (PagerDuty, Opsgenie)
- Cloud certifications (AWS Solutions Architect, Azure Administrator, GCP Professional)
What We Offer
- Fully remote work across Australia
- Opportunity to work on large-scale, high-availability systems
- Career growth and professional development opportunities
- Collaborative, innovative, and supportive team environment
- Competitive salary and benefits package
Ready to apply?
Join Happy Culture and take your career to the next level!

