Track This Job
Add this job to your tracking list to:
- Monitor application status and updates
- Change status (Applied, Interview, Offer, etc.)
- Add personal notes and comments
- Set reminders for follow-ups
- Track your entire application journey
Save This Job
Add this job to your saved collection to:
- Access easily from your saved jobs dashboard
- Review job details later without searching again
- Compare with other saved opportunities
- Keep a collection of interesting positions
- Receive notifications about saved jobs before they expire
AI-Powered Job Summary
Get a concise overview of key job requirements, responsibilities, and qualifications in seconds.
Pro Tip: Use this feature to quickly decide if a job matches your skills before reading the full description.
Site Reliability Engineer
- 3-month contract (possible extensions)
- Fully remote
- €120 per hour
About the Role:
We are looking for a talented Site Reliability Engineer (SRE) to help ensure the reliability, scalability, and efficiency of our systems and services. As an SRE, you will treat operations as a software problem—building automation, reducing manual toil, and taking full ownership of services from infrastructure to application interfaces. You’ll work across complex distributed systems, collaborating with product teams to design, implement, and maintain high-performing, resilient applications.
Key Responsibilities:
- Develop, maintain, and refactor software applications with clean, reusable code.
- Take end-to-end ownership of services, monitoring health, performance, and key metrics.
- Resolve live production incidents and perform root cause analysis to prevent future issues.
- Automate operational tasks and reduce manual labor wherever possible.
- Collaborate with development teams on observability, monitoring, and architectural improvements.
- Evaluate and implement technical solutions to meet business and scalability requirements.
- Participate in operational shifts and on-call rotations as required.
Must-Have Skills & Experience:
- Minimum 3 years of experience in Apache Kafka administration.
- Strong software engineering skills, with experience in Java.
- Hands-on experience with Kubernetes, Docker, Helm, and Argo.
- Experience building and maintaining distributed, multi-tenant systems.
- Skilled in monitoring and observability of complex systems.
- Strong problem-solving skills in distributed environments.
- Experience with automation and reducing operational toil.
Nice-to-Have:
- Knowledge of Confluent Platform and Confluent Cloud.
- Experience working with database systems.
Apply today for immediate consideration!
Key Skills
Ranked by relevanceReady to apply?
Join Brookwood Recruitment Ltd and take your career to the next level!
Application takes less than 5 minutes