Track This Job
Add this job to your tracking list to:
- Monitor application status and updates
- Change status (Applied, Interview, Offer, etc.)
- Add personal notes and comments
- Set reminders for follow-ups
- Track your entire application journey
Save This Job
Add this job to your saved collection to:
- Access easily from your saved jobs dashboard
- Review job details later without searching again
- Compare with other saved opportunities
- Keep a collection of interesting positions
- Receive notifications about saved jobs before they expire
AI-Powered Job Summary
Get a concise overview of key job requirements, responsibilities, and qualifications in seconds.
Pro Tip: Use this feature to quickly decide if a job matches your skills before reading the full description.
Site Reliability Engineer - Kafka
Dublin | Hybrid
€70,000 - €80,000 per annum
Is this the Site Reliability Engineer role for you?
Crone Corkill have partnered with a technology consultancy who are searching for a Site Reliability Engineer to join a client in their Dublin office on a permanent basis. Expertise with Apache Kafka within a production environment is absolutely key here, with strong knowledge and experience across Kafka architecture, security, clusters, stream processing and distributed systems being vital.
Working as part of a diverse team, you’ll be heavily involved in a DevOps transformation, which involves production readiness, supporting developers during the application build phase, triage, root cause and more.
Please note that in order to be considered for this role, you must be capable of demonstrating expertise with Kafka.
What will you do as a Site Reliability Engineer?
- Operate and administer Apache Kafka clusters, including monitoring, scaling, security, and troubleshooting
- Work as a key contact responsible for ensuring application scalability, performance, and resilience
- Design, build, and maintain event-driven architectures to support scalable and resilient applications
- Collaborate with development teams to integrate SRE best practices (SLIs, SLOs, SLAs, error budgets, etc.)
- Automate operational tasks, CI/CD pipelines, and system monitoring to reduce manual interventions
- Manage and optimise PCF (Pivotal Cloud Foundry) deployments, ensuring application performance and availability
- Support the application CI/CD pipeline for promoting software into higher environments through validation and operational gating
- Automate data-driven alerts to proactively escalate issues. Work with development teams to establish SLOs and improve reliability
- Partner with the development and product team of a new application to establish the right monitoring and alerting strategy
- Develop and manage observability and monitoring solutions using Splunk, ensuring proactive issue detection and resolution
- Contribute to infrastructure as code (IaC) and cloud-native deployments
What skills do you need as a Site Reliability Engineer?
- Apache Kafka within a production environment (including architecture, brokers, topics, partitions and replicas)
- Kafka security (SSL, SASL & ACLs)
- Exposure to Splunk, including logging, dashboards, alerting and operational insights
- Configuring, deploying and managing Kafka clusters in cloud & on-prem environments
- Kafka stream processing, using Kafka Streams, KSQL or Apache Flink
- Proficiency in Java, Scala or Python for Kafka related development tasks
- Familiarity with DevOps practices, including CI/CD pipelines, monitoring and logging
- Experience with tools like Zookeeper, Schema Registry, and Kafka Connect
Key Skills
Ranked by relevanceReady to apply?
Join Crone Corkill and take your career to the next level!
Application takes less than 5 minutes