SRE
Dublin/Hybrid
Full-time
As an SRE, you will act as the production readiness steward for versatile Gateway products and their integration with other platforms. You will partner with development teams to design, implement, and support services with a focus on operational resilience, automation, and compliance.
Key Responsibilities:
- Lifecycle Ownership: Engage in and improve the entire service lifecycle—from design and deployment to operations and continuous improvement.
- Operational Readiness: Ensure system availability, capacity, performance, monitoring, and self-healing capabilities are embedded throughout delivery.
- Incident Management: Practice sustainable incident response, lead blameless postmortems, and optimize Mean Time to Recovery (MTTR).
- Automation & CI/CD:
  - Develop and maintain automation pipelines for certificate renewal, traffic routing, alerting, and compliance reporting using tools such as Ansible, Venafi, and XLR templates.
  - Support CI/CD pipelines for software promotion and operational gating.
- Reliability Engineering: Scale systems sustainably through automation and advocate for changes that improve reliability and velocity.
- Compliance & Risk Management: Drive initiatives for Safety & Soundness, PCI compliance, threat/toil reduction, and ITSM defect resolution.
- Monitoring & Observability: Implement robust logging, monitoring, and alerting standards to ensure system health and proactive issue detection. This requires hands-on experience configuring monitoring and alerting in Dynatrace and Splunk.
- Collaboration: Work with global teams across multiple time zones and mentor junior engineers.
- Continuous Improvement: Provide feedback loops to development teams on resiliency gaps and operational enhancements.
- Rotational On-Call & Flexibility:
  - Participate in rotational on-call support for critical production systems.
  - Demonstrate flexibility to take on additional responsibilities and ad-hoc duties as needed to support team and organizational goals.
All About You (Skills & Qualifications)
- Experience: 5+ years in Project, Site Reliability Engineering, or DevOps roles.
Technical Expertise:
- Strong understanding of NGINX configuration and gRPC event-driven architectures.
- Proficiency in DevOps tools: Chef, Jenkins, Groovy, shell scripting, Bitbucket, Git, Ansible, XLR.
- Experience with AWS infrastructure, secure access practices, and cloud-native deployments.
Security & Compliance:
- Awareness of certificate lifecycle management, mutual TLS, the SSL/TLS handshake, SSH keys, and encryption standards.
- Familiarity with ITSM processes, compliance frameworks, and incident management.
Networking & Systems:
- Knowledge of client-server relationships, network layers (L1–L7), load balancers (F5 BIG-IP), and application firewalls.
- Ability to analyze stack traces, TCP dumps, heap/thread dumps, and perform OS-level troubleshooting.
Authentication & Authorization:
- Intermediate understanding of Active Directory, SAML, LTPA, SSO, OAuth.
Join Fulcrum Digital Inc and take your career to the next level!

