Track This Job
Add this job to your tracking list to:
- Monitor application status and updates
- Change status (Applied, Interview, Offer, etc.)
- Add personal notes and comments
- Set reminders for follow-ups
- Track your entire application journey
Save This Job
Add this job to your saved collection to:
- Access easily from your saved jobs dashboard
- Review job details later without searching again
- Compare with other saved opportunities
- Keep a collection of interesting positions
- Receive notifications about saved jobs before they expire
AI-Powered Job Summary
Get a concise overview of key job requirements, responsibilities, and qualifications in seconds.
Pro Tip: Use this feature to quickly decide if a job matches your skills before reading the full description.
LearnUpon LMS helps organizations train their employees, partners, and customers. Businesses can manage, track, and achieve their unique learning goals — all through a single, powerful solution.
With offices in Dublin (our HQ), Belgrade, Philadelphia, Salt Lake City and Sydney, we are a global team with lots of diverse cultures, backgrounds, and experiences that puts our customers' experience at the heart of everything we do. Our culture fosters an open, collaborative and supportive environment where our accomplishments are celebrated and encouraged. We're always striving for the best solution (not the easy one). We’re proud of our success and we’re humble and hungry to achieve more.
You will be part of the SRE Team, which sits within LearnUpon’s Engineering group. We are a small team focused on developing and supporting our cloud infrastructure and app services, to ensure platform scalability and site uptime. Our flagship product is coded predominantly in Ruby on Rails, with data managed through a common mix of current SaaS back-end technologies including AWS backed services. We also use local containerised development environments. However, we are not bound to our tech stack. We prefer choosing the right technology for the right problem so you’ll have plenty of space to grow your skills. We are key consultants for the entire company on matters of infrastructure feasibility.
What will I be doing?
Responsibilities
As a Staff Engineer in Site Reliability Engineering, you will be part of the team responsible for the scale-out of the LearnUpon infrastructure. Specifically, the main responsibilities are:
- Drive architectural improvements and best practices across the organisation for system reliability, scalability, and performance.
- Lead the design and implementation of automated solutions for infrastructure, deployments, and monitoring.
- Mentor and guide junior SREs, fostering a culture of operational excellence and continuous learning.
- Participate in the on-call rotation and respond to and resolve complex production incidents, performing root cause analysis and implementing preventative measures.
- Collaborate with engineering teams to ensure that new features and services are designed with reliability and ease of operations in mind from the outset.
- 7+ years of experience in a software or Ops role
- 5+ years of cloud engineering experience, with at least 2 years experience with AWS
- Experience in designing and implementing Observability tech stacks
- Have championed the benefits of Observability to Engineering teams
- Can architect the design of SLO/SLI implementation that balances the needs of different teams
- Familiar with cost analysis of Observability metrics gathering, Engineering effort, and tooling
- Experience building and supporting large-scale distributed systems that back a consumer app or website with associated requirements of performance, security and disaster recovery
- Experience deploying Microservice environments, using containerisation technologies such as Kubernetes, Docker
- Experience with implementing IaaC (e.g. CloudFormation, Terraform etc.), automation tooling (e.g. Puppet, Ansible etc.), CI/CD (e.g. Jenkins, Travis CI, GitLab etc.)
- Able to effectively communicate technical ideas to and collaborate with both technical and non-technical peers
- Experience with database scaling would be a strong plus
Not required but considered a big plus
- Certification in AWS, any PaaS, and/or related technologies
- Competitive salary and company ESOP
- Comprehensive private health insurance scheme and Company pension scheme
- 25 days annual leave + 1 annual company wellness day off
- Work in a fun and supportive environment with regular team events
- Excellent career progression - take LearnUpon where you think it can go
Our Typical Process Generally Works As Follows
- Qualified applicants will be invited to schedule a screening call
- Successful candidates will then be invited to a series of practical interviews
- Finally, candidates will have a short interview with a member of our C-Suite Team
- The successful candidate will be contacted with an offer to join our team
We do not discriminate on the basis of gender, marital status, family status, age disability, sexual orientation, race, religion, membership of the Traveller community, or any other legally protected status.
By applying for this job, you agree to LearnUpon's Privacy Policy. Find out more about our privacy policy here
Visit our Careers site to find out more about working for LearnUpon, and check us out on Instagram.
Key Skills
Ranked by relevanceReady to apply?
Join LearnUpon and take your career to the next level!
Application takes less than 5 minutes