ShiftCare
Reliability Engineer
ShiftCareAustralia4 hours ago
Full-timeEngineering
About ShiftCare

ShiftCare's innovative software is a market leader which helps disability support providers, in-home aged carers and allied health professionals worldwide streamline the way they work by creating efficiencies in rostering, client management and billing, enabling businesses to grow.

About The Team You'll Be Joining

The ShiftCare Engineering team is on a mission to make ShiftCare accessible to more businesses worldwide, but this is just the beginning of our journey. Doubling in revenue every year, we are having a fun time scaling our systems and processes appropriately.

Positioned within ShiftCare's Engineering team you'll be joining our foundations team to help propel our platform reliability into the future! At ShiftCare we treat our work as a craft and are passionate about reliability, security, our customers, and doing more with less.

The Role

As ShiftCare scales, the requirements of our platform and team evolve. But we like to think that's all part of the fun!

You'll play a key role in shaping how ShiftCare delivers reliable, secure, and high-performing services to thousands of care providers every day.

To give you a taste of the cool things you'll be working on when you start, you'll:

  • Own and improve our CI/CD pipelines (CircleCI), reducing deploy times and failure rates
  • Design and implement observability tooling - from synthetic checks and smoke tests to meaningful alerts and dashboards
  • Build reliable retry and back-off mechanisms for critical user workflows
  • Help architect and implement failover and fallback mechanisms for critical vendors and workflows
  • Work with Support to build debug tooling and dashboards that empower non-engineers
  • Collaborate with engineering to define and template runbooks, kill switches, and disaster mitigation patterns
  • Champion performance tuning and scalability improvements
  • ... and many other things, driven by you!

You will be joining our foundations team, playing a pivotal role in the next evolution of our technology stack and devops processes.

About You

  • You thrive on ownership. Identifying problems, proposing solutions, and driving them to completion
  • You're passionate about reliability, observability, and building robust distributed systems
  • You bring experience working in a modern SaaS environment, have learnt lessons along the way, and are eager to apply that expertise in a new context
  • You have deep knowledge of background job processing, eventing, caching, and distributed systems
  • You have proven experience improving CI/CD pipelines. We currently use CircleCI but don't discard a migration
  • You're comfortable designing and improving observability stacks (New Relic, Datadog, Honeycomb, etc.)
  • You've built resilient systems using retries, back-offs, queueing, circuit breakers, graceful degradation, kill switches, isolation of workloads, etc
  • You care deeply about developer ergonomics and fostering a culture of reliability
  • You have a bias toward action. Delivering tools that improve both system behavior and developer happiness

Key Skills

Ranked by relevance