ShiftCare Logo

ShiftCare

Site Reliability Engineer

Reposted 8 Days Ago
Be an Early Applicant
Remote
Hiring Remotely in Australia
Senior level
Remote
Hiring Remotely in Australia
Senior level
The System Reliability Engineer will improve CI/CD pipelines, enhance observability, build resilient systems, and drive change in a startup environment.
The summary above was generated by AI
Description

We're on the lookout for a passionate and exceptional reliability engineer to join our dynamic team and help us transform the homecare industry. Rally with us in creating meaningful experiences for our hyper-growth healthcare startup.

Why ShiftCare?

We're not just optimising resources; we're enhancing experiences. Our purpose-built solution is changing the game for support providers in Australia and North America, making care accessible and affordable for all.

About this role:

  • Enjoy ownership and responsibility, with a bias towards identifying problems and proposing and implementing solutions.
  • Strong experience with Ruby on Rails, especially in production SaaS systems.
  • Deep knowledge of background job processing (Sidekiq or similar), caching, and distributed systems.
  • Proven experience improving CI/CD pipelines, we currently use CircleCI but don't discard a migration.
  • Comfortable designing and improving observability stacks (New Relic, Datadog, Honeycomb, etc.).
  • Experience building resilient systems — retries, back-offs, queueing, circuit breakers, graceful degradation, kill switches, isolation of workloads, etc.
  • Strong focus on developer ergonomics and reliability culture.
  • Bias toward action and delivering tools that improve system behaviour and developer happiness.

What you’ll do

  • Own and improve our CI/CD pipelines (CircleCI), reducing deploy times and failure rates.
  • Build reliable retry/back-off mechanisms for critical user workflows.
  • Design and implement observability tooling, including synthetic checks, smoke tests, etc.
  • Help architect and implement failover and fallback mechanisms for critical vendors and workflows.
  • Work with Support to build debug tooling and dashboards that empower non-engineers.
  • Collaborate with engineering to define and template runbooks, kill switches, and disaster mitigation patterns.
  • Champion performance tuning.
Requirements

Key requirements

  • Must have worked in a company known for its world-class engineering standards and global scale.
  • Must have a great track record working as a member of an engaged team, beyond the theory that someone would learn in a leadership book and have had some hiccups along the way and learned from them.
  • Must help bring the team up to their expected standards: Organise learning sessions and find development opportunities for everyone in their team.
  • Must have an appetite for a startup environment, enjoy making decisions and drive change. Have a good framework to pick a good balance between doing things perfectly and doing something subpar: What’s the minimum work we can do to solve a problem/feature properly and move on to the next problem?
  • Deep experience with technical problem-solving and code reviews within a Ruby environment
  • Must have the experience and will to drive the successful completion of projects within established timelines and quality standards and motivate and guide team members to do the same
  • Experience building resilient systems — retries, back-offs, queueing, circuit breakers, graceful degradation, kill switches, isolation of workloads

Top Skills

CircleCI
Datadog
Honeycomb
New Relic
Ruby On Rails
Sidekiq

Similar Jobs

17 Hours Ago
Remote or Hybrid
Sydney, New South Wales, AUS
Internship
Internship
Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
As a Graduate Site Reliability Engineer, you will enhance infrastructure reliability, automate processes, and leverage skills in software development and networking.
Top Skills: JavaScriptLinuxPython
13 Days Ago
Easy Apply
In-Office or Remote
32 Locations
Easy Apply
Mid level
Mid level
Cloud • Security • Software • Cybersecurity • Automation
As an Intermediate Site Reliability Engineer at GitLab, you will manage and automate infrastructure for production systems, respond to incidents, and collaborate with teams to enhance incident management.
Top Skills: AWSBashGCPGitlabGoKubernetesRubyTerraform
3 Days Ago
In-Office or Remote
2 Locations
Senior level
Senior level
Software
As a Staff Site Reliability Engineer, you will define engineering practices, lead technical discovery, provide hands-on support, mentor engineers, and contribute to long-term architectural decisions while collaborating across teams.
Top Skills: .NetC#Cloud-Native Platforms

What you need to know about the Melbourne Tech Scene

Home to 650 biotech companies, 10 major research institutes and nine universities, Melbourne is among one of the top cities for biotech. In fact, some of the greatest medical advancements were conceptualized and developed here, including Symex Lab's "lab-on-a-chip" solution that monitors hormones to predict ovulation for conception, and Denteric's vaccine for periodontal gum disease. Yet, the thousands of people working in the city's healthtech sector are just getting started, to say nothing of the tech advancements across all other sectors.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account