Xero Logo

Xero

Lead Site Reliability Engineer (Observability)

Posted 3 Days Ago
Be an Early Applicant
Remote
Hybrid
5 Locations
Senior level
Remote
Hybrid
5 Locations
Senior level
Lead the observability strategy as a hands-on technical leader, enhancing system reliability and performance. Mentor engineers, promote best practices, and support team growth in observability and engineering excellence.
The summary above was generated by AI

Our Purpose 

At Xero, we’re here to help you supercharge your business. We do this by automating routine tasks, surfacing actionable insights and connecting businesses with the right data, advisors and apps. When that happens, we’re not only making life better for small business, we’ll be building a stronger economy that can change the world.


About the team


In Site Reliability Engineering (SRE), we drive and influence Xero to provide the most reliable experience for our customers. We are a global team based across New Zealand, Australia and the USA. 


We combine software and systems engineering to enable engineers across Xero to build and support products that are observable, stable, performant, tolerant to failure, and operate as intended in the face of varying conditions.


Our team builds and implements sophisticated monitoring and remediation toolsets to  support best in class observability, reliability, operational excellence and engineering productivity at Xero.


About the role


We’re looking for a Lead Engineer to help drive our observability strategy and uplift our engineering capabilities. This role will take ownership of shaping observability at Xero, driving the adoption of OpenTelemetry and modern solutions to empower teams in building reliable, high-performing services.


This is a hands-on technical leadership role, where you’ll use your expertise to shape the way we monitor, measure, and improve our systems reliability and performance. Your work will have a lasting impact, empowering teams to build scalable, high-performing, and resilient systems.


This role offers the opportunity to lead by example, providing hands-on technical mentorship and guiding engineers to develop their skills and adopt best practices.


Cross-functional collaboration is essential, requiring you to work closely with Product Managers, Team Leads, Principal Engineers, and other technical leaders to align team efforts with broader SRE and company goals. A strong focus on the customer experience will be paramount, ensuring our observability systems enable teams to proactively detect and resolve issues, delivering a seamless and self-service experience for Xero's users.

What you'll do:

  • Drive Observability and Engineering Excellence: Design and implement observability solutions that enhance Xero’s engineering practices, enabling teams to build more reliable software. Guide technical design, ensure adherence to architectural principles, and remove technical blockers to improve development efficiency.
  • Improve System Reliability and Champion Best Practices: Identify and address failure patterns to proactively enhance system reliability. Define and evolve observability and reliability standards, advocating for best practices in system instrumentation, monitoring, logging, tracing, and alerting. Promote automation, agile, DevOps, and CI/CD methodologies to improve software delivery speed and quality while reducing operational toil.
  • Support Team Growth and Recruitment: Help build and nurture a diverse and talented engineering team by participating in hiring and recruitment. Create an inclusive and collaborative environment where engineers feel empowered to innovate and succeed.

What you'll bring:

  • Strong Observability Expertise – Deep knowledge of reliability and observability concepts, including experience implementing observability in large, distributed cloud environments (ideally AWS). Hands-on experience with monitoring and logging tools such as Prometheus, VictoriaMetrics, Jaeger, New Relic, Datadog, Dynatrace, SignalFX, Scalyr, SumoLogic, or Splunk.
  • Technical Leadership in Software and Infrastructure – Proficiency in one or more programming languages such as C#, JavaScript, Golang, or Python
  • Experience in Incident Response and Operational Excellence – Previous experience in on-call rotations and resolving production incidents in complex environments. Ability to analyze and prevent system failures through proactive reliability improvements.
  • Agile and Collaborative Mindset – Experience working in agile software development environments with continuous integration and delivery (CI/CD). Ability to structure and prioritize work effectively to maximize the team’s impact.
  • Strong Stakeholder Engagement and Influence – Proven ability to build relationships, engage, and influence internal stakeholders across teams and disciplines. Comfortable working in a large-scale software delivery organization with a strong focus on architectural best practices.
  • Platform Ownership and Scalability – Experience managing and maintaining healthy observability platforms that support a large and diverse user base.

Why Xero? 

Offering very generous paid leave to use however you’d like (plus statutory holidays!), dedicated paid leave to care for your physical and mental wellbeing as well as an Employee Assistance Program to access mental health care for you and your family, health insurance, life insurance, and income protection, wellbeing and sports programmes, employee resource groups, 26 weeks of paid parental leave for primary caregivers, an Employee Share Plan, beautiful offices, flexible working, career development, and many other benefits that reflect our human value, you’ll do the best work of your life at Xero.

Top Skills

AWS
C#
Datadog
Dynatrace
Go
Jaeger
JavaScript
New Relic
Prometheus
Python
Scalyr
Signalfx
Splunk
Sumologic
Victoriametrics
HQ

Xero Hawthorn West, Victoria, AUS Office

Xero Melbourne (HQ) Office

Xero’s head office in Australia is in the buzzing suburb of Hawthorn, a stone’s throw from the CBD. Here, a diverse mix of Xeros work in both global and regional teams.

Similar Jobs at Xero

5 Days Ago
Remote
Hybrid
5 Locations
Mid level
Mid level
Cloud • Fintech • Information Technology • Machine Learning • Software
As a Software Engineer at Xero, you will enhance product features and maintain critical applications, focusing on collaboration, learning, and customer needs.
Top Skills: .NetAWSC#Ci/CdDockerKubernetesReactTypescript
20 Days Ago
Remote
Hybrid
5 Locations
Senior level
Senior level
Cloud • Fintech • Information Technology • Machine Learning • Software
As a Senior Software Engineer at Xero, you will enhance features of the software, build critical applications, and collaborate with cross-functional teams while supporting a culture of growth and inclusivity.
Top Skills: .NetAWSC#Ci/CdDockerKubernetesReactTypescript
21 Days Ago
Remote
Hybrid
5 Locations
Senior level
Senior level
Cloud • Fintech • Information Technology • Machine Learning • Software
The Senior Engineer will integrate Syft Analytics into Xero's ecosystem, deliver impactful projects, mentor team members, and enhance software development practices.
Top Skills: .NetAWSGCPReact

What you need to know about the Melbourne Tech Scene

Home to 650 biotech companies, 10 major research institutes and nine universities, Melbourne is among one of the top cities for biotech. In fact, some of the greatest medical advancements were conceptualized and developed here, including Symex Lab's "lab-on-a-chip" solution that monitors hormones to predict ovulation for conception, and Denteric's vaccine for periodontal gum disease. Yet, the thousands of people working in the city's healthtech sector are just getting started, to say nothing of the tech advancements across all other sectors.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account