Aussie Broadband Logo

Aussie Broadband

Site Reliability Engineering Manager

Posted 5 Days Ago
Be an Early Applicant
Australia
Senior level
Australia
Senior level
As Site Reliability Engineering Manager, you will lead a team of SREs to ensure system scalability, reliability, and performance while managing workflows, incident responses, and best practices.
The summary above was generated by AI

Aussie Broadband’s (ABB) purpose is to the change the game. As our Site Reliability Engineering Manager, you'll support this by leading and building a team of SREs that ensure the scalability, reliability and performance of our systems and infrastructure.

At Aussie Broadband we believe difference is something to celebrate. Being advocates for Inclusion and Diversity means our team can bring their whole selves to work and allows us to better represent our customers and the communities that we serve. As a proud Equal Opportunity Employer, supporting and celebrating difference is just one way that we demonstrate our value of ‘Be good to people’ everyday. 

Join us as we continue to grow and make a mark as the 5th largest telco in Australia!

Why work for Aussie? 

Founded in regional Victoria almost 20 years ago, we are local from the ground up. What started in a living room in Morwell, has now expanded to every corner of Australia - we’re growing fast and not slowing down!

Our fantastic culture lives and breathes our values: 

  • Don't be ordinary, be awesome

  • Think BIG

  • No bullsh*t

  • Be good to people

  • Have fun

We are proud to be a B Corp Certified company, which means we’re good to our people, our customers, and the planet by maintaining the highest standards for social and environmental performance, transparency, and accountability.

We care about our community through our Pledge 1% commitment, sponsorship programs and our paid staff community service leave offering.  

But don’t just take our word for it – We have been named one of the top employers in Australia by HRD magazine.

The good stuff

  • 26 weeks paid parental leave for both primary and secondary caregivers (in addition to any government-paid leave)

  • Discounted internet up to the value of $109 per month

  • 20% off our Mobile services 

  • Day to day benefits like flexible working arrangements, Employee Assistance Program (EAP), discounts with big names like Specsavers, HCF and many more

  • Celebrating you! With monthly rewards and recognition

  • Internal training and resources for you to continue to learn, grow and achieve your career goals

  • Yearly allowance for amazing Aussie merch

  • Fitness Passport for access to multiple gyms and pools across Australia

Let’s talk about you

Automated, scalable, reliable, and performant systems are critical to enabling the next wave of growth for Aussie Broadband.  As our Site Reliability Engineering Manager, you will lead, mentor, and grow a team of SREs, fostering a culture of ownership, accountability, and continuous improvement. To be successful in this role you will possess:

  • Experience in a leadership or management role in an SRE, DevOps, or infrastructure team

  • An understanding of the telco industry, and reference platform architectures for cloud, telco, IaaS, and CaaS

  • Highly proficient in programming languages like python and PowerShell with a focus on automation and scripting

  • Experience with infrastructure-as-code tools like Terraform, Puppet/Ansible, ArgoCD, or equivalent

  • Experience with cloud platforms (AWS, Azure) and container orchestration (Kubernetes)

  • Experience with SuSE Rancher, Harvester and related components highly desirable

  • Experience with PostgreSQL and other SQL databases

  • Strong knowledge of Linux systems and networking

  • Experience with distributed systems, HA architectures, and fault-tolerant systems

  • Familiarity with CI/CD tools (GitLab)

  • Experience with observability tools (Prometheus, Grafana)

  • Background in security and compliance for production environments

How will you support our “Why?”

As our Site Reliability Manager, you will be responsible for the following:

  • Defining clear goals and career paths and support the professional development of your team

  • Managing team workload, on-call schedules, and performance

  • Defining and maintaining service-level objectives (SLOs), indicators (SLIs), and error budgets with key stakeholders

  • Championing reliability best-practices across the organisation and ensure systems meet uptime and performance targets

  • Leading incident response and resolution efforts, including post-incident reviews (PIRs) and root cause analysis

  • Overseeing the stability, scalability, and efficiency of production environments across Aussie Broadband’s infrastructure

  • Driving infrastructure-as-code practices and automation for deployment, configuration, and observability

  • Collaborating with architecture, product and platform engineering teams to ensure reliability is baked into the architecture and design phase

  • Owning the observability stack (monitoring, logging, tracing) and establish proactive alerting strategies

  • Leading initiatives to improve overall observability coverage, telemetry, visibility and reporting of system health

  • Improving the operational processes, incident management frameworks, and on-call rotations.

Ready to join?

Hit the apply button to submit your application and our fantastic team will be in touch!

Even if you feel you don’t meet all the requirements, we’d still love to hear your story. We like to think outside the box with the people we hire.

If you have any questions, get in touch today with our team at [email protected]

Just a heads up, we can’t take applications through email, so make sure you apply via the job link we've set for this role, so you don't miss out!

Top Skills

Ansible
Argocd
AWS
Azure
Gitlab
Grafana
Kubernetes
Linux
Postgres
Powershell
Prometheus
Puppet
Python
Suse Rancher
Terraform

Similar Jobs

3 Days Ago
Remote
Hybrid
Sydney, New South Wales, AUS
Senior level
Senior level
Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
Lead a team of Site Reliability Engineers to ensure the reliability of enterprise applications for federal clients, driving automation and continuous improvement.
Top Skills: AIAutomation ToolsAzureCloud OperationsDatabasesItilLinux
23 Days Ago
Remote
4 Locations
Senior level
Senior level
Cloud • Software
As an SRE Manager, you will lead a team of operations engineers, manage DevOps practices, mentor engineers, and ensure high-quality managed services operations.
Top Skills: Agile Software DevelopmentCloud TechnologiesDevOpsInfrastructure As CodeKubernetesLinux
Yesterday
Hybrid
Sydney, New South Wales, AUS
Mid level
Mid level
Financial Services
As a Software Engineer III, you will deliver technology solutions, produce production code, and improve software applications through data insights in an Agile environment.
Top Skills: .Net CoreAws CloudC#MySQLOracleReact

What you need to know about the Melbourne Tech Scene

Home to 650 biotech companies, 10 major research institutes and nine universities, Melbourne is among one of the top cities for biotech. In fact, some of the greatest medical advancements were conceptualized and developed here, including Symex Lab's "lab-on-a-chip" solution that monitors hormones to predict ovulation for conception, and Denteric's vaccine for periodontal gum disease. Yet, the thousands of people working in the city's healthtech sector are just getting started, to say nothing of the tech advancements across all other sectors.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account