Technical Product Owner – AI Operations (Resilience)
Fuel your passion for AI and Engineering and transform outcomes for CBA's people and customers!
Join Australia's largest bank as we lead the world in AI innovation and ambition!
Let's revolutionise our engineering community with cutting-edge AI tools and capabilities!
About the team
The CIO for Technology team ensures that CommBank has a world-class engineering capability and is at the forefront of technology, using innovative and emerging technologies to help our customers and support our Group Strategy. Focused on innovation, best practices, and collaboration, the team seeks a Technical Product Owner to work with our global engineering community to use AI assisted tools and capabilities in ways that improve the engineering experience and enhancing productivity and engineering capability across the bank.
About AI Powered Engineering
Through our partnerships with leading LLM and AI Developer Platforms like Anthropic, we’re providing our over 7000 engineers with world-class AI Powered Engineering tools and capabilities.
Our team is passionate about working across the organisation to unlock the potential of these tools and capabilities and help our engineers be more productive and less burdened by repetitive time-consuming tasks.
We are the team behind Project Coral, an agentic AI solution. Read more about it here: https://www.commbank.com.au/articles/newsroom/2025/08/ai-powered-engineering.html
What exactly will you do?
Own the product vision and roadmap for AIOps (Resilience) capabilities (e.g., intelligent alerting, anomaly detection, predictive capacity/latency analytics, incident copilots, automated remediation, runbook orchestration, chaos/continuous resilience testing).
Lead a cross-functional squad with an Engineering Lead to deliver incremental value: define OKRs, prioritise backlogs, plan releases, and communicate outcomes with transparent, executive-ready reporting.
Partner with SRE, platform, cloud, and application teams to instrument SLIs/SLOs, reduce MTTR/MTTD, and industrialise practices like error budgeting, capacity management, change risk scoring, and failure-mode analysis.
Integrate AIOps with core operational tooling and processes (e.g., observability stacks, CMDB, ITSM/ITIL workflows such as ServiceNow; on-call and incident tooling like PagerDuty/Opsgenie; runbook and automation platforms).
Translate operational pain points into clear requirements and acceptance criteria; embed telemetry, feedback loops, and robust product analytics to validate value realisation (noise reduction, toil elimination, stability uplift).
Navigate complex stakeholder landscapes—engineering, cyber, risk, compliance, and business operations—aligning priorities, managing dependencies, and balancing innovation with control obligations.
Embed Responsible AI guardrails in operational use cases (explainability, human-in-the-loop for remediation, access controls, monitoring and evaluation of AI models).
Drive adoption, enablement, and change management at scale—training, communications, playbooks, and communities of practice to uplift operational maturity and behaviour.
Remain current on AI and reliability trends (LLMs for summarisation/RCA, RAG over runbooks, causal/seasonal anomaly detection, event correlation, pattern mining, chaos engineering, resilience testing) and apply them pragmatically to real environments.
About you
An experienced leader with strong collaboration, communication, and reporting skills, effectively communicating technical updates to diverse audiences.
Proven Product Ownership experience in technology operations, SRE, or platform enablement products in large/regulated organisations
Strong familiarity with SRE and resiliency practices: SLIs/SLOs, error budgets, incident/major incident management, post-incident review, chaos testing, DR/BCP, and capacity/performance engineering.
Comfortable with modern observability and AIOps ecosystems (for example: Prometheus/Grafana, Splunk/Elastic, Datadog, Dynatrace/New Relic, Open Telemetry; PagerDuty/Opsgenie; Kubernetes; public cloud; automation/runbooks). You don’t need to build them, but you can engage credibly.
Demonstrated ability to reduce alert fatigue and operational toil through correlation, deduplication, enrichment, and automation; measurable impact on MTTR/MTTD and service availability.
Skilled in Agile delivery, product discovery, and OKR-driven prioritisation; adept with Jira and Confluence; excellent storytelling and executive communication.
Fair working knowledge of AI techniques applied to operations (event correlation, anomaly detection, LLM-based incident summarisation, retrieval over knowledge bases, auto-remediation patterns) and a strong interest in technology.
Experience working within risk, security, privacy, and compliance frameworks; able to align with operational resilience standards and regulatory expectations (e.g., change controls, operational resilience, DR testing).
Curious, resilient, and bias-to-action—comfortable challenging assumptions, asking the hard questions, and unblocking delivery in a complex, matrixed environment.
If this sounds like you, apply today!
We offer hybrid work flexibility, part-time options, job sharing, and adjustable hours. Let’s discuss what works for you.
If you're already part of the Commonwealth Bank Group (including Bankwest, x15ventures), you'll need to apply through Sidekick to submit a valid application. We’re keen to support you with the next step in your career.
We're aware of some accessibility issues on this site, particularly for screen reader users. We want to make finding your dream job as easy as possible, so if you require additional support please contact HR Direct on 1800 989 696.