The Lead Site Reliability Engineer will build, deploy, and manage microservices in Kubernetes, optimize cloud applications, and integrate emerging technologies in AI and GenAI, ensuring high reliability and scalability.
Coupa makes margins multiply through its community-generated AI and industry-leading total spend management platform for businesses large and small. Coupa AI is informed by trillions of dollars of direct and indirect spend data across a global network of 10M+ buyers and suppliers. We empower you with the ability to predict, prescribe, and automate smarter, more profitable business decisions to improve operating margins.
Why join Coupa?
🔹 Pioneering Technology: At Coupa, we're at the forefront of innovation, leveraging the latest technology to empower our customers with greater efficiency and visibility in their spend.
🔹 Collaborative Culture: We value collaboration and teamwork, and our culture is driven by transparency, openness, and a shared commitment to excellence.
🔹 Global Impact: Join a company where your work has a global, measurable impact on our clients, the business, and each other.
Learn more on the Life at Coupa blog and hear from our employees about their experiences working at Coupa.
The Impact of a Cloud Software Engineer / Lead DevOps Engineer at Coupa:
If you are passionate about new technologies, have a strong technical background, and are looking for an environment where you can continuously expand your knowledge, you are the right fit for this role. At Coupa, the “Cloud team” is looking for a Lead engineer who is ready to constantly question the status quo with a mixture of system design, code development, deployment, automation, networking, and experience in managing Machine Learning / GenAI / Agentic AI platforms.
What You'll Do:
- Build, deploy, and troubleshoot microservices in Kubernetes and Amazon EKS, ensuring scalability and reliability.
- Design secure, highly available web applications with a focus on capacity planning and performance optimization.
- Deploy and manage the lifecycle of LLMs and embedding models, defining KPIs to measure and improve AI application performance.
- Evaluate and integrate emerging technologies such as RAG systems, MCP servers, AI Agents, and agentic workflows into our platform.
- Manage AWS core and GenAI services (S3, IAM, EKS, Bedrock, etc.) using infrastructure-as-code tools like Terraform and Chef, while maintaining observability through tools like New Relic or PagerDuty.
- Collaborate across product, platform, and engineering teams on architecture design, security patching, incident response, and release management to ensure the reliability of our ML and GenAI infrastructure.
What You Will Bring to Coupa:
- Bachelor’s degree and 8+ years of experience managing large-scale cloud applications with a strong background in Linux administration and troubleshooting. Excellent communication skills, a collaborative mindset, and the confidence to take ownership, drive solutions, and deliver results independently while thinking globally.
- Over 5 years of hands-on experience managing cloud infrastructure across AWS, GCP, and Azure environments.
- A solid understanding of today’s generative AI ecosystem, with practical experience using LLMs and embedding models (OpenAI, AWS Bedrock, SageMaker); familiarity with vector databases like LanceDB is a plus.
- Strong scripting skills in Bash or Python, and experience with container orchestration platforms like Amazon EKS or Azure AKS.
- Proficiency with DevOps and automation tools such as Chef, GitHub Actions, Rundeck, and IaC frameworks like Terraform, Spacelift, and Helm.
- Working knowledge of DNS, load balancers, and MySQL, along with a good grasp of source control and branching strategies in Git.
Coupa complies with relevant laws and regulations regarding equal opportunity and offers a welcoming and inclusive work environment. Decisions related to hiring, compensation, training, or evaluating performance are made fairly, and we provide equal employment opportunities to all qualified candidates and employees.
Please be advised that inquiries or resumes from recruiters will not be accepted.
By submitting your application, you acknowledge that you have read Coupa’s Privacy Policy and understand that Coupa receives/collects your application, including your personal data, for the purposes of managing Coupa's ongoing recruitment and placement activities, including for employment purposes in the event of a successful application and for notification of future job opportunities if you did not succeed the first time. You will find more details about how your application is processed, the purposes of processing, and how long we retain your application in our Privacy Policy.

