Razer Jobs

Senior Site Reliability Engineer

Razer

Senior Site Reliability Engineer

Reposted 3 Days Ago

Be an Early Applicant

In-Office or Remote

Hiring Remotely in South

Senior level

In-Office or Remote

Hiring Remotely in South

Senior level

The Senior Site Reliability Engineer will design and maintain Infrastructure as Code solutions, enhance cloud infrastructure, lead incident responses, and mentor junior engineers.

The summary above was generated by AI

Joining Razer will place you on a global mission to revolutionize the way the world games. Razer is a place to do great work, offering you the opportunity to make an impact globally while working across a global team located across 5 continents. Razer is also a great place to work, providing you the unique, gamer-centric #LifeAtRazer experience that will put you in an accelerated growth, both personally and professionally.

Job Responsibilities :

We are seeking a skilled and driven Senior Site Reliability Engineer (SRE) to join our growing infrastructure and platform engineering team. The ideal candidate will have hands-on experience in Amazon Web Services (AWS), strong troubleshooting capabilities, and a passion for building scalable, observable, and resilient systems using modern Infrastructure as Code (IaC) and automation tools.

REQUIREMENTS:

Bachelor’s degree in Computer Science, Software Engineering, Information Technology, or a related field.
Minimum 3 years of experience in SRE, DevOps, cloud infrastructure, or system administration roles.
Hands-on expertise with AWS Cloud Services, including:
Compute & Containerization: EC2, Lambda, ECS, EKS, Auto Scaling
Networking: Load Balancers, VPC, Route 53, Security Groups, Firewalls
Storage & Databases: RDS, ElastiCache, Athena, S3
Messaging: SQS, SES
Deep understanding of Infrastructure as Code (IaC) tools such as Terraform and CloudFormation.
Proficiency in at least one programming/scripting language: Python, Node.js, Bash, Ruby, or related.
Experience operating and troubleshooting across Linux, Windows, and container-based environments.
Strong understanding of distributed systems, cloud networking (routers, switches), firewalls, DNS, and HTTP/TLS.
Experience implementing monitoring and alerting systems and working with incident management processes.
Experience with Zero Downtime Deployments, blue/green or canary deployments.
Familiarity with cost optimization and right-sizing AWS resources.
Exposure to multi-region, multi-account AWS architecture.
Understanding of API gateway, or edge networking (e.g., Akamai, CloudFront).

JOB DESCRIPTION:

Design, implement, and maintain Infrastructure as Code (IaC) solutions using Terraform and/or CloudFormation across multi-account AWS environments.
Collaborate with developers, architects, and DevOps teams to build scalable, secure, and observable cloud infrastructure.
Lead and participate in architecture design sessions, focusing on system reliability, scalability, security, and performance.
Implement and manage robust monitoring, alerting, and observability solutions (e.g., CloudWatch, Prometheus, ELK, Datadog).
Set and monitor Key Performance Indicators (KPIs) for system uptime, latency, throughput, and overall reliability.
Drive incident response processes, including coordination, triaging, resolution, documentation, and post-incident reviews (PIRs).
Supervise and mentor junior SREs and infrastructure engineers, fostering knowledge-sharing and team growth.
Collaborate across development, operations, and security teams to ensure secure and compliant deployments.
Automate manual tasks and workflows through scripting and tooling (Python, Node.js, Bash, Ruby, JSON/YAML).
Troubleshoot complex infrastructure issues across Linux, Windows, Docker, and cloud-native environments.
Provide IaC and CI/CD best practices to ensure repeatability, scalability, and compliance across all environments.
Provide on-call support, participate in incident rotations, and lead technical investigations during outages or degradations.
Strong understanding and experience for Disaster Recovery (DR).
Provide support and solution handling to incident and tickets assigned.

Pre-Requisites :

Razer is proud to be an Equal Opportunity Employer. We believe that diverse teams drive better ideas, better products, and a stronger culture. We are committed to providing an inclusive, respectful, and fair workplace for every employee across all the countries we operate in. We do not discriminate on the basis of race, ethnicity, colour, nationality, ancestry, religion, age, sex, sexual orientation, gender identity or expression, disability, marital status, or any other characteristic protected under local laws. Where needed, we provide reasonable accommodations - including for disability or religious practices - to ensure every team member can perform and contribute at their best.

Are you game?

Similar Jobs

Falcon Funded

UGC Creator - Cameroon

Yesterday

Remote

Entry level

Fintech • Financial Services

Record short vertical, natural-performance videos on your phone per provided scripts/guidelines. Deliver raw footage only; film additional b-roll clips (holding/interacting with phone) with good lighting and clean background.

Astro (astro.com)

Intern, Commercial

Yesterday

Remote

Internship

News + Entertainment

Support sales operations and campaign execution by assisting with campaign setup in RTS, data entry, record-keeping, cross-functional coordination, reporting (PCRs/QBRs), finance liaison for invoicing, and preparing performance insights to aid upsell and campaign optimization.

Top Skills: Google SheetsExcelRts Systems

Razer

Lead Software Engineer

Yesterday

In-Office or Remote

Senior level

Gaming • Hardware

Lead the backend system development at Razer Gold, modernize systems, engage with partners, design high-quality code, and address technical challenges.

Top Skills: .NetAWSJavaJenkinsMicrosoft Sql ServerMySQLPostgres

What you need to know about the Melbourne Tech Scene

Home to 650 biotech companies, 10 major research institutes and nine universities, Melbourne is among one of the top cities for biotech. In fact, some of the greatest medical advancements were conceptualized and developed here, including Symex Lab's "lab-on-a-chip" solution that monitors hormones to predict ovulation for conception, and Denteric's vaccine for periodontal gum disease. Yet, the thousands of people working in the city's healthtech sector are just getting started, to say nothing of the tech advancements across all other sectors.