Red Hat Logo

Red Hat

Site Reliability Engineer - OpenShift

Posted 19 Hours Ago
Be an Early Applicant
Remote
Hiring Remotely in India
Junior
Remote
Hiring Remotely in India
Junior
As a Site Reliability Engineer, you will develop and operate the Red Hat OpenShift Managed Cloud platform, ensuring service reliability and automating management tasks. You'll work on large-scale distributed systems, contribute to codebases, and participate in a geographically distributed on-call rotation for production support.
The summary above was generated by AI

About the job:

The Red Hat IT OpenShift team is looking for a Site Reliability Engineer (SRE) based in India (Pune or Bangalore) to join our team. In this role, you will develop, scale, and operate our Red Hat OpenShift Managed Cloud platform. Red Hat OpenShift is our enterprise kubernetes distribution. As an SRE, you will contribute to running Red Hat OpenShift at scale by enabling customer self-service, making our monitoring system more sustainable, and eliminating toil through automation.In the IT OpenShift team you will have the opportunity to influence the complex challenges of scale which are unique to Red Hat IT managed cloud platform services, while using your skills in coding, operations, and large-scale distributed system design. We develop, deploy, and maintain Red Hat’s next-generation application deployment environment for mission critical custom applications and services across a range of hybrid cloud infrastructures.  We are a global team operating on-premise and in the public cloud, using the latest technologies from Red Hat and beyond. Red Hat relies on teamwork and openness for its success. We are a global team and strive to cultivate a transparent environment that makes room for different voices. We learn from our failures in a blameless environment to support the continuous improvement of the team. At Red Hat, your individual contributions have more visibility than most large companies, and visibility means career opportunities and growth. Successful applicants must reside in a state where Red Hat is registered to do business.

What will you do:

  • Applies software engineering principles to the operations domain.

  • Contributes to a service's codebase, writes automation that aids in the management of a service, and performs operational engineering work to support a service's Service Level Objectives (SLO).

  • Ensures service reliability meets users’ needs, including internally critical and externally visible services

  • Uses software & systems engineering to design, build, and run large-scale, distributed, fault-tolerant systems

  • Focuses on iterative improvement through toil reduction and error-budget enforcement

  • Interfaces with both cloud IaaS and SaaS providers and internal stakeholders, including Support, IT, and Product Engineering, to achieve desired outcomes.

  • Participates in an on-call rotation within a geographically distributed team to provide 24x7x365 production support, with responsibility to respond to urgent customer issues

  • Practice sustainable incident response and blameless postmortems 

  •  Work within a small agile team to develop and improve SRE methodologies, support your peers, plan and self-improve

  •  Provide feedback around bugs and feature improvements to the various Red Hat Product Engineering teams

What will you bring:

  • Bachelor's degree in computer science or a related technical field involving software or systems engineering, or practical experience demonstrating interest in SRE 

  • 2+ years of experience of using cloud providers and technologies (Google, Azure, Amazon, OpenStack, etc.) 

  • 1+ years of experience administering a kubernetes-based production environment 

  • 2+ years of experience programming with at least one object-oriented language; Golang, or Python are a big plus 

  • Ability to collaboratively troubleshoot and solve problems in a team setting

  • Basic understanding of UNIX or Linux operating systems The following will be considered a plus: 

  • Demonstrated comfort with collaboration, open communication, and reaching across functional boundaries

  • Passion for understanding users’ needs and delivering outstanding user experiences 

Additional Skills:

  • Demonstrated ability to quickly and accurately troubleshoot system issues 

  •  Solid understanding of standard TCP/IP networking and common protocols like DNS and HTTP 

  • 2+ years of experience managing Linux servers running Red Hat Enterprise Linux (RHEL), CentOS, or Fedora hosted at a cloud provider such as Amazon Web Services (AWS), Google Compute Engine (GCE), or Microsoft Azure

  • 1+ years of experience with enterprise systems monitoring 

  • 2+ years of experience with enterprise configuration management software like Red Hat Ansible Automation Platform (AAP)

  • Experience with static code analysis tools

  • Some experience with code deployment across cloud-based environments

  • Some experience with continuous Integration and continuous deployment approaches

  • Some experience working with complex distributed systems 

  • Demonstrated ability to debug, optimize code and automate routine tasks 

  • Ability to work with minimal supervision and as part of a global team, and problem solving skills 

  • Experience working with agile development methodologies

About Red Hat

Red Hat is the world’s leading provider of enterprise open source software solutions, using a community-powered approach to deliver high-performing Linux, cloud, container, and Kubernetes technologies. Spread across 40+ countries, our associates work flexibly across work environments, from in-office, to office-flex, to fully remote, depending on the requirements of their role. Red Hatters are encouraged to bring their best ideas, no matter their title or tenure. We're a leader in open source because of our open and inclusive environment. We hire creative, passionate people ready to contribute their ideas, help solve complex problems, and make an impact.

Diversity, Equity & Inclusion at Red Hat
Red Hat’s culture is built on the open source principles of transparency, collaboration, and inclusion, where the best ideas can come from anywhere and anyone. When this is realized, it empowers people from diverse backgrounds, perspectives, and experiences to come together to share ideas, challenge the status quo, and drive innovation. Our aspiration is that everyone experiences this culture with equal opportunity and access, and that all voices are not only heard but also celebrated. We hope you will join our celebration, and we welcome and encourage applicants from all the beautiful dimensions of diversity that compose our global village.

Equal Opportunity Policy (EEO)
Red Hat is proud to be an equal opportunity workplace and an affirmative action employer. We review applications for employment without regard to their race, color, religion, sex, sexual orientation, gender identity, national origin, ancestry, citizenship, age, veteran status, genetic information, physical or mental disability, medical condition, marital status, or any other basis prohibited by law.


Red Hat does not seek or accept unsolicited resumes or CVs from recruitment agencies. We are not responsible for, and will not pay, any fees, commissions, or any other payment related to unsolicited resumes or CVs except as required in a written contract between Red Hat and the recruitment agency or party requesting payment of a fee.

Red Hat supports individuals with disabilities and provides reasonable accommodations to job applicants. If you need assistance completing our online job application, email [email protected]. General inquiries, such as those regarding the status of a job application, will not receive a reply.

Top Skills

Go
Python

Red Hat Melbourne, Victoria, AUS Office

Level 24, 120 Collins Street , Australia, Melbourne, VIC , Australia, 3000

Similar Jobs

2 Days Ago
Remote
India
Junior
Junior
Edtech • Software
As a Site Reliability Engineer at GoGuardian, you will work closely with product engineers to implement scalable, reliable systems. Your responsibilities include scaling backend systems, collaborating on CI/CD tools, building infrastructure, and monitoring service performance to meet SLAs.
Top Skills: Python
Yesterday
Remote
8 Locations
Mid level
Mid level
Cloud • Software
As an SRE & Gitops Engineer, you'll automate software operations, enhance infrastructure as code practices, maintain core services at Canonical, and collaborate with development teams to improve products. Responsibilities include troubleshooting, capacity planning, and using observability tools for monitoring and alerting.
Top Skills: Python
Senior level
Cloud • Information Technology • Software
The Site Reliability Engineer will implement observability solutions, develop monitoring tools, and gather system metrics. Collaboration with development teams is essential to ensure reliability and performance standards, while also identifying and resolving system issues.
Top Skills: PerlPHPPythonRuby

What you need to know about the Melbourne Tech Scene

Home to 650 biotech companies, 10 major research institutes and nine universities, Melbourne is among one of the top cities for biotech. In fact, some of the greatest medical advancements were conceptualized and developed here, including Symex Lab's "lab-on-a-chip" solution that monitors hormones to predict ovulation for conception, and Denteric's vaccine for periodontal gum disease. Yet, the thousands of people working in the city's healthtech sector are just getting started, to say nothing of the tech advancements across all other sectors.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account