Encora

SRE

Posted 14 Days Ago
Remote
28 Locations
Senior level
The Site Reliability Engineer (SRE) focuses on integrating software engineering with infrastructure operations, ensuring software reliability and efficiency. Responsibilities include application monitoring, incident response, change management, system reliability, and collaborating with development teams to streamline processes and enhance scalability and performance.
The summary above was generated by AI

Important Information

  • Experience: More than 4 years
  • Job Mode: Full-time
  • Work Mode: Hybrid

Job Summary

  • Site Reliability Engineering (SRE) is a discipline that blends software engineering with infrastructure and operations, aimed at building scalable and highly reliable software systems.
  • Focus on application monitoring, emergency response, and change management to ensure reliability and efficiency.
  • Collaborate with development teams throughout the software lifecycle to solve system-related issues and automate routine tasks.
  • Enhance system reliability, scalability, and performance by leveraging modern tools and processes.

Responsibilities and Duties

  • Application Monitoring: Utilize tools and automation for continuous application monitoring and reliability.
  • Emergency Response: Respond promptly to emergency incidents, perform root cause analysis, and resolve ongoing production issues.
  • Change Management: Manage and streamline release and change management processes to improve system performance.
  • Collaboration: Partner with development teams to solve system issues, automate routine tasks, and eliminate toil.
  • Reliability and Scalability: Ensure systems are highly reliable, scalable, and efficient to meet performance standards.

Qualifications and Skills

  • Strong understanding of monitoring tools such as Azure Monitoring, App Insights, Prometheus, and Grafana.
  • Experience with Infrastructure as Code tools like Terraform, ARM/Bicep, or Pulumi.
  • Proficiency in release management tooling such as ArgoCD, Harness, and Octopus.
  • Familiarity with incident alert tools like PagerDuty or Opsgenie.
  • Expertise in container orchestration tools like Kubernetes and AKS.
  • Proficiency in scripting with C#, Python, Bash, or PowerShell (at least one is mandatory).
  • Strong collaboration and problem-solving abilities to resolve system issues effectively.
  • Knowledge of project tracking and version management tools like JIRA, SVN, and GitHub.

Role-specific Requirements

  • Proven experience in application monitoring and automated reliability processes.
  • Strong background in managing system reliability and performing root cause analysis during emergency responses.
  • Hands-on experience in change management processes and production environment releases.
  • Advanced knowledge of tools and practices for infrastructure automation and incident handling.
  • Familiarity with scalable system architecture principles and best practices.

Technologies

  • Monitoring Tools: Azure Monitoring, App Insights, Prometheus, Grafana
  • Infrastructure as Code: Terraform, ARM/Bicep, Pulumi
  • Release Management Tools: ArgoCD, Harness, Octopus
  • Incident Alert Tools: PagerDuty, Opsgenie
  • Container Orchestration: Kubernetes, AKS
  • Project Management Tools: JIRA, SVN, GitHub
  • Scripting: C#, Python, Bash, or PowerShell

Skillset Competencies

  • Advanced monitoring and incident management techniques.
  • Infrastructure as Code and automation of routine workflows.
  • Expertise in release and change management processes.
  • Strong knowledge of container orchestration and scalable system design.
  • Excellent communication, collaboration, and problem-solving skills.
  • Ability to work effectively in cross-functional and virtual teams.

About Encora

Encora is a trusted partner for digital engineering and modernization, working with some of the world’s leading enterprises and digital-native companies. With over 9,000 experts in 47+ offices worldwide, Encora offers expertise in areas such as Product Engineering, Cloud Services, Data & Analytics, AI & LLM Engineering, and more. At Encora, hiring is based on skills and qualifications, embracing diversity and inclusion regardless of age, gender, nationality, or background.

Top Skills

AKS
App Insights
ArgoCD
ARM/Bicep
Azure Monitoring
Bash
C#
Git
Grafana
Harness
JIRA
Kubernetes
Octopus
Opsgenie
PagerDuty
PowerShell
Prometheus
Pulumi
Python
SVN
Terraform
