Ensono

Ensono provides comprehensive hybrid IT solutions and governance, enabling businesses to navigate complexity and modernize their technology infrastructure, from cloud services to mainframe systems, tailored to each client's unique journey.

IT Services
1K-5K
Founded 1969

Description

  • Troubleshoot production issues, identify systemic failings, and implement fixes to restore and improve service reliability.
  • Lead the incident resolution process, create and maintain incident documentation, and provide input to post-mortem analysis.
  • Design, implement, and maintain Infrastructure as Code for scalable infrastructure deployments (Terraform/ARM/Bicep).
  • Manage and operate CI/CD pipelines and deployments, using tools such as Azure DevOps, GitHub Actions, or GitLab.
  • Configure, maintain, and optimize monitoring, alerting, and observability across systems (Datadog, New Relic, Splunk, Azure Monitor, AWS CloudWatch).
  • Propose and implement solutions to reduce operational toil and automate repetitive work.
  • Improve Service Request and Change Management processes through technical changes and stakeholder management.
  • Proactively mitigate security risks in code, infrastructure, and dependencies and participate in vulnerability management.
  • Lead client-facing discussions about SRE practices and identify opportunities to expand SRE adoption; engage with third-party suppliers for support and opportunities.

Requirements

  • 3-9 years of relevant experience.
  • Bachelor’s degree (or equivalent) in computer science or a related discipline.
  • SRE Foundation certificate (DevOps Institute) and an associate-level cloud provider certification (AWS, Azure, or GCP), or the ability to complete them during the probationary period.
  • Proficiency in Azure and Kubernetes with hands-on experience managing and deploying applications.
  • Expertise with Infrastructure as Code, particularly Terraform (ARM/Bicep and CloudFormation also acceptable).
  • Commercial experience with CI/CD tooling such as Azure DevOps, GitHub Actions, or GitLab.
  • Experience with monitoring/observability tools such as Datadog, Splunk, New Relic, Azure Monitor, or AWS CloudWatch.
  • Commercial experience in at least one core technology (.NET, Java, AI/Data Engineering, or Golang).
  • Preferred: Certified Kubernetes Administrator/Application Developer, Certified Azure DevOps Engineer, familiarity with Harness for continuous delivery, and strong programming skills in .NET, Java, or JavaScript.
