Ensono

Ensono provides comprehensive hybrid IT solutions and governance, enabling businesses to navigate complexity and modernize their technology infrastructure, from cloud services to mainframe systems, tailored to each client's unique journey.

IT Services
1K-5K
Founded 1969

Description

  • Act as a technical escalation point for unresolved data platform issues within the SRE pod.
  • Monitor, maintain, and troubleshoot databases, data warehouses, and related infrastructure.
  • Collaborate with the data engineering team to ensure efficient data flow and transformation.
  • Develop and maintain operational runbooks and other technical documentation.
  • Perform pre-approved changes within the client’s change management process, such as user administration.
  • Use helpdesk and work tracking systems to log support requests and incidents and improve related processes.
  • Participate in security management processes by proactively mitigating risks in code, infrastructure, and dependencies.
  • Engage with suppliers and third parties to support client requests and improve service value.
  • Lead incident resolution activities, including post-mortem analysis and mitigation planning.
  • Identify opportunities to reduce toil, improve automation, and strengthen service request and change management processes.

Requirements

  • Experience working across data platforms, databases, or data warehousing environments.
  • Demonstrable experience in multiple core technologies such as .NET, Java, AI/data engineering, or Golang.
  • Experience with infrastructure-as-code tooling, preferably Terraform, or alternatively ARM/Bicep or CloudFormation.
  • Experience with core CI/CD tooling such as Azure DevOps, GitHub Actions, or GitLab.
  • Experience with monitoring tools such as DataDog, Splunk, New Relic, Azure Monitor, or AWS CloudWatch.
  • Experience troubleshooting incidents, identifying systemic failures, and implementing fixes and features.
  • Experience providing leadership in incident resolution and maintaining documentation for post-mortems and mitigation plans.
  • Experience designing and improving service request, change management, and security management processes.
  • Ability to lead client-facing discussions about SRE processes and opportunities for additional work.
  • A DevOps Engineer-level certification from AWS, Azure, or GCP, together with CKAD certification, is highly beneficial; candidates without them are required to obtain them during the probationary period.

Benefits

  • Competitive base salary with uncapped commission.
  • Flexible work locations.
  • 27 days annual leave plus bank holidays, increasing to 30 days.
  • Half-day leave on your birthday.
  • Sabbatical options at 5 and 10 years of service.
  • 5 days of study leave.
  • Generous company pension.
  • Private healthcare for you and your family.
  • Enhanced paternity and maternity leave.
  • Equity appreciation program incentive plan.
  • Life and income protection.
  • Discounted gym memberships, cycle scheme, and employee assistance program support.
