Arbor

Arbor is the leading cloud MIS provider in the UK, empowering schools and MATs to collaborate effectively, save time, and enhance pupil achievement through centralized data management and insightful analytics.

IT Services

Information Technology

51-250 (180)

12 open positions

Site Reliability Engineer

1 hour, 55 minutes ago

United Kingdom

Full-time

Mid Level

Site Reliability Engineer (SRE)

DevOps and Infrastructure

Agile, Datadog, Docker, Kanban, Nginx, Prometheus, Terraform

Description

Proactively monitor and analyse platform performance.
Collaborate with engineering teams to identify and resolve performance bottlenecks.
Support the implementation and review of service level objectives (SLOs).
Improve observability through monitoring, alerting, and dashboards using tools such as Datadog or Prometheus.
Ensure services remain highly available and resilient.
Champion high-availability design best practices.
Devise runbooks and run game day sessions to test disaster recovery, high availability, and backup plans.
Assess capacity and plan for current and future scaling needs.
Work with platform, feature, and support teams, and other stakeholders, to embed SRE practices and maintain service levels.
Lead incident response and troubleshooting, resolving issues rapidly and minimising downtime.
Participate in blameless postmortems to identify root causes and corrective actions.
Develop and maintain playbooks and documentation.

Requirements

Experience in performance monitoring and analysis.
Capacity planning experience.
Scripting and automation skills with experience in relevant technologies.
Experience with Infrastructure as Code, particularly Terraform.
Understanding of relational database technologies and their managed cloud equivalents, such as AWS Aurora.
Experience with messaging and distributed asynchronous workloads.
Experience with nginx or similar technologies.
Familiarity with SRE processes and DevOps principles such as the Three Ways and the Five Ideals.
Bonus: experience with other database technologies and cloud platforms.
Bonus: past experience with enterprise solutions running at scale.
Bonus: familiarity with Kanban and Agile development processes.
Bonus: experience with containerisation, for example Docker.
Bonus: familiarity with software best practices such as Refactoring, Clean Code, Domain-Driven Design, and Test-Driven Development.

Benefits

Salary of £60,000 - £70,000.
32 days holiday, including Bank Holidays, made up of 25 days annual leave plus 7 company-wide days over Easter, Summer, and Christmas.
Life assurance at 3x annual salary.
Comprehensive wellness support through AIG Smart Health, including a 24/7 virtual GP, mental health support, counselling, and personalised health checks.
Private dental insurance with Bupa.
Salary sacrifice pension provided by Scottish Widows.
Enhanced maternity and adoption leave with 20 weeks at full pay, and paternity leave with 6 weeks at full pay.
Flexible working arrangements.
Dedicated professional development budget for CPD courses, upskilling resources, and professional memberships.
Access to Calm and Bippit for mental health and financial wellbeing support.
A dedicated wellbeing team and social committees that support employee wellbeing and team events.
Volunteer for a charity of your choice for one day each year.
Dog-friendly offices.
Refer-a-friend voucher worth up to £200.
No visa sponsorship available at this time.

Similar Roles

Site Reliability Engineer

66degrees, 251-1K, IT Services

66degrees is hiring a Site Reliability Engineer to help enterprise cloud clients maintain, optimize, and scale Google Cloud environments through reliability engineering, automation, and incident response.

Canada, Mid Level, Site Reliability Engineer (SRE)

Agile, Datadog, GCP, JIRA, Kanban, Kubernetes, Linux, Prometheus, Python, Scrum, SQL Server, Terraform

1 hour, 39 minutes ago

Senior Software Engineer - Grafana Databases, Managed Services | Spain | Remote

Grafana, 1K-5K, IT Services

Grafana Labs is hiring a Senior Software Engineer for its Managed Services team to run and evolve production-critical database infrastructure powering Grafana Cloud’s next-generation metrics, logs, and traces services.

Spain, Full-time, Senior, Database Administrator, Site Reliability Engineer (SRE)

$90k-$108k

AWS, Azure, Cassandra, ClickHouse, GCP, Go, Grafana, Helm, Kafka, Kubernetes, Linux, PostgreSQL, Snowflake, Terraform

1 hour, 39 minutes ago

Senior Software Engineer - Grafana Databases, Managed Services | UK | Remote

Grafana, 1K-5K, IT Services

Grafana Labs is hiring a Senior Software Engineer for its Managed Services team to run and improve the shared multi-cloud infrastructure powering Grafana Cloud’s database products.

United Kingdom, Full-time, Senior, Database Administrator, Site Reliability Engineer (SRE)

$117k-$141k

AWS, Azure, Cassandra, ClickHouse, GCP, Go, Grafana, Helm, Kafka, Kubernetes, Linux, PostgreSQL, Snowflake, Terraform

2 hours, 10 minutes ago

Senior Software Engineer - Grafana Databases, Managed Services | Ireland | Remote

Grafana, 1K-5K, IT Services

Grafana Labs is hiring a Senior Software Engineer for its Managed Services team to run and improve the shared database and streaming infrastructure behind Grafana Cloud’s next-generation observability products.

Ireland, Full-time, Senior, Site Reliability Engineer (SRE)

$111k-$134k

AWS, Azure, Cassandra, ClickHouse, GCP, Go, Grafana, Helm, Kafka, Kubernetes, Linux, PostgreSQL, Snowflake, Terraform

2 hours, 25 minutes ago
