Civica

Civica

Civica is a global leader in public sector software, providing digital solutions and managed services to transform customer experience and operational efficiency for over 3,000 organizations worldwide.

Internet Software & Services
1K-5K
Founded 2002

Description

  • Monitor live environments using observability tools and respond to production alerts and incidents.
  • Triage issues quickly and coordinate with SRE and Platform teams to restore service.
  • Automate environment builds, deployments, and routine operational tasks to reduce manual work.
  • Support and maintain cloud and infrastructure environments across Azure, AWS, and VMware.
  • Work with containerised workloads and contribute to scaling and performance improvements.
  • Drive root cause analysis and preventative actions following incidents.
  • Refine alerting thresholds, deployment processes, and monitoring coverage.
  • Maintain runbooks and operational documentation to ensure knowledge is shared consistently.
  • Collaborate with engineers, support, and services teams on live issue resolution and operational improvements.

Requirements

  • Experience operating production systems in cloud or hybrid environments such as Azure, AWS, or similar.
  • Familiarity with Kubernetes, containerisation, and supporting tools such as Helm and ingress controllers.
  • Basic understanding of networking and infrastructure fundamentals, including DNS, load balancing, VPNs, and firewalls.
  • Ability to troubleshoot infrastructure issues, including using packet capture tools (pcap).
  • Hands-on experience with scripting or automation using PowerShell, Bash, Go, or Python.
  • Knowledge of CI/CD pipelines and version control tools such as GitHub Actions, Azure DevOps, or Jenkins.
  • Experience with monitoring and alerting tools such as Prometheus, Grafana, DataDog, Elastic, or Azure Monitor.
  • Strong analytical and problem-solving skills with the ability to stay calm during incidents.
  • Collaborative communicator who thrives in cross-functional, fast-paced environments.

Benefits

  • 25 days of annual leave plus bank holidays, with the option to buy up to 10 extra days.
  • Up to 3 additional days off for volunteering through the Days of Difference program.
  • 5% employer pension match.
  • Income protection covering up to 75% of salary for long-term illness.
  • Life assurance equal to 4x salary as a tax-free lump sum.
  • Critical illness cover of £25,000, extendable to dependents.
  • Private medical insurance, health cash plan, and dental insurance.
  • Employee affinity groups and a referral bonus for recommending a friend.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Senior Database Reliability Engineer

PointClickCare 1K-5K Health Care Providers & Services

PointClickCare is hiring a Senior Database Reliability Engineer to manage and improve the cloud database infrastructure behind its mission-critical SaaS platform.

Ansible AWS Azure C# Databricks GCP Git Grafana InfluxDB JIRA MySQL PostgreSQL PowerShell Python SQL SQL Server Terraform
30 minutes ago

Site Reliability Engineer

SwissBorg 51-250 Capital Markets

SwissBorg is hiring a Site Reliability Engineer to support and scale its cloud infrastructure and operations for a fast-growing crypto investment platform.

Ansible Argo CD AWS CI/CD DNS Git GitLab GitOps Grafana Kafka Kubernetes OpenSearch OpenTelemetry PostgreSQL Prometheus Terraform
45 minutes ago

Staff Platform Site Reliability Specialist (Observability & Kubernetes)

Everbridge 1K-5K Internet Software & Services

Everbridge is hiring a Staff Platform Site Reliability Specialist to own and evolve its enterprise observability platform and Kubernetes environment across a large-scale cloud-native infrastructure.

AWS GCP Grafana Kubernetes Terraform
45 minutes ago

Staff Platform Site Reliability Specialist (Observability & Kubernetes) (copy)

Everbridge 1K-5K Internet Software & Services

Everbridge is hiring a Staff Platform Site Reliability Specialist to own and evolve its enterprise observability platform and Kubernetes environment across a large-scale cloud-native AWS and GCP infrastructure.

AWS GCP Grafana Kubernetes Terraform
2 hours, 30 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers