Civica

Civica

Civica is a global leader in public sector software, providing digital solutions and managed services to transform customer experience and operational efficiency for over 3,000 organizations worldwide.

Internet Software & Services
1K-5K
Founded 2002

Description

  • Architect, implement, and continuously improve data center and cloud environments across AWS, Azure, and VMware.
  • Ensure platform reliability, performance, and security meet service-level agreements and scale with demand.
  • Build and evolve infrastructure as code and CI/CD pipelines to release features safely and efficiently.
  • Partner with teams to define, measure, and improve SLIs and SLOs.
  • Implement real-time observability and proactively identify risks before they affect users.
  • Own the on-call rota and lead incident response for production issues.
  • Coach teams through blameless post-mortems and drive continuous improvement after outages.
  • Collaborate with principal engineers, developers, product teams, and security teams on platform roadmaps and controls.
  • Mentor engineers through pairing, brown-bag sessions, and reliability best-practice evangelism.
  • Embed security controls into CI/CD, runtime environments, and disaster-recovery planning.

Requirements

  • Demonstrable experience in a production SRE, DevOps, or infrastructure role, ideally in a SaaS or large-scale web environment.
  • Expertise in at least one public cloud platform: AWS, Azure, or GCP.
  • Experience designing hybrid migrations from on-premises infrastructure to cloud.
  • Strong coding, scripting, and troubleshooting skills in Go, .NET, Java, Python, or similar.
  • Proven experience with infrastructure as code tools such as Terraform or CloudFormation.
  • Experience with container orchestration platforms such as Kubernetes, ECS, AKS, or OpenShift.
  • Experience with virtual machine orchestration, provisioning, and resiliency tools such as KubeVirt, Packer, or Ansible.
  • Deep understanding of monitoring, logging, and tracing tools such as Prometheus/Grafana, ELK/OpenSearch, or Jaeger.
  • Excellent communication skills and experience working in cross-functional teams.
  • Passion for building reusable, tested libraries and tooling.

Benefits

  • 25 days of annual leave plus bank holidays, with the option to buy up to 10 extra days.
  • Up to 3 additional days off for volunteering through the Days of Difference program.
  • 5% employer pension match.
  • Income protection covering up to 75% of salary for long-term illness.
  • Life assurance providing a 4x salary tax-free lump sum.
  • Critical illness cover of £25,000, extendable to dependents.
  • Private medical insurance, health cash plan, and dental insurance.
  • Electric vehicle and hybrid vehicle scheme.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Site Reliability Engineer

Alpaca 51-250 Capital Markets

Alpaca is hiring a Site Reliability Engineer to keep its brokerage platform reliable and operable across cloud, Kubernetes, observability, messaging, and database systems, with a strong focus on PostgreSQL reliability on the trading-critical path.

DNS GitOps Go Kafka Kubernetes Linux Load Balancing PostgreSQL Python RabbitMQ Secrets Management TLS
1 hour, 29 minutes ago

Site Reliability Engineer

Kaseya 1K-5K IT Services

Kaseya is hiring a Site Reliability Engineer to own the reliability, automation, and production stability of its AWS-based services used by thousands of MSPs worldwide.

Ansible AWS Chef CloudFormation Datadog DevSecOps Elasticsearch Kibana Kubernetes MySQL PostgreSQL Puppet Secrets Management Serverless Terraform
5 hours, 29 minutes ago

SRE - DevOps Engineer - Argentina

Coderio 51-250 Internet Software & Services

Coderio is hiring a remote DevOps/SRE Engineer in Argentina to ensure the stability, scalability, and efficient operation of the infrastructure that supports its global digital solutions.

Argo CD CI/CD Flux GitHub Actions GitOps Helm Jenkins Kubernetes OpenShift Terraform
9 hours, 9 minutes ago

Senior Site Reliability Engineer

Cribl 251-1K IT Services

Cribl is hiring a Senior Site Reliability Engineer in Poland to help build and operate the telemetry infrastructure and observability platform that supports its cloud products and enterprise customers.

Ansible AWS Azure CI/CD Grafana JavaScript Kibana Linux New Relic Node.js PagerDuty Prometheus Splunk Terraform TypeScript
16 hours, 42 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers