Capital.com

Capital.com

Capital.com is a leading fintech company providing online trading services through a smart investment app, offering access to 3700+ global markets with AI-powered features for secure and efficient trading.

Capital Markets
251-1K
Founded 2016
$25M raised

Description

  • Design, deploy, and maintain scalable cloud infrastructure on AWS with high availability, performance, and security.
  • Own and evolve Kubernetes cluster management, including bare-metal deployments, and support reliable containerised workloads with Docker and Helm.
  • Build and maintain CI/CD pipelines using GitLab CI and GitOps workflows with FluxCD or ArgoCD.
  • Define, manage, and review Infrastructure as Code using Terraform.
  • Lead monitoring and observability efforts, including dashboards, alerting, and log pipelines with VictoriaMetrics/Prometheus, Grafana, and the ELK stack.
  • Operate and optimize Apache Kafka ecosystems, including Strimzi, Kafka Connect, and MirrorMaker.
  • Drive incident response, root cause analysis, and post-mortem practices to improve reliability.
  • Collaborate with Engineering, Security, and Product teams to embed DevOps best practices across the organisation.
  • Mentor and guide junior engineers to raise the engineering bar for infrastructure reliability and automation.

Requirements

  • 6+ years of hands-on experience in a DevOps or SRE role.
  • Strong knowledge of AWS services, including VPC, EC2, EKS, S3, ECR, EBS, RDS, ElastiCache, IAM, KMS, Secrets Manager, SSM Parameter Store, CloudWatch, MSK, SNS, SQS, Route 53, Direct Connect, Transit Gateway, and ELB/ALB/NLB.
  • Solid Linux administration skills with a deep understanding of system internals.
  • Deep expertise in Kubernetes, including bare-metal cluster deployment and day-2 operations.
  • Proficiency with Docker and Helm.
  • Hands-on experience with Terraform as a primary Infrastructure as Code tool, including writing, reviewing, and maintaining production-grade modules.
  • Proven experience with GitLab CI for building and maintaining CI/CD pipelines; familiarity with GitOps practices using FluxCD or ArgoCD.
  • Strong background in monitoring and observability with VictoriaMetrics or Prometheus, Grafana, and the ELK stack.
  • Experience operating and managing Apache Kafka ecosystems, including Strimzi, Kafka Connect, and MirrorMaker.
  • Experience with Ansible for configuration management; AWX experience is a plus.
  • Proficiency in scripting and automation with Bash, Python, and Go.
  • Strong communication skills and the ability to collaborate cross-functionally in a fast-paced, regulated environment.
  • English language proficiency.

Benefits

  • Competitive salary.
  • Hybrid work arrangement with flexibility to work remotely.
  • Generous annual leave.
  • Employee referral program.
  • Comprehensive health and pension benefits, including medical insurance and pension plans.
  • 30 extra days per year to work remotely from anywhere in the world, subject to restrictions.
  • Two additional paid volunteer days each year.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

(Senior) Fullstack Software Engineer

Emma Specialty Retail

Emma – The Sleep Company is hiring a Senior Fullstack Software Engineer to help build and operate the technology platform behind its global online store and fulfilment operations.

Agile AWS CI/CD Design Systems Docker E-commerce Express.js Fastify Git Grafana Kafka Next.js Node.js Nuxt.js React REST API TypeScript Vue.js
6 hours, 30 minutes ago

Site Reliability Engineer - Canada Wide - Remote

Newton 51-250 Capital Markets

Newton is hiring a remote Site Reliability Engineer across Canada to improve the reliability, resilience, and operational readiness of its crypto trading platform.

AWS Java JavaScript Python
6 hours, 45 minutes ago

Junior DevSecOps Engineer

European Dynamics 251-1K IT Services

European Dynamics is hiring a Junior DevSecOps Engineer to support cloud infrastructure, automation, and CI/CD operations in a hands-on engineering environment.

AWS Azure Bash CI/CD DNS Docker GCP Git GitHub Actions Grafana HTTP Jenkins Kubernetes Linux Prometheus Python Terraform Unix
7 hours ago

Site Reliability Engineer - India

Zimperium 251-1K Professional Services

Zimperium is hiring a Senior Site Reliability Engineer in India to improve the reliability, automation, and scalability of its mobile security production systems and applications.

CI/CD Datadog Docker Java Kubernetes Linux Python Unix
7 hours ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers