Capital.com

Capital.com

Capital.com is a leading fintech company providing online trading services through a smart investment app, offering access to 3700+ global markets with AI-powered features for secure and efficient trading.

Capital Markets
251-1K
Founded 2016
$25M raised

Description

  • Design, deploy, and maintain scalable cloud infrastructure on AWS with high availability, performance, and security.
  • Own and evolve Kubernetes cluster management, including bare-metal deployments, and support reliable containerised workloads with Docker and Helm.
  • Build and maintain CI/CD pipelines using GitLab CI and GitOps workflows with FluxCD or ArgoCD.
  • Define, manage, and review Infrastructure as Code using Terraform.
  • Lead monitoring and observability efforts, including dashboards, alerting, and log pipelines with VictoriaMetrics/Prometheus, Grafana, and the ELK stack.
  • Operate and optimize Apache Kafka ecosystems, including Strimzi, Kafka Connect, and MirrorMaker.
  • Drive incident response, root cause analysis, and post-mortem practices to improve reliability.
  • Collaborate with Engineering, Security, and Product teams to embed DevOps best practices across the organisation.
  • Mentor and guide junior engineers to raise the engineering bar for infrastructure reliability and automation.

Requirements

  • 6+ years of hands-on experience in a DevOps or SRE role.
  • Strong knowledge of AWS services, including VPC, EC2, EKS, S3, ECR, EBS, RDS, ElastiCache, IAM, KMS, Secrets Manager, SSM Parameter Store, CloudWatch, MSK, SNS, SQS, Route 53, Direct Connect, Transit Gateway, and ELB/ALB/NLB.
  • Solid Linux administration skills with a deep understanding of system internals.
  • Deep expertise in Kubernetes, including bare-metal cluster deployment and day-2 operations.
  • Proficiency with Docker and Helm.
  • Hands-on experience with Terraform as a primary Infrastructure as Code tool, including writing, reviewing, and maintaining production-grade modules.
  • Proven experience with GitLab CI for building and maintaining CI/CD pipelines; familiarity with GitOps practices using FluxCD or ArgoCD.
  • Strong background in monitoring and observability with VictoriaMetrics or Prometheus, Grafana, and the ELK stack.
  • Experience operating and managing Apache Kafka ecosystems, including Strimzi, Kafka Connect, and MirrorMaker.
  • Experience with Ansible for configuration management; AWX experience is a plus.
  • Proficiency in scripting and automation with Bash, Python, and Go.
  • Strong communication skills and the ability to collaborate cross-functionally in a fast-paced, regulated environment.
  • English language proficiency.

Benefits

  • Competitive salary.
  • Hybrid work arrangement with flexibility to work remotely.
  • Generous annual leave.
  • Employee referral program.
  • Comprehensive health and pension benefits, including medical insurance and pension plans.
  • 30 extra days per year to work remotely from anywhere in the world, subject to restrictions.
  • Two additional paid volunteer days each year.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Deployment Engineer

Armada 201-500 information technology & services

Armada is hiring a Deployment Engineer to execute field deployment, commissioning, startup, troubleshooting, and operational readiness for modular data center infrastructure across North America and global customer sites.

3 hours, 9 minutes ago

Staff Operations Engineer

Mozilla 251-1K Internet Software & Services

Mozilla is hiring a Staff Operations Engineer to lead the design, reliability, and evolution of hybrid-cloud and workplace infrastructure across teams.

Ansible DNS Linux Puppet Python TCP/IP Unix
3 hours, 39 minutes ago

Senior DevOps Engineer (V)

Blue Coding 51-250 Internet Software & Services

Blue Coding is hiring a Senior DevOps Engineer to help modernize and operate a cloud-native AWS platform for a remote LATAM-based client team, with a focus on migrating legacy Windows infrastructure and improving delivery and reliability practices.

Agile AWS Bitbucket CI/CD Confluence Docker JIRA .NET PowerShell Serverless SonarQube Terraform Windows Server
3 hours, 54 minutes ago

Principal Site Reliability Engineer (SRE)

Symmetrio Professional Services

Symmetrio is recruiting a Principal Site Reliability Engineer for a rapidly growing healthcare technology company to own the reliability, scalability, security, and performance of a mission-critical SaaS platform used by healthcare providers across the United States.

Active Directory AWS CI/CD Datadog Django Grafana Kubernetes Python Terraform Windows Server
3 hours, 54 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers