Senior IaaS / Kubernetes Platform Engineer (worldwide remote, work anywhere)

4 weeks ago
Full-time
Senior
DevOps and Infrastructure
CloudLinux

CloudLinux

CloudLinux is a leading provider of the CloudLinux OS, a platform for Linux web hosting that offers next-level performance and security. With a focus on optimizing web hosting environments, CloudLinux helps service providers improve density, stability,...

IT Services
51-250
Founded 2009

Description

  • Design, build, and operate a multi-tenant Kubernetes platform using Cluster API and bare-metal providers.
  • Implement hard multi-tenancy with vCluster or similar isolation technology for tenant-specific Kubernetes environments.
  • Deploy and manage KubeVirt for VM orchestration, including CPU pinning, NUMA awareness, and HugePages configuration.
  • Run GitOps-driven infrastructure management with ArgoCD or Flux as the source of truth for cluster configuration.
  • Deploy policy-as-code controls with Kyverno or OPA Gatekeeper for admission, quotas, and security enforcement.
  • Build self-service infrastructure provisioning workflows using Crossplane or similar Kubernetes-native tools.
  • Operate and optimize Ceph storage clusters and manage Rook-Ceph deployments at scale.
  • Design storage tiering and per-VM/per-tenant I/O isolation across Ceph, NVMe, LINSTOR/DRBD, or TopoLVM storage layers.
  • Deploy and maintain overlay networking, micro-segmentation, encrypted connectivity, and multi-datacenter cluster networking.
  • Work with physical network infrastructure including Juniper switches, BGP, EVPN/VXLAN, VLANs, and site-to-site IPsec connectivity.
  • Maintain SRE practices including SLOs, capacity forecasting, chaos experiments, on-call support, runbooks, DRPs, and postmortems.
  • Automate infrastructure provisioning and lifecycle management with Terraform/OpenTofu, Ansible, PXE, Foreman, and IPMI.
  • Implement FinOps practices for cost attribution, utilization analysis, and right-sizing recommendations.

Requirements

  • 5+ years of experience in infrastructure or platform engineering roles.
  • At least 3 years of production Kubernetes platform experience, not just application deployment.
  • Production experience with at least 3 of the following: KubeVirt, Cluster API, Cilium or Calico, Rook-Ceph, ArgoCD or Flux.
  • Deep Linux systems knowledge, including kernel tuning, networking stacks, filesystem operations, and performance troubleshooting.
  • Ceph distributed storage experience, including cluster operations, OSD lifecycle, pool management, tuning, and degraded-state troubleshooting.
  • Infrastructure as Code experience with Terraform/OpenTofu and Ansible at production scale.
  • Bare-metal infrastructure experience with IPMI/iDRAC, PXE boot, RAID configuration, hardware diagnostics, and datacenter operations.
  • Networking fundamentals including BGP, VLAN, IPSec/WireGuard, DNS, and load balancing.
  • Strong written and verbal English communication skills at B2+ level.
  • A proactive mindset with demonstrated experience identifying and resolving problems before they become incidents.
  • Experience building multi-tenant Kubernetes platforms with vCluster, Capsule, or custom namespace isolation is preferred.
  • Crossplane or similar Kubernetes-native infrastructure abstraction experience is preferred.
  • Policy-as-Code experience with Kyverno, OPA Gatekeeper, or Kubewarden is preferred.
  • Container security experience with Sigstore/cosign, Falco, Kata Containers, or gVisor is preferred.
  • SRE experience with SLO/SLI design, error budgets, chaos engineering, or incident management frameworks is preferred.
  • FinOps experience with OpenCost, Kubecost, or cloud cost optimization is preferred.
  • Experience with immutable operating systems such as Talos Linux or Flatcar Container Linux is preferred.
  • OpenNebula experience is preferred.
  • Experience with LINSTOR/DRBD or TopoLVM for high-performance local storage is preferred.
  • SR-IOV and DPDK experience for hardware-accelerated networking is preferred.
  • Experience migrating from VMware, OpenNebula, or Proxmox to Kubernetes/KubeVirt is preferred.
  • Experience with the Grafana LGTM stack, compliance environments, Go or Python tooling, or Juniper JunOS is preferred.

Benefits

  • Fully remote work with flexible working hours from any location worldwide.
  • Paid 24 days of vacation per year.
  • 10 days of national holidays.
  • Unlimited sick leave.
  • Private medical insurance reimbursement.
  • Co-working and gym/sports reimbursement.
  • Budget for education and professional development.
  • Opportunity to receive a reward for an innovative idea the company can patent.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Licensed Civil Engineer - Data Center

Olsson 1K-5K Construction & Engineering

Olsson is hiring a Licensed Civil Engineer to support its Data Center Civil team on large hyperscale and colocation data center projects across the U.S., with a focus on designing critical infrastructure for complex engineering-driven developments.

1 hour, 5 minutes ago

Sr. Data Center Engineer II (6384)

MetroStar 251-1K IT Services

MetroStar is hiring a Sr. Data Center Engineer II to design and sustain secure, high-availability data center infrastructure supporting mission-critical federal government operations.

Agile
6 hours, 11 minutes ago

Senior AI Platform Engineer

Wellhub 1-10 Gas Utilities

Wellhub is hiring a Senior AI Platform Engineer in Brazil to help build and evolve the cloud-native ML development platform that enables engineers and data scientists to develop and deploy AI at scale.

Apache Spark AWS CI/CD Kubeflow Kubernetes MLOps Python Terraform
6 hours, 25 minutes ago

Database Administrator - Cloud Platform / Infrastructure

3Cloud 251-1K Internet Software & Services

3Cloud is seeking an experienced Database Administrator to support multiple customer database migration and Azure data services projects across development, test, and production environments.

Azure Oracle SQL Server Terraform
8 hours, 26 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers