SpaceX

SpaceX designs, manufactures, and launches advanced rockets and spacecraft with the aim of revolutionizing space technology and enabling human life on other planets.

Aerospace & Defense
10K-50K
Founded 2002

Description

  • Install, manage, scale, and optimize Kubernetes and RKE clusters in production using Ansible, Terraform, and related technologies.
  • Collaborate with SpaceX engineers to gather requirements, research options, design solutions, plan deployments, and support Kubernetes-based software platforms.
  • Build highly resilient, high-performance, scalable, and robust systems for demanding engineering teams.
  • Recommend, justify, and implement infrastructure improvements through formal change control processes.
  • Work with internal business units to design creative solutions and resolve problems proactively.
  • Define, document, and enforce standards and best practices for system design, testing, and implementation.
  • Foster collaboration and cross-training to grow Kubernetes expertise across the team.
  • Drive scripting, self-service, and automation to reduce administrative overhead and toil.
  • Participate in on-call rotation to respond to urgent after-hours issues when needed.

Requirements

  • Bachelor’s degree in Computer Science or a STEM discipline and 5+ years of systems engineering experience, or 7+ years of systems engineering experience in lieu of a degree.
  • Experience deploying and supporting Linux servers in physical and virtualized environments, such as VMware, using automation.
  • Experience with the Linux shell and configuring/extending Linux instances, including kernel modules, cgroups, PKI, iptables, and network interfaces.
  • Experience supporting and scaling containerized applications in Linux environments.
  • Experience using automation frameworks such as Ansible and Terraform to manage provisioning and post-provisioning lifecycles of infrastructure and Kubernetes installations.
  • Experience creating repeatable, reliable, scalable systems architectures with high availability, fault tolerance, performance tuning, monitoring, and metrics collection (preferred).
  • Experience with source control tools such as Git and Subversion, and with Git-based collaboration workflows (preferred).
  • Strong understanding of Linux container runtimes (preferred).
  • Experience with Infrastructure as Code, CI/CD, and GitOps tools such as Ansible, AWX/Tower, Vagrant, Puppet, Redfish, Jenkins, cloud-init, and ArgoCD (preferred).
  • Experience writing test automation for backward compatibility in automation processes and Kubernetes deployments (preferred).
  • Experience with Python or Golang and integrating with RESTful APIs to implement automation (preferred).
  • Experience installing, configuring, and troubleshooting Kubernetes internals, CNI, CRI, and CSI plugins, load balancing, service mesh, and software-defined storage in cloud or on-premises environments (preferred).
  • Experience developing Kubernetes-based extensions such as webhooks, controllers, operators, and sidecars (preferred).
  • Experience implementing monitoring and alerting dashboards using Prometheus, Grafana, InfluxDB, or similar tools (preferred).
  • Experience with dynamic system configuration templating using Jinja, Jsonnet, YAML, and Helm (preferred).
  • Must be willing to work extended hours and weekends as needed.
  • Must meet ITAR export-control eligibility requirements as a U.S. citizen/national, lawful permanent resident, refugee, asylee, or otherwise eligible for required U.S. Department of State authorization.
