Sr. Site Reliability Engineer

10 hours, 8 minutes ago
Full-time
Senior
DevOps and Infrastructure
SpaceX

SpaceX

SpaceX designs, manufactures, and launches advanced rockets and spacecraft with the aim of revolutionizing space technology and enabling human life on other planets.

Aerospace & Defense
10K-50K
Founded 2002

Description

  • Develop automation to deploy and manage compute resources on-premises and in the cloud.
  • Deploy and manage core infrastructure including databases, monitoring systems, and storage.
  • Collaborate closely with software engineers to build scalable, operable, and maintainable products.
  • Own the full service lifecycle from inception and design through deployment, operation, and refinement.
  • Support the development of secure, reliable, and autonomous software systems for satellite programs.
  • Build tools that improve team efficiency and software delivery.

Requirements

  • Bachelor’s degree in computer science, information systems/IT, or an engineering discipline plus 5+ years of professional experience with Linux operating systems, or 7+ years of experience in software, DevOps, or site reliability engineering in lieu of a degree.
  • 5+ years of experience with Kubernetes.
  • 5+ years of experience with Linux operating systems.
  • Experience with Bash, Python, and/or other scripting languages.
  • 5+ years of experience with Python and Python-based development frameworks, preferred.
  • Experience managing Kubernetes clusters, not just using them, preferred.
  • Knowledge of the Linux boot process and systems configuration, preferred.
  • Deep understanding of testing, continuous integration, build, deployment, and continuous monitoring, preferred.
  • Understanding of build technologies such as Bazel and Makefiles, preferred.
  • Experience managing dozens, hundreds, or thousands of servers with tools such as Terraform or Ansible, preferred.
  • Strong networking knowledge of TCP/IP, preferred.
  • Active Top Secret, Top Secret SCI, or DOE Level Q clearance, preferred.
  • Must be willing to work extended hours and weekends as needed.
  • Must be able to obtain and maintain a Top Secret Security Clearance as a condition of employment.
  • Must meet ITAR eligibility requirements as a U.S. citizen/national, lawful permanent resident, refugee, asylee, or otherwise authorized by the U.S. Department of State.

Benefits

  • Base salary range of $165,000 to $230,000 for Level 3.
  • Eligibility for long-term incentives in the form of company stock or long-term cash awards.
  • Potential discretionary bonuses and participation in an Employee Stock Purchase Plan.
  • Comprehensive medical, vision, and dental coverage.
  • 401(k) retirement plan.
  • Short- and long-term disability insurance, plus life insurance.
  • Paid parental leave.
  • Approximately 3 weeks of paid vacation and 10 or more paid holidays per year.
  • Paid sick leave in accordance with company policy.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Senior SRE - Platform (Managed Kubernetes Infrastructure)

Elastic 1K-5K Internet Software & Services

Elastic is hiring a Site Reliability Engineer on its Platform Engineering team to design and operate the multi-cloud platform that hosts Elastic Cloud services and supports rapid, reliable product delivery.

Docker Go InfluxDB Kubernetes Linux Prometheus Terraform
9 hours, 23 minutes ago

Site Reliability Engineer

Dropbox 1K-5K Internet Software & Services

Dropbox is hiring a Corporate Site Reliability Engineer to lead infrastructure reliability, observability, automation, and security for its IT Services environment.

Ansible AWS Bash Chef Datadog DHCP DNS Docker EC2 GitHub GitHub Actions GitOps Kubernetes Linux Python REST API Serverless Terraform Ubuntu WAF
9 hours, 38 minutes ago

Senior Observability Engineer

Ensono 1K-5K IT Services

Ensono is hiring an observability and monitoring engineer to operate and improve hybrid cloud monitoring platforms for enterprise clients, with the goal of delivering real-time visibility, reliable alerting, and compliant monitoring operations.

Ansible AWS Azure Bash Datadog GCP JavaScript Kubernetes Python Terraform
10 hours, 8 minutes ago

Senior SRE Engineer (Observability Focus)

Capital.com 251-1K Capital Markets

Senior SRE Engineer at a leading trading platform, owning the company’s observability practice end to end for a hybrid AWS and on-prem production environment.

Ansible Argo CD AWS Bash Elasticsearch Fluentd GitOps Grafana Helm Java JavaScript Kafka Kubernetes OpenSearch OpenTelemetry Prometheus Python Terraform TypeScript
10 hours, 8 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers