Sr. Site Reliability Engineer (Starshield)

12 hours, 32 minutes ago
Full-time
Senior
DevOps and Infrastructure
SpaceX

SpaceX

SpaceX designs, manufactures, and launches advanced rockets and spacecraft with the aim of revolutionizing space technology and enabling human life on other planets.

Aerospace & Defense
10K-50K
Founded 2002

Description

  • Develop automation to deploy and manage compute resources on-premises and in the cloud.
  • Deploy and manage core infrastructure, including databases, monitoring systems, and storage.
  • Collaborate closely with software engineers to create scalable, operable, and maintainable products.
  • Engage in the full service lifecycle from inception and design through deployment, operation, and refinement.
  • Support the development of highly reliable in-space mesh networks and secure software systems.
  • Contribute to tools that improve engineering efficiency and help build secure, reliable, and autonomous systems.
  • Provide development, testing, and operational support across the software lifecycle.

Requirements

  • Bachelor’s degree in computer science, information systems/IT, or an engineering discipline with 5+ years of professional Linux experience, or 7+ years of experience in software, DevOps, or site reliability engineering in lieu of a degree.
  • 5+ years of experience with Kubernetes.
  • 5+ years of experience with Linux operating systems.
  • Experience with Bash, Python, and/or other scripting languages.
  • 5+ years of experience with Python and Python-based development frameworks preferred.
  • Experience managing Kubernetes clusters, not just using them, preferred.
  • Knowledge of the Linux boot process and systems configuration preferred.
  • Deep understanding of testing, continuous integration, build, deployment, and continuous monitoring preferred.
  • Understanding of build technologies such as Bazel and Makefiles preferred.
  • Experience with infrastructure automation tools such as Terraform or Ansible preferred.
  • Strong networking knowledge of TCP/IP preferred.
  • Excellent communication skills with the ability to work with customers, peers, and management in formal and informal settings preferred.
  • Active Top Secret, Top Secret SCI, or DOE Level Q clearance preferred.
  • Must be willing to work extended hours and weekends as needed.
  • Must be able to successfully obtain and maintain a Top Secret Security Clearance.
  • Must meet ITAR eligibility requirements as a U.S. citizen/national, lawful permanent resident, refugee, asylee, or otherwise eligible for required U.S. Department of State authorizations.

Benefits

  • Pay range of $165,000 to $230,000 for Level 3.
  • Eligibility for long-term incentives in the form of company stock or long-term cash awards.
  • Potential discretionary bonuses.
  • Ability to purchase additional stock at a discount through an Employee Stock Purchase Plan.
  • Comprehensive medical, vision, and dental coverage.
  • 401(k) retirement plan.
  • Short- and long-term disability insurance and life insurance.
  • Paid parental leave, 3 weeks of paid vacation, 10 or more paid holidays per year, and paid sick leave.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Senior Site Reliability Engineer

DexCare 51-250 Health Care Providers & Services

DexCare is hiring a Senior Site Reliability Engineer to help operate and improve its AWS-based healthcare infrastructure that supports digital care access and reliable patient service delivery.

Agile AWS Azure CI/CD Datadog EC2 GitHub Actions Helm HIPAA JIRA Kubernetes Python Scrum Serverless Terraform
13 hours, 17 minutes ago

Data Center Reliability Engineer

Phaidra 51-250 Internet Software & Services

Phaidra is hiring a Data Center Reliability Engineer to translate data center telemetry into operational intelligence for its AI-powered monitoring and control systems.

GitLab LLM Machine Learning NumPy Pandas Python Reinforcement Learning
1 day, 12 hours ago

Senior Site Reliability Engineer

Accenture 100K+ Professional Services

Accenture Federal Services is hiring a Site Reliability Engineer to improve the reliability, performance, and scalability of a client system supporting US federal mission operations.

1 day, 13 hours ago

Senior Site Reliability Engineer (Remote Build)

Remote 251-1K Professional Services

Remote is hiring a Senior Site Reliability Engineer for Remote Build to own the reliability, security, and operational strategy behind its global employment infrastructure platform.

AWS Bash CI/CD Datadog Elixir GitHub Actions GitLab Go Grafana Java Jenkins Kubernetes Linux Microservices Node.js Prometheus Python Terraform
2 days, 12 hours ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers