Sr. Site Reliability Engineer (Starshield)

13 hours, 3 minutes ago
Full-time
Senior
DevOps and Infrastructure
SpaceX

SpaceX

SpaceX designs, manufactures, and launches advanced rockets and spacecraft with the aim of revolutionizing space technology and enabling human life on other planets.

Aerospace & Defense
10K-50K
Founded 2002

Description

  • Develop automation to deploy and manage compute resources on-premises and in the cloud.
  • Deploy and manage core infrastructure including databases, monitoring systems, and storage.
  • Collaborate closely with software engineers to build scalable, operable, and maintainable products.
  • Improve the full service lifecycle from design and inception through deployment, operation, and refinement.
  • Support the development and operation of secure, reliable, and autonomous software systems.
  • Contribute to tools and infrastructure that improve engineering efficiency and operational support.

Requirements

  • Bachelor’s degree in computer science, information systems/IT, or an engineering discipline with 5+ years of professional Linux experience; or 7+ years of experience in software, DevOps, or site reliability engineering in lieu of a degree.
  • 5+ years of experience with Kubernetes.
  • 5+ years of experience with Linux operating systems.
  • Experience with Bash, Python, and/or other scripting languages.
  • 5+ years of experience with Python and Python-based development frameworks preferred.
  • Experience managing Kubernetes clusters, not just using them, preferred.
  • Knowledge of Linux boot process and systems configuration preferred.
  • Deep understanding of testing, continuous integration, build, deployment, and continuous monitoring preferred.
  • Understanding of build technologies such as Bazel and Makefiles preferred.
  • Experience managing dozens, hundreds, or thousands of servers using tools such as Terraform or Ansible preferred.
  • Strong networking knowledge of TCP/IP preferred.
  • Excellent communication skills with the ability to communicate with customers, peers, and management in formal and informal settings preferred.
  • Active Top Secret, Top Secret SCI, or DOE Level Q clearance preferred.
  • Must be willing to work extended hours and weekends as needed.
  • Must be able to successfully obtain and maintain a Top Secret Security Clearance as a condition of employment.
  • Must meet ITAR requirements as a U.S. citizen, national, lawful permanent resident, refugee, asylee, or otherwise eligible for required authorizations from the U.S. Department of State.

Benefits

  • Pay range of $165,000 to $230,000 for Level 3.
  • Eligible for long-term incentives such as company stock or long-term cash awards.
  • Potential discretionary bonuses and employee stock purchase plan access with discounted stock purchases.
  • Comprehensive medical, vision, and dental coverage.
  • Access to a 401(k) retirement plan.
  • Short- and long-term disability insurance plus life insurance.
  • Paid parental leave.
  • Approximately 3 weeks of paid vacation and 10 or more paid holidays per year.
  • Paid sick leave in accordance with company policy and applicable law.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Senior Site Reliability Engineer

DexCare 51-250 Health Care Providers & Services

DexCare is hiring a Senior Site Reliability Engineer to help operate and improve its AWS-based healthcare infrastructure that supports digital care access and reliable patient service delivery.

Agile AWS Azure CI/CD Datadog EC2 GitHub Actions Helm HIPAA JIRA Kubernetes Python Scrum Serverless Terraform
13 hours, 18 minutes ago

Data Center Reliability Engineer

Phaidra 51-250 Internet Software & Services

Phaidra is hiring a Data Center Reliability Engineer to translate data center telemetry into operational intelligence for its AI-powered monitoring and control systems.

GitLab LLM Machine Learning NumPy Pandas Python Reinforcement Learning
1 day, 12 hours ago

Senior Site Reliability Engineer

Accenture 100K+ Professional Services

Accenture Federal Services is hiring a Site Reliability Engineer to improve the reliability, performance, and scalability of a client system supporting US federal mission operations.

1 day, 13 hours ago

Senior Site Reliability Engineer (Remote Build)

Remote 251-1K Professional Services

Remote is hiring a Senior Site Reliability Engineer for Remote Build to own the reliability, security, and operational strategy behind its global employment infrastructure platform.

AWS Bash CI/CD Datadog Elixir GitHub Actions GitLab Go Grafana Java Jenkins Kubernetes Linux Microservices Node.js Prometheus Python Terraform
2 days, 12 hours ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers