Anduril Industries

Anduril Industries is an American defense technology firm that specializes in developing advanced autonomous systems for integrated awareness and security across land, sea, and air, utilizing its proprietary Lattice platform to enhance intelligence, su...

Aerospace & Defense
1K-5K
Founded 2017
$2.2B raised

Description

  • Own the full lifecycle of core self-hosted developer tools used by the engineering organization.
  • Design and implement automation for patching, validated backups, and upgrades.
  • Scale infrastructure to support a fast-growing engineering organization.
  • Manage environments using Infrastructure-as-Code with Terraform.
  • Operate and troubleshoot systems across Docker, Kubernetes, and cloud platforms.
  • Define and maintain service-level objectives for availability, reliability, and performance.
  • Build and maintain monitoring, alerting, and observability for developer tool services.
  • Lead incident response and root cause analysis for service issues.
  • Collaborate cross-functionally with platform, security, infrastructure, and software teams.
  • Help expand SRE capabilities for on-prem systems as the infrastructure footprint grows.

Requirements

  • Experience operating production systems using Docker and Kubernetes.
  • Proficiency with at least one cloud platform: AWS, GCP, or Azure.
  • Experience managing infrastructure with Infrastructure-as-Code tools such as Terraform.
  • Strong problem-solving skills with a focus on automation.
  • Scripting or software development experience in Python, Go, or Bash.
  • Familiarity with CI/CD pipelines and developer tooling.
  • Ability to own systems end-to-end, from design through incident resolution.
  • Eligibility to obtain and maintain an active U.S. Secret security clearance.
  • Prior experience with GitHub Enterprise Server, JFrog Artifactory/Xray, or CircleCI is preferred.
  • Experience maintaining highly available, scalable internal tools is preferred.
  • Exposure to security best practices, compliance requirements, or auditing is preferred.
  • Experience supporting large, rapidly scaling engineering organizations is preferred.
  • Experience with monitoring and observability platforms such as Datadog, Prometheus, or Grafana is preferred.
  • Background in SRE or hybrid SWE/DevOps roles is preferred.
  • Experience with on-prem infrastructure operations, reliability, or capacity planning is preferred.

Benefits

  • US salary range of $146,000 to $194,000.
  • Highly competitive equity grants included in the majority of full-time offers.
  • Top-tier benefits for full-time employees.
  • Comprehensive benefits package available at little to no cost to employees.
  • Support for health and recovery needs.
  • Security-focused hiring and candidate screening processes that protect applicant information.
