Sr. Site Reliability Engineer III (6448)

1 day, 7 hours ago
Full-time
Senior
Software Development
MetroStar

MetroStar builds innovative technology solutions designed to enhance and accelerate the missions of government agencies, leveraging a rich legacy of expertise in the digital age.

IT Services
251-1K employees
Founded 1999

Description

  • Design, deploy, and maintain mission-critical application workloads in virtualized or containerized environments such as VMware or Kubernetes.
  • Develop and sustain automated CI/CD pipelines, monitoring, and configuration management workflows across development, integration, staging, and production environments.
  • Provision, configure, and maintain developer environments and toolchains that support secure and efficient software delivery.
  • Identify friction across the software development lifecycle and implement improvements that reduce developer pain points.
  • Establish and maintain customer trust through technical expertise and mission-aligned problem solving.
  • Support reliable software delivery and operational observability for highly available production systems.
  • Participate in incident response, root cause analysis, and continuous improvement activities.
  • Maintain solutions that meet government compliance and continuity-of-operations requirements.

Requirements

  • Active Top Secret clearance or higher.
  • Certification meeting DoD 8140 requirements, such as Security+ or higher.
  • Bachelor’s degree in Computer Science or a related engineering field preferred; relevant experience may substitute.
  • 7+ years of experience in software development, systems engineering, or operations roles focused on production availability, performance, and reliability.
  • Demonstrated experience combining software engineering and systems administration practices for scalable, highly available applications.
  • Experience designing and managing monitoring, alerting, and observability solutions to meet Service Level Objectives.
  • Experience with incident response, root cause analysis, and continuous improvement activities.
  • Experience with Ansible and Desired State Configuration.
  • Experience with GitLab CI/CD automation and Bash scripting.
  • Experience with Kubernetes and container-native or object storage solutions such as MinIO, S3-compatible services, or Portworx.
  • Experience with enterprise load-balancing solutions such as F5 or similar platforms.
  • Ability to contribute immediately with minimal ramp-up in a mission-critical operational environment.
  • Essential personnel status; may be required to work during government shutdowns, emergencies, or other critical situations.

Benefits

  • Salary range of $185,000 to $230,000.
  • Eligible for performance-based bonuses and other additional compensation.
  • Company-paid training and/or certifications.
  • Referral bonuses.
  • Health, dental, and vision insurance.
  • 401(k) retirement plan with company match.
  • Paid time off and holidays.
  • Parental leave and dependent care.
  • Flexible work arrangements.
  • Professional development opportunities.
  • Employee assistance and wellness programs.
