Veeam Software

Veeam Software is the global leader in Backup that delivers Modern Data Protection, offering solutions for virtual environments, enterprises, small businesses, and service providers worldwide.

Internet Software & Services

Information Technology

1K-5K (4300)

Founded 2006

$500M raised

133 open positions

Links

View All Jobs

GOV Site Reliability Engineer

1 month, 1 week ago

United States

Full-time

Junior

Site Reliability Engineer (SRE)

Software Development

Argo CD Azure C# ELK Stack GitHub Actions GitLab CI Go Grafana HIPAA Java JavaScript Kubernetes OpenTelemetry Prometheus Pulumi Terraform TypeScript

Apply Now

Veeam Software

Veeam Software is the global leader in Backup that delivers Modern Data Protection, offering solutions for virtual environments, enterprises, small businesses, and service providers worldwide.

Internet Software & Services

1K-5K

Founded 2006

$500M raised

View All Jobs 133

Description

Get up to speed on Veeam Data Cloud workloads, dependencies, and operational workflows by reading code, documentation, and working with subject matter experts.
Write and maintain runbooks, incident guides, onboarding materials, and other operational documentation.
Participate in incident response, including triage, investigation, mitigation, and postmortems.
Help implement and maintain service level indicators, service level objectives, and error budgets.
Identify reliability issues and propose concrete improvements during incidents and reviews.
Support high availability and fault tolerance work on Azure, including Azure Government.
Implement monitoring improvements by adding instrumentation, alerting, and dashboards.
Contribute to toil reduction through automation and tooling improvements.
Participate in on-call rotations.
Work with engineering, security, compliance, and operations teams to deliver reliability improvements.

Requirements

3+ years of experience in Software Engineering, including at least 1 year in SRE, Platform Engineering, or DevOps for cloud-hosted services.
Experience with cloud infrastructure on Azure or a comparable cloud provider.
Experience working in regulated or compliance-oriented environments such as government, financial, or healthcare.
Ability to read and understand code well enough to investigate system behavior independently.
Experience with monitoring and observability tools such as Prometheus, Grafana, OpenTelemetry, or the ELK stack.
Experience with IaC tools such as Terraform, Terragrunt, or Pulumi, and with Kubernetes.
Experience with CI/CD tools such as GitHub Actions, Azure DevOps, GitLab CI, or ArgoCD.
Strong programming skills in one or more of TypeScript/JavaScript, Go, Java, or C#, or similar languages.
Solid understanding of distributed systems fundamentals and networking basics.
Clear written and verbal communication skills.
Preferred: experience in Government or Sovereign Cloud environments such as Azure Government or AWS GovCloud.
Preferred: background in SaaS platforms or multi-tenant systems.
Preferred: familiarity with chaos engineering, resilience testing, or load testing.
Preferred: exposure to building or improving reliability practices on a team.
Preferred: familiarity with AI-first development workflows using LLM-powered tools for automation, code generation, or documentation.

Benefits

Unlimited paid time off, 12 paid holidays, 4 global VeeaMe Days for self-care, and 24 paid volunteer hours annually.
Paid parental leave: 8 weeks for all parents and 16 weeks for birthing parents.
Medical, dental, and vision coverage starting on the first day.
Mental health support, therapy sessions, and digital wellness tools through the Employee Assistance Program.
401(k) retirement plan with company matching contributions.
Fertility, adoption, and surrogacy support through Maven.
Legal services, identity protection, and supplemental health insurance options.
Tax-advantaged spending accounts for healthcare, dependent care, and commuting.
Professional development resources including mentorship, training, workshops, on-demand learning libraries, and learning events.
Competitive compensation with pay transparency, performance-based bonus, and role-based geographic salary ranges.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Senior Site Reliability Engineer

Counterpart Health 51-200 hospital & health care

Counterpart Health is hiring a Senior Site Reliability and Infrastructure Engineer to support and evolve the technology platform behind its primary care tool and maintain reliable infrastructure for domestic and international workloads.

United States Full-time Senior Site Reliability Engineer (SRE)

$160k-$208k

AWS Azure CI/CD Containerd DNS Docker GCP Go gRPC Helm Kubernetes Linux Load Balancing Prometheus Python Shell Scripting TCP/IP

17 hours, 31 minutes ago

Apply

17 hours, 31 minutes ago

Senior Test Platform & Reliability Engineer - Star Trek Fleet Command

Scopely 1K-5K Internet Software & Services

Scopely is hiring a Senior Test Platform & Reliability Engineer in Ireland to build validation, reliability, and developer enablement platforms for Star Trek Fleet Command’s large-scale live-service backend systems.

Ireland Full-time Senior SDET (Software Development Engineer in Test) Site Reliability Engineer (SRE)

AWS Bash CI/CD Docker GitLab Go Python Terraform

17 hours, 46 minutes ago

Apply

17 hours, 46 minutes ago

Senior Software Engineer - Databases, SRE | Canada | Remote

Grafana 1K-5K IT Services

Grafana Labs is hiring a Senior Software Engineer for its remote SRE team to improve reliability and operability of Grafana Cloud database services for high-SLA customers across AWS, GCP, and Azure.

Canada Full-time Senior Site Reliability Engineer (SRE) Software Engineer

$108k-$130k

AWS Azure GCP Go Helm Java Kubernetes Linux Microservices Python Terraform

1 day, 16 hours ago

Apply

1 day, 16 hours ago

Senior Site Reliability Engineer

Semios 51-250 Food Products

Semios Group is hiring a Senior Site Reliability Engineer to help scale, secure, and improve the reliability of its global agricultural technology platform.

Canada Full-time Senior Site Reliability Engineer (SRE)

$140k-$160k

AWS Azure Bash Buildkite CI/CD Datadog Docker Envoy GCP Git GitHub GitHub Actions GitLab Go Jenkins Kubernetes Linux NATS New Relic Prometheus Python Ruby Splunk Terraform

1 day, 18 hours ago

Apply

1 day, 18 hours ago

Veeam Software

Tags

Links

GOV Site Reliability Engineer

Veeam Software

Description

Requirements

Benefits

Similar Roles

Senior Site Reliability Engineer

Senior Test Platform & Reliability Engineer - Star Trek Fleet Command

Senior Software Engineer - Databases, SRE | Canada | Remote

Senior Site Reliability Engineer

You're on a roll! Sign up now to keep applying.