Backblaze

Backblaze is a pioneer in robust, scalable, low-cost cloud backup and storage services, offering enterprise hot storage as well as low-cost backup and archive solutions. With the easiest way to back up all files, Backblaze provides unlimited, unthrottled, and unc...

IT Services
251-1K
Founded 2007

Description

  • Support the availability and durability of critical services across production environments.
  • Monitor service health using SLIs, SLOs, and error budgets, and escalate issues when thresholds are at risk.
  • Participate in on-call rotations, incident response, and post-incident reviews to drive service improvements.
  • Follow ITIL/OSS processes for incident, change, problem, and capacity management.
  • Develop automation for operational tasks to reduce manual intervention and toil.
  • Contribute to monitoring, logging, and alerting frameworks.
  • Work with CI/CD pipelines, configuration management, and infrastructure as code tools.
  • Write scripts to improve system reliability and efficiency.
  • Partner with engineering, product, and operations teams to support resilient system design and operations.
  • Assist with capacity planning, disaster recovery exercises, vendor troubleshooting, and SLA tracking.

Requirements

  • Bachelor’s degree in Computer Science, Engineering, or a related field, or equivalent experience.
  • 2–4 years of experience in site reliability, systems engineering, or operations.
  • Exposure to large-scale, production-grade systems.
  • Solid Linux systems administration and troubleshooting skills.
  • Familiarity with service reliability concepts, including monitoring, alerting, incident response, and root cause analysis.
  • Proficiency in at least one scripting language such as Python, Bash, or Go.
  • Understanding of container technologies such as Docker and Kubernetes, and of microservices concepts.
  • Knowledge of incident response and operational best practices.
  • Experience in a SaaS, service provider, or distributed systems environment is preferred.
  • Familiarity with ITIL/OSS practices and SLO/SLA concepts is preferred.
  • Experience with cloud platforms such as AWS, GCP, or Azure is preferred.
  • Ability to work independently, take ownership, and drive projects from problem discovery through resolution is preferred.

Benefits

  • Backblaze emphasizes learning, development, and growth as part of its culture.
  • The company supports candidates who may not meet every requirement and encourages them to apply.
  • Backblaze is committed to diversity, equity, and inclusion and fostering a sense of belonging.
  • Backblaze is an Equal Opportunity Employer.
