Backblaze

Backblaze is a pioneer in robust, scalable low-cost cloud backup and storage services, offering enterprise hot storage, low-cost backup and archive solutions. With the easiest way to back up all files, Backblaze provides unlimited, unthrottled, and unc...

IT Services

Information Technology

251-1K (393)

Founded 2007

45 open positions

Site Reliability Engineer I

1 month ago

India, United States

Full-time

Junior

Site Reliability Engineer (SRE)

DevOps and Infrastructure

Ansible, HashiCorp Vault, Linux, Zabbix

Description

Act as the first point of contact for customer-affecting issues and production alerts.
Drive resolution of technical problems and support timely incident handling.
Follow incident management processes and complete post-mortems to identify improvements.
Provide consistent communication to management during incidents and operational events.
Respond to Zabbix alerts, take direct action when needed, or escalate appropriately.
Ensure escalations are handed off successfully to the right owners.
Monitor pod health across sites and perform daily filesystem checks for pods.
Troubleshoot infrastructure and deployment issues for Data Center Technicians, including migration and Ansible playbook issues.
Identify and escalate potential network issues and support network-related deployment readiness.
Support Vault pre-deployment configuration, testing, migrations, and migration pod health checks.
Document operational procedures and help automate daily tasks.
Monitor server farm releases and updates, escalating issues as they arise.
Participate in on-call rotation and work outside normal business hours as needed.
Assist other TechOps team members and recommend process improvements to increase productivity.

Requirements

Must be located in Bangalore.
2-4 years of relevant experience.
Knowledge of Linux and system administration.
Knowledge of network cabling, network classification, and network topology.
Strong analytical thinking.
Strong communication skills and ability to work with different teams.
Desire to learn and develop necessary technical skills.
Ability to work outside normal business hours, including weekends, holidays, and evenings, as needed.

Benefits

RSU grants for full-time employees.
Annual company bonus plan.
Healthcare for family, including dental and vision coverage.
401(k) retirement plan.
ESPP program.
Flexible vacation policy.
Maternity and paternity leave.
MacBook Pro for work plus a generous stipend to personalize your workstation.
Childcare bonus.
Fertility treatment and support.
Learning and development program.
Commuter benefits.
Culture that supports a healthy work-life balance.
Expected salary range of $66,000 - $88,000.

Similar Roles

Staff Operations Engineer

Mozilla 251-1K Internet Software & Services

Mozilla is hiring a Staff Operations Engineer to lead the design, reliability, and evolution of hybrid-cloud and workplace infrastructure across teams.

Canada Full-time Lead Infrastructure Engineer Site Reliability Engineer (SRE)

$86k-$127k

Ansible, DNS, Linux, Puppet, Python, TCP/IP, Unix

7 hours, 37 minutes ago

Principal Site Reliability Engineer (SRE)

Symmetrio Professional Services

Symmetrio is recruiting a Principal Site Reliability Engineer for a rapidly growing healthcare technology company to own the reliability, scalability, security, and performance of a mission-critical SaaS platform used by healthcare providers across the United States.

United States Full-time Lead Site Reliability Engineer (SRE)

Active Directory, AWS, CI/CD, Datadog, Django, Grafana, Kubernetes, Python, Terraform, Windows Server

7 hours, 53 minutes ago

Performance Test Engineer Lead

PartnerOne 51-250 Media

An enterprise performance engineering role at a cloud-focused organization, responsible for validating the scalability, stability, and production readiness of distributed systems across Azure and hybrid environments.

Egypt Full-time Lead QA Engineer Site Reliability Engineer (SRE)

Azure, CI/CD, Kubernetes, PowerShell

8 hours, 8 minutes ago
