Block

Block comprises Square, Cash App, Spiral, TIDAL, TBD, and foundational teams. The company focuses on economic empowerment by creating tools that expand access to the economy. Square helps sellers run and grow businesses, Cash App redefin...

Capital Markets
10K-50K
Founded 2009

Description

  • Build and extend platforms to improve system reliability.
  • Work toward company-wide reliability goals across multiple platforms and organizations.
  • Standardize reliability tools across teams and systems.
  • Triage, coordinate, and lead stabilization efforts for Sev 0 and Sev 1 incidents.
  • Serve as primary on-call for Tier 0 services and maintain structured escalation paths.
  • Lead incident command, mitigation, and escalation during high-severity events.
  • Drive platform-wide reliability improvements, shared operational tooling, and deploy-safety patterns.
  • Use AI-driven systems to improve signal detection, reduce noise, and accelerate root cause analysis.
  • Design and implement safe deployment patterns such as progressive delivery, automated rollback, and guardrails.
  • Improve observability, incident detection and response, and operational workflows through automation.

Requirements

  • 5+ years of software development experience.
  • Experience running production on-call for high-availability systems.
  • Strong incident management skills, including structured triage, mitigation under pressure, and blameless postmortems.
  • Fluency with CI/CD pipelines, progressive rollout strategies, and rollback automation.
  • Monitoring and observability expertise, including tuning alerts for uptime, error rates, latency regression, and resource exhaustion.
  • Familiarity with AI-driven tooling for observability, incident analysis, or automation.
  • A habit of using AI to accelerate problem-solving and reduce toil.
  • Demonstrated technical initiative and leadership on previous backend or platform-focused projects.
  • Ability to create and maintain evidence-based maturity assessments using trailing 90-day data windows.
  • Comfort with vendor and dependency management, including maintaining validated escalation contacts reachable within 5 minutes.

Benefits

  • Remote work.
  • Medical insurance.
  • Flexible time off.
  • Retirement savings plans.
  • Modern family planning benefits.
  • A globally distributed work environment with collaboration across multiple time zones.
  • Reasonable accommodations during the recruitment process for applicants with disabilities.
