Oowlish

Oowlish provides companies of all sizes access to the best technical talent in Brazil, making innovation more accessible and convenient than ever. Because our mission is to give every company,...

Internet Software & Services
51-250
Founded 2017

Description

  • Define, implement, and continuously improve SLIs, SLOs, and error budgets.
  • Develop and maintain observability strategies across monitoring, logging, tracing, and alerting.
  • Own observability configuration, instrumentation, and alert optimization.
  • Lead incident command during production outages and coordinate cross-functional response efforts.
  • Drive blameless postmortems and ensure corrective actions are completed.
  • Own and improve the on-call program, including rotations, escalation policies, runbooks, and alert tuning.
  • Establish production readiness standards for new services.
  • Partner with engineering teams on capacity planning, scalability, and disaster recovery initiatives.
  • Automate operational processes and reliability improvements using software engineering best practices.
  • Continuously improve system reliability, availability, and operational efficiency.

Requirements

  • 5+ years of experience in Site Reliability Engineering, Production Engineering, Reliability Engineering, or similar roles.
  • Proven experience operating production systems in high-availability environments.
  • Hands-on experience defining and managing SLIs, SLOs, and error budgets.
  • Experience leading production incident response and Incident Command.
  • Strong observability and monitoring experience.
  • Strong software engineering skills using Python, Go, or TypeScript.
  • Experience working with cloud platforms.
  • Strong written and verbal English communication skills.
  • Experience with Datadog, AWS, or Heroku is preferred.
  • Experience working in regulated industries such as healthcare (HIPAA) or financial services is preferred.
  • Experience establishing or maturing an SRE practice is preferred.
  • Experience with Kubernetes is preferred.
  • Experience with PostgreSQL or SQL Server is preferred.

Benefits

  • Remote work / home office.
  • Competitive compensation based on experience.
  • Career plans with extensive growth opportunities.
  • International projects with clients in the United States and Europe.
  • Oowlish English Program for technical and conversational English.
  • Oowlish Fitness with Total Pass.
  • Games and competitions.
