Senior Site Reliability Engineer

1 week, 4 days ago
Full-time
Senior
DevOps and Infrastructure
Oxylabs

Oxylabs is a leading provider of premium proxy services with a vast network of residential and datacenter IP proxies. It offers industry-leading web scraping solutions and a trusted ethical proxy network for global clients.

IT Services
251-1K
Founded 2015

Description

  • Own and evolve Webshare's production infrastructure, including the migration from Docker Swarm to Kubernetes or a hybrid Kubernetes + Ansible setup.
  • Maintain high availability across hundreds of servers and approximately 50 services.
  • Drive observability in cooperation with the development team.
  • Establish and enforce Infrastructure as Code practices, CI/CD pipeline reliability, and change management processes.
  • Participate in the on-call rotation alongside backend developers.
  • Respond to incidents, lead resolution efforts, run post-mortems, and drive systematic remediation.
  • Build platform tooling that improves developer experience and reduces infrastructure toil.
  • Keep backend engineers informed and capable through shared infrastructure ownership.

Requirements

  • Experience building and operating highly available infrastructure at comparable scale, including hundreds of servers and dozens of services in production.
  • Hands-on experience with Kubernetes in self-hosted or bare-metal environments.
  • Strong Infrastructure as Code experience.
  • Experience owning CI/CD pipelines end-to-end, such as GitLab CI or an equivalent system.
  • Experience being on call in a production environment.
  • Proactive communication and problem-solving mindset.
  • Scripting and development skills.
  • Experience leading at least one major infrastructure migration, from planning through stabilization, is preferred.
  • Familiarity with Python and/or Go is preferred.
  • Exposure to proxy or networking-heavy infrastructure is preferred.
  • Experience in a small team where developers shared infrastructure responsibility is preferred.
  • Familiarity with edge clusters or split compute/edge architectures is preferred.

Benefits

  • Gross salary of 26,000 PLN to 34,000 PLN per month, with flexibility to discuss a different salary based on skills and experience.
  • 40+ internal learning options, external conferences, mentorship, and year-round knowledge sharing.
  • Private health insurance, a gym allowance, and a wellness app.
  • Team events, an overseas workation, and opportunities to celebrate milestones together.
