66degrees

66degrees: Google Cloud Premier Partner shaping the Future of Work with AI and data solutions.

IT Services
251-1K

Description

  • Ensure near-zero downtime through monitoring, alerting, self-healing automation, and continuous improvement.
  • Build highly automated, available, and scalable systems using software and infrastructure principles.
  • Advise clients on DevOps and SRE practices, including deployment pipelines, high availability, service reliability, technical debt, and operational toil.
  • Take a proactive approach to client workloads by anticipating failures, automating tasks, and ensuring availability.
  • Collaborate with clients, internal teams, and Google engineers to investigate and resolve infrastructure issues.
  • Write documentation, contribute to open-source efforts, and support operational improvements.
  • Design and deploy new cloud workloads for client environments.
  • Support and optimize live services running at scale.

Requirements

  • 3+ years of cloud and infrastructure experience, including Linux, Windows, Kubernetes, databases, and networking services.
  • 2+ years of Google Cloud experience; related certifications are strongly preferred but not required.
  • Proficiency with Python is required.
  • Strong provisioning and configuration experience with Terraform.
  • Experience with 24x7x365 monitoring, incident response, and on-call support.
  • Experience troubleshooting issues across systems, networks, and code.
  • Experience negotiating error budgets, SLIs, SLOs, and SLAs with product owners.
  • Ability to work independently and collaboratively across teams.
  • Experience working in Agile, Scrum, or Kanban methodologies within the SDLC.
  • Strong communication skills in a heavily customer-facing role.
  • Bachelor’s degree in computer science, electrical engineering, or equivalent is required.

Benefits

  • Remote candidates are welcome to apply.
  • Training and professional growth are supported.
  • Opportunity to work with cutting-edge Google Cloud technologies and varied client environments.
  • Chance to contribute at a rapidly growing Google Cloud Premier Partner.

