66degrees

66degrees

66degrees: Google Cloud Premier Partner shaping the Future of Work with AI and data solutions.

IT Services
251-1K

Description

  • Ensure near-zero downtime through monitoring, alerting, self-healing automation, and continuous improvement.
  • Build highly automated, available, and scalable systems using software and infrastructure principles.
  • Advise clients on DevOps and SRE practices, including deployment pipelines, high availability, service reliability, technical debt, and operational toil.
  • Take a proactive approach to client workloads by anticipating failures, automating tasks, and ensuring availability.
  • Collaborate with clients, internal teams, and Google engineers to investigate and resolve infrastructure issues.
  • Write documentation, contribute to open-source efforts, and support operational improvements.
  • Design and deploy new cloud workloads for client environments.
  • Support and optimize live services running at scale.

Requirements

  • 3+ years of cloud and infrastructure experience, including Linux, Windows, Kubernetes, databases, and networking services.
  • 2+ years of Google Cloud experience; related certifications are strongly preferred but not required.
  • Proficiency with Python is required.
  • Strong provisioning and configuration experience with Terraform.
  • Experience with 24x7x365 monitoring, incident response, and on-call support.
  • Experience troubleshooting issues across systems, networks, and code.
  • Experience negotiating error budgets, SLIs, SLOs, and SLAs with product owners.
  • Ability to work independently and collaboratively across teams.
  • Experience working in Agile, Scrum, or Kanban methodologies within the SDLC.
  • Strong communication skills in a heavily customer-facing role.
  • Bachelor’s degree in computer science, electrical engineering, or equivalent is required.

Benefits

  • Remote candidates are welcome to apply.
  • Training and professional growth are supported.
  • Opportunity to work with cutting-edge Google Cloud technologies and varied client environments.
  • Chance to contribute at a rapidly growing Google Premier Partner.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Staff Operations Engineer

Mozilla 251-1K Internet Software & Services

Mozilla is hiring a Staff Operations Engineer to lead the design, reliability, and evolution of hybrid-cloud and workplace infrastructure across teams.

Ansible DNS Linux Puppet Python TCP/IP Unix
3 hours, 38 minutes ago

Principal Site Reliability Engineer (SRE)

Symmetrio Professional Services

Symmetrio is recruiting a Principal Site Reliability Engineer for a rapidly growing healthcare technology company to own the reliability, scalability, security, and performance of a mission-critical SaaS platform used by healthcare providers across the United States.

Active Directory AWS CI/CD Datadog Django Grafana Kubernetes Python Terraform Windows Server
3 hours, 53 minutes ago

Performance Test Engineer Lead

PartnerOne 51-250 Media

An enterprise performance engineering role at a cloud-focused organization, responsible for validating the scalability, stability, and production readiness of distributed systems across Azure and hybrid environments.

Azure CI/CD Kubernetes PowerShell
4 hours, 8 minutes ago

Site Reliability Engineer

MLabs 11-50 Internet Software & Services

Remote UK-hours Site Reliability Engineering role at a financial technology company, focused on automating and operating the infrastructure that supports global integration services for financial institutions.

Active Directory Ansible AWS CI/CD GCP OAuth PostgreSQL SAML
4 hours, 23 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers