66degrees

66degrees

66degrees: Google Cloud Premier Partner shaping the Future of Work with AI and data solutions.

IT Services
251-1K

Description

  • Ensure near-zero downtime through monitoring, alerting, self-healing automation, and continuous improvement.
  • Build highly automated, available, and scalable systems using software and infrastructure principles.
  • Advise clients on DevOps and SRE practices, including deployment pipelines, high availability, service reliability, technical debt, and operational toil.
  • Take a proactive approach to client workloads by anticipating failures, automating tasks, and ensuring availability.
  • Collaborate with clients, internal teams, and Google engineers to investigate and resolve infrastructure issues.
  • Write documentation, contribute to open-source efforts, and support operational improvements.
  • Design and deploy new cloud workloads for client environments.
  • Support and optimize live services running at scale.

Requirements

  • 3+ years of cloud and infrastructure experience, including Linux, Windows, Kubernetes, databases, and networking services.
  • 2+ years of Google Cloud experience; related certifications are strongly preferred but not required.
  • Proficiency with Python is required.
  • Strong provisioning and configuration experience with Terraform.
  • Experience with 24x7x365 monitoring, incident response, and on-call support.
  • Experience troubleshooting issues across systems, networks, and code.
  • Experience negotiating error budgets, SLIs, SLOs, and SLAs with product owners.
  • Ability to work independently and collaboratively across teams.
  • Experience working in Agile, Scrum, or Kanban methodologies within the SDLC.
  • Strong communication skills in a heavily customer-facing role.
  • Bachelor’s degree in computer science, electrical engineering, or equivalent is required.

Benefits

  • Remote candidates are welcome to apply.
  • Training and professional growth are supported.
  • Opportunity to work with cutting-edge Google Cloud technologies and varied client environments.
  • Chance to contribute at a rapidly growing Google Premier Partner.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Manager, Software Engineering (Resilience Engineering)

Affirm 1K-5K Diversified Financial Services

Affirm is hiring an Engineering Manager to lead its Resilience Engineering team in building production load testing and chaos engineering capabilities that improve the safety and reliability of its production systems.

AWS Java Kotlin Kubernetes Python
2 hours, 42 minutes ago

Senior Site Reliability Engineer

Civica 1K-5K Internet Software & Services

Civica is hiring a Senior Site Reliability Engineer to own the reliability, performance, security, and automation of the cloud platform supporting its public-sector SaaS products.

Ansible AWS Azure CI/CD CloudFormation Datadog ELK Stack GCP GitHub Actions Go Grafana Jaeger Java Kubernetes .NET OpenSearch OpenShift Packer Prometheus Python Terraform
14 hours, 27 minutes ago

Manager, Software Engineering (Resilience Engineering)

Affirm 1K-5K Diversified Financial Services

Affirm is seeking an Engineering Manager to lead its Resilience Engineering team, building production load testing and chaos engineering capabilities that improve the safety and reliability of production systems.

AWS Java Kotlin Kubernetes Microservices Python
14 hours, 27 minutes ago

Site Reliability Engineer

Sitetracker 251-1K Diversified Telecommunication Services

Site Reliability Engineer at a Canada-based technology company, responsible for building and scaling a proactive reliability practice for AI-driven platform workloads in a remote environment.

AWS Bash CloudFormation EC2 GitHub Actions Load Balancing Terraform
14 hours, 27 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers