Overstory

Overstory

Overstory uses AI and satellite imagery to prevent wildfires and power outages by analyzing vegetation for electric utilities.

Utilities
11-50
Founded 2018
$25M raised

Description

  • Own the platform strategy across Platform, MLOps, and SRE, aligning it with company and product goals.
  • Lead and grow senior individual contributors across multiple teams, and eventually manage managers, while building strong technical leadership and healthy team cultures.
  • Define and evolve the platform vision for developer experience, internal tooling, CI/CD, infrastructure, observability, and reliability standards.
  • Oversee MLOps systems for model development, training, deployment, monitoring, and production governance.
  • Partner cross-functionally with Product, ML, Data, Security, and Compliance to meet current and future platform needs.
  • Make thoughtful tradeoffs between speed, stability, innovation, cost, reliability, and operational excellence.
  • Set and track metrics and accountability for platform performance, reliability, and developer productivity.
  • Help transition the company from early-stage infrastructure to a more mature, scalable, and repeatable platform.

Requirements

  • 10+ years of experience leading platform, infrastructure, or reliability teams in a scaling startup environment.
  • Experience navigating ambiguity, rapid growth, and the shift from early-stage systems to mature platforms.
  • Strong understanding of cloud-native infrastructure, with GCP strongly preferred.
  • Experience with Kubernetes, CI/CD, and modern DevOps practices.
  • Experience supporting machine learning systems in production, including deployment, monitoring, and lifecycle management.
  • A track record of building reliable, scalable systems used by fast-moving product teams.
  • Excellent people leadership skills, including coaching managers, growing talent, and building inclusive, high-performing teams.
  • Strong communication skills and the ability to influence across organizational boundaries.
  • Experience in data- or ML-heavy products.
  • Experience with geospatial data, image processing, or mapping technology is a nice to have.
  • Ability to work in one of the supported time zones/countries, including Europe or Eastern North America, and in one of the listed countries if remote.

Benefits

  • Competitive salary with equity.
  • Flexible working environment with a lot of autonomy.
  • Remote working budget.
  • Educational budget and time to develop new skills.
  • Opportunity to do mission-driven work focused on reducing wildfires and supporting climate resilience.
  • A supportive, vibrant team culture built on openness, tolerance, and respect.
  • Occasional in-person collaboration and an annual team gathering event.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

AI Learning Platform & Systems Lead (Fixed-Term Contract - Remote)

Pickle Professional Services

FYXER is hiring a freelance AI Learning Platform & Systems Lead to support the delivery, scalability, reliability, and optimisation of AI-enabled learning environments and platform ecosystems.

57 minutes ago

Staff Software Platform Responsible Engineer

Relativity Space 251-1K Aerospace & Defense

Relativity Space is hiring a Linux BSP and embedded software engineer for its Interplanetary Sciences Program to build and harden the operating system foundation for payload computers on a space mission.

C CI/CD Git Linux
2 hours, 32 minutes ago

Director, Prediction and ML Planning

Motional 1K-5K Automotive

Motional is hiring a Director of Behaviors to lead its machine learning-based Prediction and Planning teams for autonomous vehicles, driving the development of a unified behavior stack that supports joint prediction and planning.

LLM Machine Learning Reinforcement Learning
2 hours, 43 minutes ago

AI/ML Engineer

66degrees 251-1K IT Services

66degrees is hiring a Data Scientist/AI-ML Engineer to analyze complex client data and deliver AI-driven solutions that improve business decisions and outcomes.

Deep Learning Docker Feature Engineering GCP Generative AI Git Keras Kubernetes LLM Looker Machine Learning MLOps Python PyTorch Reinforcement Learning Scikit-learn Shell Scripting SQL Statistics TensorFlow Vertex AI
3 hours, 12 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers