Principal Engineer, Core Infrastructure

1 hour, 10 minutes ago
Full-time
Lead
DevOps and Infrastructure
Klaviyo

Klaviyo

Klaviyo offers intelligent email marketing, SMS, and automation services for ecommerce businesses, empowering brands to personalize customer interactions and drive growth.

IT Services
1K-5K
Founded 2012

Description

  • Architect and evolve the Kubernetes platform, service mesh, networking, storage, and CI/CD pipelines.
  • Ship golden paths and infrastructure-as-code modules for platform services.
  • Define platform service-level objectives and use error budgets to balance reliability and delivery speed.
  • Drive incident learning, readiness reviews, and operational excellence across platform systems.
  • Improve developer velocity by reducing build and deploy times, flaky tests, and local development friction.
  • Lead capacity planning and manage commitments for platform infrastructure.
  • Build guardrails for cost, security, and compliance in partnership with Security and FinOps teams.
  • Write high-impact code, automation, and tooling to support the platform.
  • Mentor teams and raise the bar for engineering and incident response practices.
  • Embed AI into the developer experience for code generation, observability, and incident response.

Requirements

  • 10+ years of experience building and operating cloud platforms, including compute, networking, storage, and runtimes such as Kubernetes.
  • Track record of operating multi-region highly available systems with strong SLO discipline.
  • Deep expertise in Kubernetes, service mesh, Terraform/infrastructure as code, CI/CD, and production observability.
  • Experience with databases and storage systems, including SQL and NoSQL databases and object, block, or file storage platforms.
  • Experience bringing AI into platform engineering, including copilot-assisted workflows, intelligent test generation, or AIOps.
  • Ability to lead through design reviews, incident excellence, and SLO/error-budget tradeoffs in business terms.
  • Hands-on fluency with AI tools and responsible AI adoption.
  • Experience with enterprise governance, compliance, and audit requirements (preferred).
  • Familiarity with GDPR and data privacy in large-scale production environments (preferred).
  • Willingness to travel up to 10% for onboarding, team meetings, client or partner work, and industry events.

Benefits

  • Base salary range of $244,000 to $366,000 USD.
  • Eligibility for the company’s annual cash bonus plan.
  • Equity as part of the total compensation package.
  • Sign-on payments may be included.
  • Comprehensive health, welfare, and wellbeing benefits based on eligibility.
  • Support for remote or flexible work is implied by location-based US salary coverage and coordinated travel requirements.
  • Klaviyo provides accommodations as needed during the hiring process.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Senior DevOps Engineer, APJ

Arize AI 51-250 IT Services

Arize AI is hiring an On-Prem engineer to support deployment and operations of its AI observability platform for customer environments across SaaS and on-prem offerings, with a focus on APJ accounts based in Malaysia.

AWS Azure GCP Kubernetes
25 minutes ago

Technical Infrastructure Engineer

Game Plan Tech Internet Software & Services

Game Plan Tech is hiring a Technical Infrastructure Engineer to design and improve the secure cloud, identity, endpoint, and operational infrastructure that supports internal operations and client-facing work.

GCP LLM OAuth SIEM
1 hour, 25 minutes ago

Windows DevOps Engineer

JustMarkets 1-10 Capital Markets

Windows DevOps Engineer at a fintech company, responsible for maintaining and improving the trading infrastructure and supporting reliable product operations.

Ansible AWS Azure C# Docker Kubernetes MySQL PostgreSQL PowerShell Python Terraform
1 hour, 25 minutes ago

VP, Trading Systems Developer

Galaxy 251-1K Capital Markets

Galaxy is seeking a Trading Systems Engineer to design and build low-latency trading infrastructure that supports market data ingestion, order routing, and execution across multiple trading businesses.

AWS C++ Docker Generative AI Java Kubernetes Linux TCP/IP
1 hour, 55 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers