Infrastructure Team Lead

3 weeks ago
Full-time
Lead
DevOps and Infrastructure
Leadfeeder

Leadfeeder is a B2B lead generation tool that reveals website visitors' companies, activities, and engagement, integrating with Google Analytics for effective lead tracking and conversion.

Professional Services
51-250
$5M raised

Description

  • Lead and develop a high-performing team of Site Reliability Engineers supporting a hybrid cloud infrastructure on AWS with an on-premises extension at Hetzner.
  • Design, document, and implement reliable and secure infrastructure solutions aligned with industry best practices.
  • Oversee technical analysis, cost estimation, optimization, platform and system design, compliance, resource planning, and delivery milestones.
  • Work hands-on with the team to maintain a deep understanding of the infrastructure and lead incident response during critical issues.
  • Define team goals and strategy while building strong relationships with internal stakeholders across the organisation.
  • Manage the on-call rotation and escalation processes across infrastructure and software engineering teams.
  • Champion engineering best practices and drive continuous improvement in production environment quality and reliability.
  • Support product and engineering teams in spinning up, maintaining, and monitoring the infrastructure needed for their applications and services.

Requirements

  • 10+ years of hands-on experience in infrastructure and related services.
  • Strong technical background in hybrid cloud infrastructure, including AWS, Terraform, Kubernetes, and monitoring/observability tooling.
  • Proven leadership or management experience with a small infrastructure or SRE team.
  • Strong interpersonal, people management, and communication skills in English, both written and verbal.
  • Experience setting strategic vision and owning complex technical initiatives from conception to delivery.
  • Experience using AI to improve efficiency.
  • A disciplined approach to maintaining and enforcing engineering best practices.
  • Ability to collaborate effectively with cross-functional teams and business units.
  • Detail-oriented and self-organised with the ability to manage multiple projects and priorities.
  • Comfortable working fully remotely and physically located within Europe.

Benefits

  • Work with a knowledgeable, high-achieving, and fun team.
  • Join an international, diverse, dynamic, and committed work environment.
  • Remote work with a flexible work schedule.
  • Mental health support with Auntie.
