MLabs

MLabs is a Haskell, Rust, Blockchain, and AI consultancy specializing in mission-critical software development, cross-team collaboration, and cutting-edge value delivery for fintech, blockchain, and information technology sectors.

Internet Software & Services

Information Technology

11-50 (30)

Founded 2018

27 open positions

Links

View All Jobs

Senior Site Reliability Engineer (Azure)

1 month ago

France, Spain, United States, Germany, Netherlands

Full-time

Senior

Site Reliability Engineer (SRE)

DevOps and Infrastructure

Azure CI/CD Go Grafana Kubernetes Prometheus Python Terraform

Apply Now

MLabs

Internet Software & Services

11-50

Founded 2018

View All Jobs 27

Description

Architect and deploy secure, scalable Azure infrastructure for production-grade distributed systems.
Develop and maintain Terraform-based infrastructure as code for repeatable multi-environment deployments.
Translate ambiguous product and customer requirements into technical architecture and execution plans.
Build and optimize platform services, APIs, and integrations to extend core system capabilities.
Partner with engineering, security, and product teams to deliver enterprise-ready infrastructure solutions.
Drive improvements in reliability, observability, and incident response.
Provide Tier 2 infrastructure support for customer deployments.
Establish operational excellence for a greenfield Azure environment.
Help achieve feature parity between Azure and the organization’s other cloud environments.

Requirements

Extensive experience designing and building production-grade systems on Azure.
Ability to transform high-level requirements into scalable, delivered systems.
Strong technical communication skills with both engineering and non-technical stakeholders.
High-ownership mindset with a strong bias for action and accountability.
Deep knowledge of Azure networking, compute, identity, security, and storage.
Advanced proficiency with Terraform at production scale.
Professional experience in Go and/or Python.
Background in distributed systems, high-availability architectures, or platform engineering.
Experience with automation tooling across the full infrastructure lifecycle and CI/CD.
Hands-on experience with Kubernetes and container orchestration (preferred).
Familiarity with observability tools such as Prometheus and Grafana (preferred).
Experience with workflow/orchestration platforms like Argo or Spacelift (preferred).

Benefits

Compensation of $150K–$200K.
Equity and tokens tied to long-term project growth.
Annual performance bonuses based on individual and company milestones.
Comprehensive health insurance for US-based employees.
401(k) plan for US-based employees.
Remote, full-time role with US coverage and Europe considered if working hours overlap with EST.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Site Reliability Engineer

Alpaca 51-250 Capital Markets

Alpaca is hiring a Site Reliability Engineer to keep its brokerage platform reliable and operable across cloud, Kubernetes, observability, messaging, and database systems, with a strong focus on PostgreSQL reliability on the trading-critical path.

Europe Full-time Mid Level Site Reliability Engineer (SRE)

DNS GitOps Go Kafka Kubernetes Linux Load Balancing PostgreSQL Python RabbitMQ Secrets Management TLS

47 minutes ago

Apply

47 minutes ago

Site Reliability Engineer

Kaseya 1K-5K IT Services

Kaseya is hiring a Site Reliability Engineer to own the reliability, automation, and production stability of its AWS-based services used by thousands of MSPs worldwide.

Canada Full-time Mid Level Site Reliability Engineer (SRE)

$85k-$96k

Ansible AWS Chef CloudFormation Datadog DevSecOps Elasticsearch Kibana Kubernetes MySQL PostgreSQL Puppet Secrets Management Serverless Terraform

4 hours, 47 minutes ago

Apply

4 hours, 47 minutes ago

SRE - DevOps Engineer - Argentina

Coderio 51-250 Internet Software & Services

Coderio is hiring a remote DevOps/SRE Engineer in Argentina to ensure the stability, scalability, and efficient operation of the infrastructure that supports its global digital solutions.

Argentina Full-time Mid Level Site Reliability Engineer (SRE)

Argo CD CI/CD Flux GitHub Actions GitOps Helm Jenkins Kubernetes OpenShift Terraform

8 hours, 27 minutes ago

Apply

8 hours, 27 minutes ago