Algolia

Algolia provides a hosted search platform that leverages AI to enhance user experience and developer engagement, enabling enterprises and developers to deliver fast, relevant search results across websites and mobile applications.

Internet Software & Services

Information Technology

251-1K (800)

Founded 2012

$334M raised

22 open positions

Links

View All Jobs

Senior Site Reliability Engineer, AI Research

2 months, 3 weeks ago

Australia

Full-time

Senior

Site Reliability Engineer (SRE)

DevOps and Infrastructure

Argo CD CI/CD Datadog GCP GitOps Go Kubernetes Python Terraform

Apply Now

Algolia

Internet Software & Services

251-1K

Founded 2012

$334M raised

View All Jobs 22

Description

Support and evolve the reliability of platforms used by the AI Research team.
Ensure production services meet expectations for availability, latency, and operational readiness.
Design infrastructure and operational patterns that balance iteration speed with production safeguards.
Work closely with researchers and engineers as an advisor on infrastructure, reliability, and operations.
Participate in team planning and execution from early exploration through production rollout.
Help researchers self-serve infrastructure safely and effectively.
Build and maintain Kubernetes-based services on Google Cloud Platform using infrastructure-as-code and GitOps.
Own and improve CI/CD pipelines for Go-based services and some Python-based services.
Design and operate observability systems, including tools such as Datadog.
Participate in a light on-call rotation and respond to incidents while improving systems over time.

Requirements

Strong experience operating cloud-first infrastructure.
Hands-on experience running production services on Kubernetes.
Proficiency with infrastructure-as-code, especially Terraform, and CI/CD systems.
Experience supporting production services written in Go; Python experience is a plus.
Solid grounding in service reliability, incident response, and operational best practices.
Comfort working in ambiguous environments where problems are not always well defined.
Experience supporting mission-critical internal platforms is preferred.
Exposure to research or experimentation-heavy environments is preferred.
Familiarity working alongside researchers or highly specialized domain experts is preferred.
AI, ML, or deep learning experience is not required.
Model training, tuning, or ML framework expertise such as PyTorch or JAX is not required.

Benefits

Remote-friendly work culture with flexibility to work remotely or in a hybrid model.
Australia-based role with occasional off-hours collaboration as needed.
High-impact work that directly enables new AI-powered capabilities for customers.
High agency to help shape what gets built and how it is built.
Opportunity to collaborate with experienced SREs, engineers, and PhD researchers.
Growth in research-adjacent infrastructure and platform reliability expertise.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Lead Site Reliability Engineer

Zeta Global 1K-5K Media

Lead Site Reliability Engineer at Zeta Global, responsible for improving reliability and operational resilience across cloud and on-prem systems supporting the Zeta Marketing Platform.

India Full-time Lead Site Reliability Engineer (SRE)

AWS Bash ELK Stack Go Grafana Honeycomb Kubernetes Linux OpenTelemetry Prometheus Pulumi Python Shell Scripting Terraform

3 hours, 12 minutes ago

Apply

3 hours, 12 minutes ago

Senior Infrastructure DevOps Engineer (100% Remote Germany)

Analytics Platform - Matomo 11-50 IT Services

Matomo is hiring a Senior Infrastructure DevOps Engineer to own and evolve the cloud platform behind its global SaaS analytics product, with responsibility for architecture, reliability, and operational excellence.

Germany Full-time Senior DevOps Engineer Site Reliability Engineer (SRE)

$76k-$86k

AWS CI/CD Kubernetes

4 hours, 12 minutes ago

Apply

4 hours, 12 minutes ago

Senior Site Reliability Engineer (SRE/DevOps)

qode Internet Software & Services

Senior DevOps Engineer at a company building secure, scalable cloud and AI platforms, focused on taking AI systems into production and improving reliability, observability, and operational excellence.

Vietnam Full-time Senior Site Reliability Engineer (SRE)

Argo CD AWS Azure CI/CD CloudFormation Flux GCP GitOps NestJS Node.js Pulumi Python Secrets Management Terraform

1 day, 4 hours ago

Apply

1 day, 4 hours ago

AI infrastructure Engineer (SRE) Bangalore

Together 1-10 IT Services

Together AI is hiring an AI Infrastructure Engineer (SRE) to keep its user-facing services and production systems reliable, scalable, and available as the company builds next-generation AI infrastructure.

India Senior AI Engineer Site Reliability Engineer (SRE)

Ansible Kubernetes Machine Learning PagerDuty Terraform

3 days, 3 hours ago

Apply

3 days, 3 hours ago

Algolia

Tags

Links

Senior Site Reliability Engineer, AI Research

Algolia

Description

Requirements

Benefits

Similar Roles

Lead Site Reliability Engineer

Senior Infrastructure DevOps Engineer (100% Remote Germany)

Senior Site Reliability Engineer (SRE/DevOps)

AI infrastructure Engineer (SRE) Bangalore

You're on a roll! Sign up now to keep applying.