Senior DevOps Engineer (Cloud & ML Infrastructure)

2 weeks, 6 days ago
Full-time
Senior
DevOps and Infrastructure
Kpler

Kpler

Kpler provides a comprehensive platform for global trade intelligence, offering real-time data and analytics that empower businesses to plan, grow, and operate sustainably across various commodities and markets.

Professional Services
251-1K
Founded 2014

Description

  • Design, operate, and improve cloud-native infrastructure across Kubernetes, networking, compute, and storage.
  • Contribute to Infrastructure as Code, CI/CD pipelines, and platform automation.
  • Ensure high availability, reliability, and security of production systems.
  • Improve observability, monitoring, alerting, and incident response processes.
  • Reduce MTTR and failure rates through structured reliability improvements.
  • Optimize infrastructure cost and performance for compute-intensive workloads.
  • Support and help standardize ML and GPU-based workloads within the platform model.
  • Collaborate with ML engineers, data engineers, and backend teams on production-grade deployments.
  • Contribute to architectural decisions that shape the platform’s evolution.

Requirements

  • 5+ years of experience in cloud/platform engineering in production environments.
  • Strong hands-on experience with Kubernetes in production.
  • Experience with Infrastructure as Code, with Terraform preferred.
  • Strong knowledge of AWS or an equivalent cloud provider.
  • Experience operating distributed systems in 24/7 environments.
  • Strong operational mindset with experience in SLOs, monitoring, and incident management.
  • Proven experience running ML/AI workloads in production is desirable.
  • Experience with GPU-based workloads is desirable.
  • Exposure to LLM-based or other compute-intensive systems is desirable.
  • Solid programming skills in Python or Go are preferred.
  • Bachelor’s or Master’s degree in Computer Science, Engineering, or equivalent practical experience.

Benefits

  • Full-time remote work.
  • Opportunity to work on cloud, data, and ML infrastructure at a global company.
  • Inclusive and diverse work environment.
  • Supportive, collaborative team culture.
  • Equal opportunity employer with a commitment to diverse backgrounds and perspectives.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Lead DevOps Engineer - Developer Productivity

HighLevel 251-1K Internet Software & Services

HighLevel is hiring a Lead DevOps Engineer for its Developer Productivity platform team in India to improve CI/CD reliability, developer workflow efficiency, and key delivery metrics across a large-scale remote-first system.

Bash CI/CD Docker GitHub Actions Groovy Jenkins Kubernetes Node.js Python SonarQube
2 hours, 22 minutes ago

Senior DevOps Engineer, APJ

Arize AI 51-250 IT Services

Arize AI is hiring an On-Prem engineer to support deployment and operations of its AI observability platform for customer environments across SaaS and on-prem offerings, with a focus on APJ accounts based in Malaysia.

AWS Azure GCP Kubernetes
7 hours, 21 minutes ago

DevOps Engineer

Xolo 51-250 Diversified Financial Services

Xolo is hiring a remote DevOps Engineer to build and maintain infrastructure, automation, and deployment systems that support scalable product delivery and reliable operations across development through production.

Ansible AWS GitHub Actions Java Jenkins Kubernetes Maven Prometheus Terraform
11 hours, 48 minutes ago

Backend Ops Engineer Role

Weekday 11-50 Construction & Engineering

Weekday’s client is hiring a remote DevOps / Site Reliability Engineer in India to own cloud infrastructure and platform operations for a fast-scaling, AI-first environment.

AWS Azure CI/CD Docker GCP GitHub Actions Grafana LLM OpenTelemetry Prometheus Terraform
13 hours, 2 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers