AHEAD

AHEAD accelerates the impact of technology on clients by engineering customized data, developer, and infrastructure platforms that improve IT operations. By weaving together cloud infrastructure, intelligent operations, and modern applications, we help...

IT Services
1K-5K
$43M raised

Description

  • Own deployment, runtime management, and operational governance across all layers of the Agentic Platform.
  • Design and implement infrastructure-as-code using Terraform or AWS CDK.
  • Build and maintain CI/CD pipelines using AWS CodePipeline, GitHub Actions, or GitLab CI.
  • Configure observability and monitoring for LLMs using CloudWatch and OpenTelemetry.
  • Implement and manage containerization and orchestration using Docker, ECS Fargate, or EKS.
  • Manage environment isolation and prompt/model versioning to support safe, reproducible models.
  • Track and manage platform costs and budgets using AWS Budgets, CloudWatch billing alarms, and cost governance practices.
  • Drive high reliability and cost-efficiency standards across the platform, including incident response and operational improvements.

Requirements

  • Deep AWS operational expertise (production experience operating AWS services).
  • Proven experience with container orchestration and containerization (Docker, ECS Fargate, EKS).
  • Strong observability skills and experience with CloudWatch and OpenTelemetry for monitoring LLMs or services.
  • Experience building IaC with Terraform or AWS CDK.
  • Experience implementing CI/CD pipelines with CodePipeline, GitHub Actions, or GitLab CI.
  • Demonstrated focus on reliability and cost-efficiency in platform operations.
  • Bachelor’s degree in Computer Science, Information Systems, or a related field.
  • AWS Solutions Architect Associate or Professional certification and Kubernetes/CNCF certifications (preferred).

Benefits

  • Salary range $170,000 - $200,000 per year (OTE includes base and target bonus).
  • Fully remote role within the United States.
  • Medical, dental, and vision insurance.
  • 401(k) retirement plan.
  • Paid company holidays, paid time off, and paid parental and caregiver leave.
  • Cross-department training, sponsored certifications, and professional development support.
  • Employee resource groups and diversity-focused programs (e.g., Moving Women AHEAD, RISE AHEAD) and access to a multi-million-dollar tech lab.
