Later

Later

Later is a top social media management and influencer platform that simplifies visual content marketing for Instagram, Facebook, Twitter, and Pinterest. With over 2 million users globally, including renowned brands like Yelp and The Huffington Post, La...

Media
51-250
Founded 2014

Description

  • Define and own the long-term ML infrastructure roadmap for current experimentation needs and future AI initiatives.
  • Establish best practices for model lifecycle management, deployment standards, monitoring, and governance.
  • Identify infrastructure gaps and design scalable solutions to support high-velocity ML development.
  • Contribute to cross-functional technical planning so ML systems align with product and platform strategy.
  • Design, build, and maintain production-grade model deployment and inference systems using CI/CD, Docker, and API frameworks such as Flask.
  • Automate end-to-end ML lifecycle workflows, including training pipelines, model validation, registry management, deployment, and rollback strategies.
  • Implement monitoring for model performance, latency, drift detection, and infrastructure health using tools such as CloudWatch, Prometheus, and Grafana.
  • Operate across AWS and GCP environments to manage training and inference workloads, including GPU-based infrastructure and BigQuery datasets.
  • Develop and maintain infrastructure-as-code using Terraform and CloudFormation for scalable, repeatable, secure cloud environments.
  • Partner with Data Scientists, Analysts, Platform Engineers, and Product Engineers to translate experimentation needs into production-ready infrastructure.

Requirements

  • 4+ years of experience in ML Ops, ML infrastructure, backend engineering, or related roles supporting production ML systems.
  • Experience working in cloud-native environments, especially AWS and/or GCP, with hands-on deployment of ML workloads.
  • Proven track record designing and implementing CI/CD pipelines for ML systems.
  • Strong experience with Amazon SageMaker, Docker, Flask-based APIs, and infrastructure automation tools.
  • Hands-on experience with ML lifecycle tooling such as MLflow, SageMaker Studio, or Weights & Biases.
  • Experience managing container orchestration platforms such as Kubernetes, EKS, or GKE.
  • Strong programming experience in Python; additional experience in Go, Java, or Scala is a plus.
  • Experience with infrastructure-as-code tools such as Terraform or CloudFormation.
  • Familiarity with observability tools such as CloudWatch, Prometheus, Grafana, Datadog, or centralized logging platforms.
  • Experience managing GPU-based workloads and scaling training/inference systems.
  • Familiarity with data infrastructure tools such as BigQuery and cloud-native data pipelines.
  • Bonus: Experience supporting LLMs or generative AI pipelines, distributed training systems, feature stores such as Feast, real-time inference systems, or ML governance frameworks.
  • A mindset focused on automation, reliability, performance, and continuous improvement in fast-scaling environments.

Benefits

  • Salary range of $145,000 to $165,000.
  • Market-based and data-driven compensation approach with biannual compensation reviews.
  • Permanent team members are eligible to participate in benefits plans as part of their overall compensation package.
  • Flexible location policy, with fully remote candidates considered for select positions.
  • Offices available in Boston, Vancouver, Chicago, and Vancouver, WA.
  • Equal opportunity employer with an inclusion-first culture.
  • Accommodations and support available during the recruitment process.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Senior Machine Learning Engineer, Data & Intelligence Products

AcuityMD 51-250 Health Care Providers & Services

AcuityMD is hiring a Senior Machine Learning Engineer to turn messy healthcare data into predictive models and intelligence products that help MedTech companies understand product usage and improve patient access.

Feature Engineering Machine Learning Python SQL
32 minutes ago

Director, Machine Learning Engineering - Surfaces Foundation

Spotify Media

Spotify is hiring a Director of Machine Learning Engineering to lead the Surfaces Foundation team in building the platform systems that power personalized recommendations across its consumer audio experience.

Machine Learning
4 hours, 42 minutes ago

Machine Learning Principal Solutions Architect

phData 251-1K IT Services

phData is hiring a Principal Solutions Architect to lead delivery of AI/ML solutions for enterprise clients while also driving strategic account growth and client engagement.

AWS Azure Databricks dbt Django Docker Flask GCP Java Keras Kubernetes Machine Learning MLflow Python SageMaker Scala Scikit-learn Snowflake Spring TensorFlow Vertex AI
8 hours, 26 minutes ago

Senior AI/ML Engineer (LLM, GenAI, and Agentic Systems)

Astro Sirens / Astro Sirens Staffing and Consulting IT services, staffing, and consulting

Astro Sirens is hiring a Senior AI/ML Engineer to design and deploy advanced AI solutions for U.S. company projects, with a focus on modern large language models, generative AI, and intelligent agent systems.

Apache Spark AWS Azure CI/CD Deep Learning Docker GCP Generative AI Hugging Face Kubernetes Machine Learning Microservices MLOps Python SQL
8 hours, 57 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers