Artera

Artera

Artera specializes in modernizing and enhancing critical infrastructure for energy utilities and municipalities, providing solutions that ensure the reliable distribution and transmission of natural gas and electric power across America.

Construction & Engineering
51-250

Description

  • Develop the long-term vision and roadmap for Artera’s AI platform to support scaling inference volume and development workloads.
  • Own ML compute infrastructure, including distributed training infrastructure and developer libraries for foundation model development.
  • Build and evolve core libraries used by AI scientists to develop, launch, and monitor AI products.
  • Collaborate with model developers to improve GPU and CPU efficiency and data throughput for large-scale training runs.
  • Optimize storage and serving of terabytes of digital pathology data for large-scale training workflows.
  • Maintain and improve observability infrastructure to identify opportunities to optimize model performance across the platform.
  • Work closely with AI model developers, machine learning engineers, and platform engineering to support production deployment of optimized models.

Requirements

  • 8+ years of industry software engineering experience.
  • 4+ years of experience using ML orchestration frameworks such as Flyte, Ray, Kubeflow, Metaflow, MLflow, Dagster, Argo Workflows, or Prefect.
  • 4+ years of experience using PyTorch, TensorFlow, or JAX in Python.
  • 3+ years of experience building with AWS, Docker, and Kubernetes.
  • 1+ years of experience optimizing large-scale, high-throughput distributed machine learning training pipelines.
  • Experience with Terraform and SqlAlchemy is preferred.
  • Experience with multi-node and multi-GPU training is preferred.
  • Experience deploying and maintaining infrastructure for machine learning training and production inference is preferred.
  • Familiarity with TorchScript, ONNXRuntime, DeepSpeed, AWS Neuron, or similar inference optimization approaches is preferred.
  • Must be currently authorized to work in the United States or Canada without visa sponsorship.

Benefits

  • Base salary of $180,000 to $220,000 per year.
  • Equity is a core component of the compensation package.
  • 401(k) matching.
  • Unlimited paid time off (PTO).
  • Remote role open to candidates authorized to work in the U.S. or Canada.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Associate AI/ML Engineer

66degrees 251-1K IT Services

66degrees is hiring an Associate in AI/ML to build and support Google Cloud–based Generative AI and LLM systems that turn business requirements into scalable, production-ready solutions.

GCP Generative AI LLM MLOps Python SQL Vertex AI
38 minutes ago

Software Engineer - Human Motion Data

Apptronik 51-250 Aerospace & Defense

Apptronik is hiring a Software Engineer for Human Motion Data to build motion-data pipelines that connect human demonstrations to reinforcement learning and support the development of its humanoid robot Apollo.

C++ Generative AI Python Reinforcement Learning Unity Unreal Engine
1 hour, 14 minutes ago

Senior Machine Learning Engineer, Conversion Modeling

Unity 5K-10K Internet Software & Services

Unity is hiring a Senior ML Engineer to build and improve large-scale ad ranking, recommendation, and bidding optimization systems that power Unity Ads.

C++ Go Machine Learning Python Reinforcement Learning Scala Statistics
1 hour, 29 minutes ago

Senior Staff Machine Learning Engineer, Infrastructure

Airbnb 5K-10K Hotels, Restaurants & Leisure

Airbnb is hiring a remote-eligible ML Infrastructure leader to build the data and AI/ML foundations that power generative AI, production models, and scalable machine learning systems across the company.

Apache Airflow Apache Spark C++ Computer Vision Deep Learning Feature Engineering Generative AI Hive Java Kafka Kubernetes Machine Learning Microservices Neural Networks NLP Python PyTorch Scala TensorFlow
1 hour, 29 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers