Faire

Faire

Faire is an online wholesale marketplace connecting independent retailers with unique merchandise from around the world. With flexible payment terms, free returns, and personalized recommendations, Faire empowers small businesses to compete with larger...

Textiles, Apparel & Luxury Goods
1K-5K
Founded 2017
$1500M raised

Description

  • Design and operate ML infrastructure, including workspaces, clusters, jobs, and workflows.
  • Productionize ML workloads using Spark, Delta Lake, MLflow, and Databricks Workflows.
  • Teach data scientists how to move models from notebooks into production on the ML platform.
  • Implement Unity Catalog for data governance, lineage, access control, and secure multi-tenant usage.
  • Build CI/CD pipelines for machine learning using Terraform and Git-based workflows such as GitHub Actions.
  • Optimize performance, reliability, and cost across training and inference workloads.
  • Configure IAM and RBAC for sensitive datasets.
  • Establish observability for data quality, model performance, and platform health.
  • Build and maintain technical documentation for the ML platform.

Requirements

  • 8+ years of experience building production ML or data platforms.
  • A degree, preferably graduate level, in Computer Science, Engineering, Statistics, or a related technical field.
  • Strong hands-on expertise with Databricks, Spark, Delta Lake, and MLflow.
  • Proficiency in Python, SQL, and distributed systems concepts.
  • Experience with cloud platforms and infrastructure-as-code.
  • Solid understanding of MLOps best practices, including CI/CD, monitoring, reproducibility, and security.
  • Experience supporting multiple ML teams in a shared platform environment.
  • Experience with Kotlin, PyTorch, Kafka, Snowflake, Fivetran, Iceberg, Datadog, Airflow, Cockroach DB, or MySQL is preferred.
  • Experience with AWS, S3, SageMaker, Kubernetes, Docker, GitHub Actions, or Terraform is preferred.
  • Familiarity with generative AI tools such as Claude Sonnet 4.5 and ChatGPT 5.2 is listed in the tech stack.

Benefits

  • Salary range of $224,000 to $308,000 per year in San Francisco.
  • Eligibility for equity.
  • Hybrid work schedule with 3 days per week in the office.
  • Flexibility to work remotely up to 4 weeks per year in hybrid roles.
  • Reasonable accommodation support during the recruitment process.
  • Equal employment opportunity commitment.
  • Access to benefits, though specific plan details are not listed.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

ServiceNow Cloud Migration Lead (Senior Consultant - Platform Engineering)

Muller Internet Software & Services

Müller’s Solutions is hiring a ServiceNow Cloud Migration Lead to manage the migration of a self-hosted ServiceNow instance to ServiceNow on GCP, overseeing the project from assessment through go-live and stabilization.

DNS GCP REST API SAML SOAP
15 hours, 42 minutes ago

Senior Engineering Manager - Enablement

Honeycomb.io 51-250 Internet Software & Services

Honeycomb is seeking an Engineering Enablement leader to drive the developer experience, AI-assisted engineering workflows, and platform foundations that help the company ship faster and more safely as it scales.

CI/CD CircleCI GitHub Actions Go JavaScript OpenTelemetry TypeScript
15 hours, 42 minutes ago

Software Engineer - Machine Learning (Behaviors)

Motional 1K-5K Automotive

Motional’s Behaviors team is hiring an engineer to develop machine learning models that help autonomous vehicles understand and predict traffic behavior in complex real-world driving scenarios.

C++ Computer Vision Deep Learning Machine Learning Neural Networks Python PyTorch
1 day, 14 hours ago

Senior Platform Engineer / Senior DevOps Engineer / Senior Infrastructure Engineer / Senior Site Reliability Engineer

Anduril Industries 1K-5K Aerospace & Defense

Anduril Australia is hiring a senior infrastructure and reliability engineer to own a service or platform end to end across cloud and classified environments supporting defense programs.

Active Directory AWS Bash Go Kubernetes Python Terraform
1 day, 14 hours ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers