Faire

Faire is an online wholesale marketplace connecting independent retailers with unique merchandise from around the world. With flexible payment terms, free returns, and personalized recommendations, Faire empowers small businesses to compete with larger...

Textiles, Apparel & Luxury Goods
1K-5K
Founded 2017
$1.5B raised

Description

  • Design and operate ML infrastructure, including workspaces, clusters, jobs, and workflows.
  • Productionize ML workloads using Spark, Delta Lake, MLflow, and Databricks Workflows.
  • Teach data scientists how to move models from notebooks into production on the ML platform.
  • Implement Unity Catalog for data governance, lineage, access control, and secure multi-tenant usage.
  • Build CI/CD pipelines for machine learning using Terraform and Git-based workflows such as GitHub Actions.
  • Optimize performance, reliability, and cost across training and inference workloads.
  • Configure IAM and RBAC for sensitive datasets.
  • Establish observability for data quality, model performance, and platform health.
  • Build and maintain technical documentation for the ML platform.

Requirements

  • 8+ years of experience building production ML or data platforms.
  • A degree, preferably graduate level, in Computer Science, Engineering, Statistics, or a related technical field.
  • Strong hands-on expertise with Databricks, Spark, Delta Lake, and MLflow.
  • Proficiency in Python, SQL, and distributed systems concepts.
  • Experience with cloud platforms and infrastructure-as-code.
  • Solid understanding of MLOps best practices, including CI/CD, monitoring, reproducibility, and security.
  • Experience supporting multiple ML teams in a shared platform environment.
  • Experience with Kotlin, PyTorch, Kafka, Snowflake, Fivetran, Iceberg, Datadog, Airflow, CockroachDB, or MySQL is preferred.
  • Experience with AWS, S3, SageMaker, Kubernetes, Docker, GitHub Actions, or Terraform is preferred.
  • Familiarity with the generative AI tools listed in the tech stack, such as Claude Sonnet 4.5 and ChatGPT 5.2.

Benefits

  • Salary range of $224,000 to $308,000 per year in San Francisco.
  • Eligibility for equity.
  • Hybrid work schedule with 3 days per week in the office.
  • Flexibility to work remotely for up to 4 weeks per year in hybrid roles.
  • Reasonable accommodation support during the recruitment process.
  • Equal employment opportunity commitment.
  • Access to benefits, though specific plan details are not listed.
