Senior Machine Learning Infrastructure Engineer

1 hour, 54 minutes ago
Full-time
Senior
DevOps and Infrastructure
Unity

Unity

Unity is the top platform for real-time 3D content creation, empowering creators across industries to bring their ideas to life with interactive 2D and 3D content.

Internet Software & Services
5K-10K
Founded 2004

Description

  • Design, build, and maintain infrastructure that serves machine learning models in real time across Unity's ads ecosystem.
  • Build and operate scalable model serving pipelines with ownership of latency, throughput, and reliability in a high-QPS production environment.
  • Partner with machine learning engineers to productionize models, manage deployments, and improve iteration speed.
  • Improve the observability, performance, and cost-efficiency of machine learning serving infrastructure.
  • Contribute to architectural decisions around feature serving, model versioning, and inference optimization.
  • Support reliable operation of systems that power ranking, bidding, and targeting for ads delivery.
  • Collaborate across teams in a remote-first environment to deliver ML systems at scale.

Requirements

  • Experience building and operating ML infrastructure or model serving systems in production.
  • Proficiency in Golang or Python, with strong systems engineering fundamentals.
  • Hands-on experience with Kubernetes and container orchestration at scale.
  • Familiarity with ML serving frameworks such as Ray Serve, Triton, TorchServe, or similar.
  • Understanding of distributed systems, API design, and system reliability.
  • Strong collaboration and communication skills in a remote-first environment.
  • Experience with feature stores, feature pipelines, or online/offline feature serving (preferred).
  • Background in ad tech, real-time bidding, or programmatic advertising systems (preferred).
  • Familiarity with infrastructure-as-code tools such as Terraform (preferred).
  • Experience with observability tooling such as Prometheus, Grafana, or OpenTelemetry (preferred).

Benefits

  • Gross pay salary range of $183,700 to $248,600 USD.
  • Comprehensive health, life, and disability insurance.
  • Commute subsidy.
  • Employee stock ownership.
  • Competitive retirement and pension plans.
  • Generous vacation and personal days.
  • Support for new parents through leave and family-care programs.
  • Mental health and wellbeing programs and support.
  • Training and development programs.
  • Volunteering and donation matching program.
  • Office food snacks.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Manager, Machine Learning Engineering (Underwriting)

Affirm 1K-5K Diversified Financial Services

Affirm is hiring a Machine Learning Engineering Manager to lead an underwriting ML team building decisioning systems that optimize application outcomes and support the company’s broader machine learning strategy.

Deep Learning Machine Learning Transformers
1 hour, 3 minutes ago

Systems & AI Cloud Architect

Endeavour. Inspired Infrastructure. 11-50 Electric Utilities

Endeavour is seeking a remote Systems & AI Cloud Architect to support its IT ecosystem by shaping enterprise architecture, modernizing infrastructure, and enabling scalable AI and cloud solutions for sustainable infrastructure initiatives.

AWS Azure CI/CD Cybersecurity GCP Generative AI Machine Learning Microservices MLOps
3 hours, 3 minutes ago

Senior Machine Learning Engineer, Personalization, Magenta

Spotify Media

Spotify is hiring a Senior Machine Learning Engineer on the Personalization team to build production conversational and agentic AI systems that improve how hundreds of millions of listeners discover and engage with audio.

LLM Machine Learning NLP
14 hours, 56 minutes ago

Machine Learning Engineer

Mindera 1K-5K Internet Software & Services

Mindera is seeking an ML Engineer to work with the ML Architect on machine learning frameworks and platform tooling that support scalable model development, deployment, and experimentation across business units.

Agile Apache Airflow CI/CD Docker Kubernetes MLflow MLOps PyTorch Scikit-learn TensorFlow
16 hours, 26 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers