Unity

Unity is the top platform for real-time 3D content creation, empowering creators across industries to bring their ideas to life with interactive 2D and 3D content.

Internet Software & Services

Information Technology

5K-10K (6748)

Founded 2004

125 open positions

Links

View All Jobs

Principal Machine Learning Engineer, Mobile AI Inference Optimization

2 months, 4 weeks ago

United States

Full-time

Lead

Machine Learning Engineer

Software Development

C++ Machine Learning Objective-C Python Swift Transformers Vulkan

Apply Now

Unity

Unity is the top platform for real-time 3D content creation, empowering creators across industries to bring their ideas to life with interactive 2D and 3D content.

Internet Software & Services

5K-10K

Founded 2004

View All Jobs 125

Description

Set the technical vision and roadmap for deploying multi-modal AI models to iOS and Android.
Make decisions on model compression, quantization, pruning, and knowledge distillation to meet mobile constraints.
Evaluate and adopt inference runtimes such as CoreML, ONNX Runtime Mobile, TFLite, and ExecuTorch.
Own the end-to-end optimization pipeline from model export through graph transformation and hardware-specific kernel tuning.
Collaborate with research scientists to translate new model architectures into deployable mobile implementations.
Design scalable multi-modal inference systems that handle images, text, primitives, and metadata with real-time performance.
Develop approaches for dynamic resolution, token reduction, and speculative decoding optimized for mobile devices.
Track and adopt advances in efficient diffusion and efficient attention methods.
Lead and mentor ML engineers while defining best practices, code review standards, and benchmarking methodology.
Partner with platform, product, and runtime teams to align ML capabilities with device constraints and roadmaps.

Requirements

8+ years in ML engineering, including at least 3 years focused on on-device or edge inference optimization.
Proven production deployment of transformer-based models and/or JAPE-style generative architectures on mobile or embedded hardware.
Hands-on experience with CoreML, TFLite, ONNX Runtime, and/or ExecuTorch.
Deep understanding of operator fusion, memory layout, and runtime scheduling.
Expert-level knowledge of INT8, INT4, and FP16 quantization, weight sharing, structured and unstructured pruning, and knowledge distillation.
Strong understanding of mobile SoC architectures including Apple Neural Engine, Qualcomm Hexagon/Adreno, and ARM Mali.
Proficiency in C++, Objective-C, or Swift for runtime integration, plus Python for tooling and export pipelines.
Ability to read, implement, and extend ML research papers, including efficient attention, diffusion samplers, and multi-modal fusion techniques.
Track record of technical leadership, cross-functional influence, and engineer development.
Experience shipping world-model or neural rendering pipelines such as NeRF or 3DGS on mobile, preferred.
Contributions to open-source ML inference frameworks or mobile ML research publications, preferred.
Familiarity with compiler stacks such as MLIR, TVM, or XLA for custom kernel generation, preferred.
Background in real-time graphics or game engine pipelines such as Metal, Vulkan, or OpenGL ES, preferred.
Strong English communication skills for frequent global collaboration.
International relocation support is not available for this position.

Benefits

Base salary range of $278,100 to $347,600 USD, depending on location and experience.
Comprehensive health, life, and disability insurance.
Employee stock ownership.
Competitive retirement or pension plans.
Generous vacation and personal days.
Support for new parents through leave and family-care programs.
Mental health and wellbeing programs and support.
Training and development programs.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

ML Infrastructure Engineer

x.ai 51-250 Internet Software & Services

SpaceXAI is hiring an ML Infrastructure Engineer to build and optimize the machine learning platform that powers recommendations on X.

United States Full-time Junior Infrastructure Engineer Machine Learning Engineer

$180k-$440k

Ansible C++ Linux Puppet Python PyTorch Rust

17 hours, 37 minutes ago

Apply

17 hours, 37 minutes ago

Senior Machine Learning Engineer, Safety

Reddit 1K-5K Internet Software & Services

Reddit is hiring a remote-friendly Machine Learning Engineer to build and improve safety systems that support enforcement of Reddit rules using large language models.

United States Full-time Senior Machine Learning Engineer

$217k-$303k

Deep Learning LLM Machine Learning NLP Python PyTorch TensorFlow

18 hours, 7 minutes ago

Apply

18 hours, 7 minutes ago

Senior, Machine Learning Engineer - 3D Perception

Torc 251-1K Road & Rail

Torc is hiring a Senior Machine Learning Engineer – 3D Perception to develop and deploy production perception models for autonomous trucks and improve Bird's Eye View understanding across its autonomy stack.

United States Full-time Senior Machine Learning Engineer

$177k-$213k

C++ Computer Vision Deep Learning Machine Learning Python PyTorch

18 hours, 22 minutes ago

Apply

18 hours, 22 minutes ago

Applied AI Engineer

Unframe Inc. 51-200 Technology, Information and Internet

Unframe is hiring an Applied AI Engineer to work with enterprise customers and internal teams on deploying AI-driven solutions that connect business problems to production systems.

Israel Mid Level AI Engineer Machine Learning Engineer

Machine Learning Python React TypeScript

1 day, 16 hours ago

Apply

1 day, 16 hours ago

Unity

Tags

Links

Principal Machine Learning Engineer, Mobile AI Inference Optimization

Unity

Description

Requirements

Benefits

Similar Roles

ML Infrastructure Engineer

Senior Machine Learning Engineer, Safety

Senior, Machine Learning Engineer - 3D Perception

Applied AI Engineer

You're on a roll! Sign up now to keep applying.