Senior Machine Learning Engineer

3 days, 1 hour ago
Full-time
Senior
Software Development
RUNWARE

RUNWARE

RUNWARE provides an affordable API that enables AI developers to efficiently run image, video, and custom generative AI models without the need for extensive infrastructure or machine learning expertise.

Internet Software & Services
1-10
Founded 2023

Description

  • Integrate open-source and third-party models into the inference platform.
  • Lead fine-tuning initiatives using LoRA, adapters, PEFT, and domain adaptation techniques.
  • Optimize inference workloads for latency, batching, memory efficiency, and throughput.
  • Benchmark model quality, cost, and performance across multiple modalities.
  • Improve inference startup times and stability under high load.
  • Build evaluation frameworks and internal tooling for model validation.
  • Collaborate with Infrastructure and Backend teams on scalable serving systems.
  • Monitor production performance and drive continuous optimization.
  • Mentor engineers and help raise the ML engineering standards across the team.

Requirements

  • Proven experience delivering machine learning systems to production environments.
  • Strong low-level Python skills and deep hands-on experience with PyTorch.
  • Experience working with diffusion models, LLMs, or multimodal architectures.
  • Practical experience fine-tuning large models using LoRA, PEFT, adapters, or similar methods.
  • Experience optimizing inference workloads in GPU environments.
  • Strong understanding of model evaluation, experimentation, and monitoring.
  • Ability to debug performance, memory, and reliability issues in production.
  • Strong systems thinking and understanding of how ML decisions impact infrastructure.
  • High ownership and comfort operating in a fast-paced startup environment.
  • Experience with vLLM or custom inference servers is preferred.
  • Experience with Kubernetes or containerised ML workloads is preferred.
  • Experience working in high-throughput distributed systems is preferred.
  • Background in AI media generation, including image, video, or audio, is preferred.
  • Experience building internal ML tooling or developer-facing APIs is preferred.
  • Experience with CUDA/C++ kernels is preferred.

Benefits

  • Remote-first setup with the ability to work from home anywhere the company can employ you.
  • Flexible hours outside core collaboration blocks.
  • Generous paid time off, including vacation, sick days, and public holidays.
  • Meaningful stock options.
  • Paid family leave for maternity, paternity, and caregiver time.
  • Company retreats twice a year in inspiring locations.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Staff Machine Learning Engineer, AI Researcher

Cribl 251-1K IT Services

Cribl is hiring a remote-first machine learning engineer to help build AI-enabled security and observability products that solve real customer problems.

Computer Vision Feature Engineering Kubeflow Machine Learning MLflow MLOps NLP Python PyTorch Reinforcement Learning TensorFlow
1 day, 10 hours ago

Staff Machine Learning Engineer - Platform (Core AI Automation)

Coinbase 1K-5K Capital Markets

Coinbase is hiring a Machine Learning Engineer for its Core Automation Team to build AI infrastructure and automation that improve customer support, compliance operations, and AI-powered customer interactions on its onchain platform.

Apache Airflow Apache Spark Blockchain Computer Vision Databricks Deep Learning Flink Generative AI Kafka LLM Machine Learning NLP Python Snowflake
1 day, 10 hours ago

Software Engineer - ML Platform

Veriff 51-250 IT Services

Veriff’s ML Platform team is hiring a software or ML engineer to build the systems that support machine learning development, experimentation, observability, and scalable model deployment.

Apache Spark dbt Grafana Kubeflow MLflow MLOps Prometheus Python Snowflake SQL
1 day, 10 hours ago

Staff ML Engineer - ML Infrastructure

Samsara 1K-5K IT Services

Samsara is hiring a Staff / Senior Staff Machine Learning Infrastructure Engineer in Canada to lead the end-to-end ML platform for Safety AI and adjacent product areas that improve real-world operational safety.

Apache Spark AWS Computer Vision Embedded Systems IoT Kubernetes LLM Machine Learning
1 day, 11 hours ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers