Senior Machine Learning Engineer

2 months ago
Full-time
Senior
Software Development
RUNWARE

RUNWARE provides an affordable API that enables AI developers to efficiently run image, video, and custom generative AI models without the need for extensive infrastructure or machine learning expertise.

Internet Software & Services
1-10 employees
Founded 2023

Description

  • Integrate open-source and third-party models into the inference platform.
  • Lead fine-tuning initiatives using LoRA, adapters, PEFT, and domain adaptation techniques.
  • Optimize inference workloads for latency, batching, memory efficiency, and throughput.
  • Benchmark model quality, cost, and performance across multiple modalities.
  • Improve inference startup times and stability under high load.
  • Build evaluation frameworks and internal tooling for model validation.
  • Collaborate with Infrastructure and Backend teams on scalable serving systems.
  • Monitor production performance and drive continuous optimization.
  • Mentor engineers and help raise ML engineering standards across the team.

Requirements

  • Proven experience delivering machine learning systems to production environments.
  • Strong low-level Python skills and deep hands-on experience with PyTorch.
  • Experience working with diffusion models, LLMs, or multimodal architectures.
  • Practical experience fine-tuning large models using LoRA, PEFT, adapters, or similar methods.
  • Experience optimizing inference workloads in GPU environments.
  • Strong understanding of model evaluation, experimentation, and monitoring.
  • Ability to debug performance, memory, and reliability issues in production.
  • Strong systems thinking and understanding of how ML decisions impact infrastructure.
  • High ownership and comfort operating in a fast-paced startup environment.
  • Experience with vLLM or custom inference servers is preferred.
  • Experience with Kubernetes or containerised ML workloads is preferred.
  • Experience working in high-throughput distributed systems is preferred.
  • Background in AI media generation, including image, video, or audio, is preferred.
  • Experience building internal ML tooling or developer-facing APIs is preferred.
  • Experience with CUDA/C++ kernels is preferred.

Benefits

  • Remote-first setup: work from home in any location where the company can employ you.
  • Flexible hours outside core collaboration blocks.
  • Generous paid time off, including vacation, sick days, and public holidays.
  • Meaningful stock options.
  • Paid family leave for maternity, paternity, and caregiver time.
  • Company retreats twice a year in inspiring locations.
