Senior ML Solutions Architect - Token Factory

1 hour, 41 minutes ago
Full-time
Senior
Artificial Intelligence and Machine Learning
Nebius

Nebius

Nebius enables B2B companies to build local hyperscaling cloud platforms with cost-effective GPUs, InfiniBand network, and 50% less compute cost. They offer managed Kubernetes and a launch-ready business model for innovative cloud solutions.

Internet Software & Services
51-250

Description

  • Design and implement LLM-based solutions using Nebius Token Factory’s inference services to support customer goals.
  • Build production-ready applications with serverless LLM APIs, including multimodal and domain-specific models.
  • Provide technical guidance on prompt engineering, RAG architectures, model selection, and inference optimization.
  • Collaborate with product and engineering teams to communicate customer feedback and influence the platform roadmap.
  • Guide customers from proof of concept to production with attention to performance, reliability, and cost efficiency.
  • Work with the backend team to improve the platform based on client needs.

Requirements

  • 5+ years of experience in ML/AI systems, including at least 2 years focused on LLMs and generative AI.
  • Deep knowledge of the LLM ecosystem, including model architectures and fine-tuning approaches.
  • Hands-on experience with prompt engineering and LLM pipeline development, including evaluation.
  • Experience with agentic frameworks such as Langchain, Langsmith, smolagents, or equivalent.
  • Experience with vector databases and RAG implementation patterns.
  • Experience deploying LLM-powered applications using APIs from OpenAI, Anthropic, or open-source models.
  • Strong Python programming skills.
  • Excellent communication skills with the ability to explain technical concepts to diverse audiences.
  • Experience with inference frameworks and libraries such as vLLM, SGLang, TensorRT-LLM, or Transformers (preferred).
  • Familiarity with inference optimization techniques such as quantization, batching, caching, and routing (preferred).
  • Experience with multimodal AI models such as vision-language or speech (preferred).
  • Proficiency with DevOps tools such as Docker and Kubernetes (preferred).
  • Contributions to open-source ML/AI projects (preferred).

Benefits

  • Competitive salary of $225k-$315k OTE, based on experience, skills, and location.
  • Equity compensation.
  • 100% company-paid medical, dental, and vision coverage for employees and families.
  • 401(k) plan with up to 4% company match and immediate vesting.
  • 20 weeks of paid parental leave for primary caregivers and 12 weeks for secondary caregivers.
  • Up to $85/month remote work reimbursement for mobile and internet.
  • Company-paid short-term, long-term, and life insurance coverage.
  • Flexible working arrangements and opportunities for professional growth.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Senior Associate, AI Solutions & Analytics - Battery Storage

Plus Power 51-250 Electric Utilities

Plus Power is seeking a Senior Associate on its Business Operations & Strategic Analytics team to help develop and operationalize applied AI and analytics solutions that support decision-making across its energy storage business.

Power BI Python SQL Tableau
11 minutes ago

Senior Director of AI, R&D & Agentic Systems

Learneo 51-250 Diversified Consumer Services

QuillBot, part of Learneo, is hiring a Senior Director of AI to lead the evolution of its AI-powered writing platform into a unified, agent-driven creative system for millions of users.

Computer Vision Machine Learning MLOps NLP
26 minutes ago

Senior Data Scientist II - AI for Analytics

instacart.careers 1K-5K Internet Software & Services

Instacart is seeking a Data Science leader to drive AI for Analytics initiatives that accelerate product decision-making and feature launches through production AI systems and cross-functional collaboration.

Flask Git Machine Learning Pandas Python PyTorch Scikit-learn Snowflake SQL
56 minutes ago

Salesforce Experience Architect, AI and Agentforce (Solution Architect)

NeuraFlash 251-1K IT Services

NeuraFlash, Part of Accenture is hiring an Experience Architect, AI & Agentforce to design and deliver Agentforce-powered customer experiences for clients using Salesforce and generative AI technologies.

CRM Generative AI Salesforce
1 hour, 11 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers