Senior ML Solutions Architect - Token Factory

1 hour, 4 minutes ago
Full-time
Senior
Artificial Intelligence and Machine Learning
Nebius

Nebius

Nebius enables B2B companies to build local hyperscaling cloud platforms with cost-effective GPUs, InfiniBand network, and 50% less compute cost. They offer managed Kubernetes and a launch-ready business model for innovative cloud solutions.

Internet Software & Services
51-250

Description

  • Design and implement LLM-based solutions using Nebius Token Factory inference services to support customer goals.
  • Build production-ready applications using serverless LLM APIs, including multimodal and domain-specific models.
  • Provide technical expertise in prompt engineering, RAG architectures, model selection, and inference optimization.
  • Collaborate with product and engineering teams to collect customer feedback and influence the platform roadmap.
  • Guide customers from proof of concept to production with an emphasis on performance, reliability, and cost efficiency.

Requirements

  • 5+ years of experience in ML/AI systems, including at least 2 years focused on LLMs and generative AI.
  • Deep knowledge of the LLM ecosystem, including model architectures and fine-tuning approaches.
  • Hands-on experience with prompt engineering and LLM pipeline development, including evaluation.
  • Experience with agentic frameworks such as Langchain, Langsmith, smolagents, or equivalent.
  • Experience with vector databases and RAG implementation patterns.
  • Experience deploying LLM-powered applications using APIs from OpenAI, Anthropic, or open-source models.
  • Strong Python programming skills.
  • Excellent communication skills with the ability to explain technical concepts to diverse audiences.
  • Experience with inference frameworks and libraries such as vLLM, SGLang, TensorRT-LLM, or Transformers (bonus).
  • Familiarity with inference optimization techniques such as quantization, batching, caching, and routing (bonus).
  • Experience working with multimodal AI models such as vision-language or speech models (bonus).
  • Proficiency with DevOps tools such as Docker and Kubernetes (bonus).
  • Contributions to open-source ML/AI projects (bonus).

Benefits

  • Competitive salary and comprehensive benefits package.
  • Opportunities for professional growth within Nebius.
  • Flexible working arrangements.
  • A dynamic and collaborative work environment that values initiative and innovation.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Solutions Engineer, Growth

AcuityMD 51-250 Health Care Providers & Services

AcuityMD is seeking a Solutions Engineer, Growth to support its medical technology software and data platform by driving the sales process, delivering tailored product demos, and helping customers adopt solutions that improve access to advanced medical devices.

CRM
19 minutes ago

Senior Forward Deployed AI Engineer (Remote Eligible in the UK)

Smartsheet 1K-5K Internet Software & Services

Smartsheet is hiring a senior Field Deployment Engineer to lead complex AI deployments from discovery through production handoff, building reusable deployment assets and enabling consultants and partners to deliver customer-specific AI solutions.

Databricks JavaScript LLM Python TypeScript
19 minutes ago

Principal Solutions Engineer

Wiz 251-1K IT Services

Wiz is hiring a Principal Solutions Engineer to partner with sales, product, and technical leadership on cloud security engagements and help shape how the platform is positioned and delivered to prospects and customers.

AWS Azure CI/CD Cybersecurity GCP Git Helm Kubernetes Terraform
34 minutes ago

Senior Solutions Engineer, Canada

Wiz 251-1K IT Services

Wiz is seeking a Senior Solutions Engineer to support enterprise customers and partner with regional account executives to advance cloud security adoption across AWS, Azure, and GCP.

AWS Azure CI/CD GCP
45 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers