Nebius

Nebius enables B2B companies to build local hyperscale cloud platforms with cost-effective GPUs, InfiniBand networking, and 50% lower compute costs. It offers managed Kubernetes and a launch-ready business model for innovative cloud solutions.

Industry: Internet Software & Services
Company size: 51-250 employees

Description

  • Design and implement LLM-based solutions using Nebius Token Factory’s inference services to support customer goals and business value.
  • Build production-ready applications using serverless LLM APIs for text, vision, audio, and domain-specific models.
  • Provide technical guidance on prompt engineering, RAG architectures, model selection, and inference optimization.
  • Collaborate with product and engineering teams to gather customer feedback and influence the platform roadmap.
  • Guide customers from proof of concept to production with attention to performance, reliability, and cost efficiency.
  • Work with the backend team to improve the platform based on client needs.
  • Architect scalable AI applications using served models.
  • Support customers using Nebius Token Factory across multiple modalities.

Requirements

  • 5+ years of experience in ML/AI systems, including at least 2 years focused on LLMs and generative AI.
  • Deep knowledge of the LLM ecosystem, including model architectures and fine-tuning approaches.
  • Hands-on experience with prompt engineering and LLM pipeline development, including evaluation.
  • Experience with agentic frameworks such as LangChain, LangSmith, smolagents, or equivalent.
  • Experience with vector databases and RAG implementation patterns.
  • Experience deploying LLM-powered applications using APIs from OpenAI, Anthropic, or open-source models.
  • Strong Python programming skills.
  • Excellent communication skills and the ability to explain technical concepts to diverse audiences.
  • Experience with inference frameworks and libraries such as vLLM, SGLang, TensorRT-LLM, or Transformers is preferred.
  • Familiarity with inference optimization techniques such as quantization, batching, caching, and routing is preferred.
  • Experience with multimodal AI models such as vision-language or speech is preferred.
  • Proficiency with DevOps tools such as Docker and Kubernetes is preferred.
  • Contributions to open-source ML/AI projects are preferred.

Benefits

  • Competitive salary and comprehensive benefits package.
  • Opportunities for professional growth within Nebius.
  • Flexible working arrangements, including remote work from Europe.
  • A dynamic and collaborative work environment that values initiative and innovation.
