Senior Software Developer: Models Team (Token Factory)

1 hour, 3 minutes ago
Full-time
Senior
Software Development
Nebius

Nebius enables B2B companies to build local hyperscaling cloud platforms with cost-effective GPUs, InfiniBand networking, and 50% lower compute costs. It offers managed Kubernetes and a launch-ready business model for innovative cloud solutions.

Internet Software & Services
51-250

Description

  • Onboard state-of-the-art open-source models into Nebius Token Factory.
  • Serve large language models in production with strong reliability and scale.
  • Design and operate highly scalable, highly available distributed services.
  • Build and improve advanced inference and systems optimization techniques.
  • Maintain and extend forks of inference frameworks such as vLLM and TRT-LLM.
  • Support Solutions Architects on demanding customer proof-of-concepts.
  • Develop tooling and automation for testing, rollout, diagnostics, observability, and deployment optimization.
  • Collaborate with model builders, open-source communities, Nebius Cloud teams, and hardware vendors.
  • Work closely with teams focused on cloud infrastructure, observability, reliability, fault tolerance, and platform engineering.

Requirements

  • Experience serving LLMs in production.
  • Strong programming skills in Python and/or Go.
  • Experience designing and operating highly scalable, highly available distributed services.
  • Contributions to vLLM, SGLang, TRT-LLM, or NVIDIA ecosystem open-source projects are preferred.
  • Deep understanding of KV cache management, speculative decoding, and quantization is preferred.
  • Experience with LLM evaluation frameworks is preferred.
  • Hands-on experience with performance benchmarking and optimization is preferred.
  • Deep understanding of Kubernetes is preferred.
  • Familiarity with distributed serving architectures and autoscaling is preferred.
  • Knowledge of InfiniBand, RoCE, or high-performance networking is preferred.
  • Applicants must be authorized to work in the country where they apply and provide proof of employment eligibility.

Benefits

  • Competitive compensation.
  • Career growth and learning opportunities.
  • Flexibility and ownership in how work is done.
  • Collaborative and innovative team culture.
  • Opportunity to work on impactful AI projects.
  • International environment with talented teams.
  • Remote-friendly collaboration with periodic in-person team meetups.
  • Equal opportunity employer with inclusive hiring practices.
