Senior Data Center Deployment Engineer

1 hour, 45 minutes ago
Full-time
Senior
DevOps and Infrastructure
Nebius

Nebius

Nebius enables B2B companies to build local hyperscaling cloud platforms with cost-effective GPUs, InfiniBand network, and 50% less compute cost. They offer managed Kubernetes and a launch-ready business model for innovative cloud solutions.

Internet Software & Services
51-250

Description

  • Lead end-to-end deployment of GB-series racks within data center environments.
  • Oversee installation, bring-up, validation, and production readiness of NVIDIA H200 and B200-based servers.
  • Troubleshoot hardware, firmware, Linux OS, and networking issues across the deployment stack.
  • Execute structured testing and validation procedures during deployment.
  • Develop and maintain basic Linux-based hardware health-check and diagnostic scripts.
  • Coordinate on-site hardware repairs, part replacements, and vendor escalations.
  • Drive root cause analysis and ensure corrective actions are implemented.
  • Manage and prioritize deployment timelines across multiple concurrent rollouts.
  • Provide technical leadership and guidance to on-site engineers and technicians.
  • Partner with networking and infrastructure teams to ensure seamless integration.
  • Document deployment processes, validation standards, and operational runbooks.

Requirements

  • Strong hands-on experience deploying and operating data center infrastructure.
  • Deep familiarity with GPU-dense systems, ideally NVIDIA H-series platforms.
  • Experience working with high-density rack deployments (GB-series or similar).
  • Solid Linux experience, including troubleshooting and scripting.
  • Ability to diagnose issues across hardware, OS, firmware, and network layers.
  • Experience coordinating field repairs and working directly with hardware vendors.
  • Proven experience leading technical teams or overseeing field operations.
  • High ownership mindset and ability to operate in production-critical environments.
  • Clear communication skills and ability to collaborate across distributed teams.
  • Experience deploying AI or HPC clusters at scale is a plus.
  • Familiarity with automated provisioning or infrastructure lifecycle systems is a plus.
  • Background in hardware qualification, burn-in testing, or factory validation is a plus.
  • Experience supporting rapid infrastructure expansion is a plus.
  • Exposure to ARM-based or heterogeneous compute environments is a plus.

Benefits

  • 100% company-paid medical, dental, and vision coverage for employees and families.
  • 401(k) plan with up to 4% company match and immediate vesting.
  • Paid parental leave: 20 weeks for primary caregivers and 12 weeks for secondary caregivers.
  • Remote work reimbursement of up to $85 per month for mobile and internet costs.
  • Company-paid short-term, long-term, and life insurance coverage.
  • Competitive salary ranging from $125k to $180k base plus quarterly performance bonuses.
  • Flexible working arrangements.
  • Opportunities for professional growth within Nebius.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Software Engineer - Maps Infrastructure

Applied Intuition 251-1K Internet Software & Services

Applied Intuition is hiring a senior engineer to own and evolve its HD maps infrastructure, supporting map-based products used for localization, querying, and visualization across autonomy and simulation applications.

C++ Python
15 minutes ago

Software Engineer - Developer Infrastructure

Applied Intuition 251-1K Internet Software & Services

Applied Intuition is hiring a Developer Frameworks engineer to build the internal libraries, frameworks, and build/CI infrastructure that help its engineering teams move faster and more reliably.

Ansible AWS Buildkite C++ CI/CD Docker GCP Go gRPC Kubernetes Linux Python SQLAlchemy Terraform TypeScript
2 hours, 15 minutes ago

Senior Infrastructure Engineer

Descript 51-250 Internet Software & Services

Descript is hiring an Infrastructure Engineer to improve the reliability, performance, and core production infrastructure behind its AI-powered audio and video editing platform.

GCP Kubernetes Linux Terraform TypeScript
3 hours ago

Distinguished Engineer / Technical Fellow

Armada 201-500 information technology & services

Armada is hiring a Distinguished Engineer / Technical Fellow to shape its long-term architecture for edge, AI, and emerging space-based infrastructure.

C++ Computer Vision Go Microservices Rust Serverless
3 hours, 15 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers