Senior Software Engineer, Observability

2 weeks, 5 days ago
Full-time
Senior
Software Development
Nebius

Nebius

Nebius enables B2B companies to build local hyperscaling cloud platforms with cost-effective GPUs, InfiniBand network, and 50% less compute cost. They offer managed Kubernetes and a launch-ready business model for innovative cloud solutions.

Internet Software & Services
51-250

Description

  • Design and build services and agents that provide visibility into large-scale server fleets and data center engineering systems.
  • Evolve metrics, aggregation, and alerting pipelines with a focus on signal quality and reliability.
  • Design and operate maintenance and remediation systems that enable safe, predictable fleet-wide changes and keep infrastructure healthy.
  • Investigate production incidents hands-on, including on-host Linux debugging, and drive root-cause fixes.
  • Collaborate closely with hardware, networking, and data center operations teams to improve reliability.
  • Own critical backend services that power infrastructure monitoring and maintenance.
  • Improve the operation of production systems at scale.

Requirements

  • 5+ years of professional software engineering experience.
  • Strong production experience with Python and Go, or the ability to ramp up quickly.
  • Solid Linux fundamentals and comfort debugging live systems.
  • Ability to write reliable, maintainable code and work through complex, ambiguous problems.
  • Experience building and operating production systems at scale.
  • Ubuntu experience, including internal tooling and packaging workflows such as building Debian packages, is a plus.
  • CCNA certification or equivalent networking experience is a plus.

Benefits

  • Competitive salary of $130k-$170k base plus quarterly performance bonuses.
  • 100% company-paid medical, dental, and vision coverage for employees and families.
  • 401(k) plan with up to 4% company match and immediate vesting.
  • 20 weeks of paid parental leave for primary caregivers and 12 weeks for secondary caregivers.
  • Up to $85 per month remote work reimbursement for mobile and internet expenses.
  • Company-paid short-term, long-term, and life insurance coverage.
  • Flexible working arrangements.
  • Opportunities for professional growth within Nebius.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Intermediate Software Engineer - Artificial Intelligence (AI)

Tucows 251-1K Diversified Telecommunication Services

Tucows Domains is hiring a remote Intermediate Software Engineer specializing in Artificial Intelligence to help build AI-powered systems for domain services and related tools.

Go Hugging Face LLM Machine Learning Python REST API TensorFlow
30 minutes ago

Senior Software Engineer, Windows/Desktop Applications - Ottawa, Canada

Speechify 51-250 Internet Software & Services

Speechify is hiring a Windows desktop engineer to lead the architecture, development, and accessibility of its audio-based reading products for millions of users.

C# C++ CI/CD .NET
45 minutes ago

Software Engineer, Platform - Reading, United Kingdom

Speechify 51-250 Internet Software & Services

Speechify is hiring a Platform engineer to build and maintain backend services and APIs that support its text-to-speech products and enterprise integrations in a fully distributed environment.

Android AWS Azure Docker GCP iOS Kubernetes macOS Microservices Node.js REST API TypeScript
53 minutes ago

Software Engineer, Data Infrastructure & Acquisition - Charlotte, NC, USA

Speechify 51-250 Internet Software & Services

Speechify is hiring a Software Engineer for its AI data team to build and operate the data collection and ingestion infrastructure that powers model training for its text-to-speech products.

Bash Docker GCP Linux Python Terraform
58 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers