NetSRE

3 weeks, 3 days ago
Mid Level
Software Development
Nebius

Nebius

Nebius enables B2B companies to build local hyperscaling cloud platforms with cost-effective GPUs, InfiniBand network, and 50% less compute cost. They offer managed Kubernetes and a launch-ready business model for innovative cloud solutions.

Internet Software & Services
51-250

Description

  • Ensure fault tolerance, scalability, and uninterrupted operation of infrastructure services.
  • Use modern technologies to solve infrastructure and operational problems.
  • Implement and improve CI/CD processes.
  • Support systems used for functional and load testing.
  • Monitor engineering equipment in data centers, including power supply, air cooling, and water cooling systems.
  • Monitor IT equipment such as racks, servers, JBODs, JBOGs, power shelves, and network devices.
  • Track assets and hardware repair tasks.
  • Support server production activities.

Requirements

  • Proficiency in Linux systems.
  • Strong Python and Bash scripting skills for automation.
  • Demonstrated ability to troubleshoot complex hardware, software, and networking issues.
  • Strong analytical and problem-solving skills focused on system performance optimization.
  • Working proficiency in English.
  • Experience designing, developing, and running high-load distributed systems is a plus.
  • Interest in backend development is a plus.
  • Applicants must be authorized to work in the country where they apply and provide proof of employment eligibility.

Benefits

  • Competitive compensation.
  • Career growth and learning opportunities.
  • Flexibility and work-life balance.
  • Collaborative and innovative culture.
  • Opportunity to work on impactful AI projects.
  • International environment with talented teams.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

NoSQL Database Engineer II

LivePerson 1K-5K Internet Software & Services

LivePerson is hiring a NoSQL Database Engineer (L2) in India to support production reliability and platform engineering for large-scale NoSQL systems and cloud infrastructure.

Bash Cassandra Couchbase GCP Go Grafana Prometheus Python Redis Terraform
5 hours, 38 minutes ago

Sr. Production Engineer, Solutions Engineering

Pinterest 5K-10K Internet Software & Services

Pinterest is hiring a Senior Production Engineer on Solutions Engineering to design AI-driven reliability and automation systems that improve the operation of large-scale distributed infrastructure serving hundreds of millions of users.

Ansible AWS Azure Chef Docker Envoy GCP Go Hadoop Kafka Kubernetes Linux MySQL Puppet Python Terraform Unix
5 hours, 38 minutes ago

Senior Network Site Reliability Engineer

Miro 1K-5K Internet Software & Services

Miro is hiring a Senior Network Site Reliability Engineer to strengthen the reliability, availability, and scalability of its AWS-based production infrastructure.

Agile AWS Azure Bash CI/CD DNS EC2 GCP GitHub GitLab Kubernetes Linux Python TCP/IP Terraform
5 hours, 53 minutes ago

Sênior Site Reliability Engineer - Network

Harford County Public Library 51-250 Diversified Consumer Services

Stone Tech, da Stone Co., busca um Senior Site Reliability Engineer - Network para liderar projetos críticos de infraestrutura de redes e evoluir a arquitetura global de conectividade do grupo.

Ansible API Gateway AWS Azure Cisco Datadog Fortinet GCP Kong Palo Alto Prometheus SIEM Splunk Terraform Zabbix
6 hours, 8 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers