Engineer - HPC Platform

2 months, 3 weeks ago
Full-time
Senior
DevOps and Infrastructure
Xenon7

Xenon7

Xenon7 provides advanced AI solutions and consultancy services, leveraging a team of highly qualified experts and a strong emphasis on research and innovation to address complex industry challenges and enhance operational efficiency.

Internet Software & Services
Founded 2014

Description

  • Design, build, and maintain scalable HPC platforms and cluster architectures.
  • Lead engineering and operations for HPC infrastructure, ensuring availability and performance for scientific workloads.
  • Collaborate with researchers and scientists to optimize performance and streamline computational workflows.
  • Automate orchestration, resource scheduling, data access, and reproducibility using tooling and automation.
  • Evolve and operate both public cloud and on-premises environments for HPC use cases.
  • Define, monitor, and report infrastructure metrics and resource utilization to drive platform improvements.
  • Advance initiatives that enable critical business projects and identify opportunities to accelerate the HPC roadmap.
  • Apply agile ways of working to deploy and operate HPC solutions at scale.

Requirements

  • Bachelor’s degree in Computer Science, Information Technology, or a related technical field.
  • 5+ years of experience as an HPC Platform Engineer.
  • Demonstrated experience leading a global large-scale infrastructure project.
  • Hands-on experience with HPC platforms, including accelerators (e.g., GPUs) and HPC schedulers (e.g., Altair Grid Engine, Slurm).
  • Experience with Kubernetes platforms and container technologies (Docker, Apptainer).
  • Demonstrated experience with HPC workloads, infrastructure, and cluster architectures.
  • Expertise with the Linux command line, Linux troubleshooting, and HPC administration.
  • Experience with DevOps and infrastructure-as-code tools such as GitHub, Chef, Ansible, and Terraform.
  • Experience automating infrastructure and applications and strong programming/scripting skills in Python or Bash.
  • Continuous learning mindset and willingness to stay current with new HPC technologies and infrastructure trends.

Benefits

  • Attractive, market-leading salary package.
  • Clear career advancement path.
  • Professional development opportunities and support for learning new HPC technologies.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

ServiceNow Cloud Migration Lead (Senior Consultant - Platform Engineering)

Muller Internet Software & Services

Müller’s Solutions is hiring a ServiceNow Cloud Migration Lead to manage the migration of a self-hosted ServiceNow instance to ServiceNow on GCP, overseeing the project from assessment through go-live and stabilization.

DNS GCP REST API SAML SOAP
15 hours, 41 minutes ago

Senior Engineering Manager - Enablement

Honeycomb.io 51-250 Internet Software & Services

Honeycomb is seeking an Engineering Enablement leader to drive the developer experience, AI-assisted engineering workflows, and platform foundations that help the company ship faster and more safely as it scales.

CI/CD CircleCI GitHub Actions Go JavaScript OpenTelemetry TypeScript
15 hours, 41 minutes ago

Senior Platform Engineer / Senior DevOps Engineer / Senior Infrastructure Engineer / Senior Site Reliability Engineer

Anduril Industries 1K-5K Aerospace & Defense

Anduril Australia is hiring a senior infrastructure and reliability engineer to own a service or platform end to end across cloud and classified environments supporting defense programs.

Active Directory AWS Bash Go Kubernetes Python Terraform
1 day, 14 hours ago

Platform Architect

Auraverse 1-10 Professional Services

Aura is hiring a Boston-based Platform Architect to lead the architecture and evolution of its backend platform for digital safety products serving millions of customers.

API Gateway AWS CI/CD Databricks DynamoDB GitHub Actions Serverless Snowflake Terraform
1 day, 15 hours ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers