SWORD Health

SWORD Health

SWORD Health provides AI-powered digital physical therapy solutions designed to prevent pain, support recovery, and enhance overall health, while also aiming to transform the rehabilitation industry through innovative technology and clinical oversight.

Health Care Providers & Services
251-1K
Founded 2015
$324M raised

Description

  • Design, build, and maintain inference infrastructure for AI products with high throughput, low latency, and cost efficiency.
  • Own the end-to-end deployment pipeline for AI models, from computer vision to large language models.
  • Architect and scale Kubernetes clusters for GPU-accelerated workloads, including autoscaling and resource scheduling.
  • Build and operate infrastructure for real-time AI agents, including WebRTC cluster provisioning and low-latency speech services.
  • Drive inference scaling strategies such as speculative decoding, continuous batching, and model parallelism.
  • Develop and maintain Infrastructure as Code and GitOps workflows for GPU-enabled environments.
  • Instrument and monitor inference systems for GPU utilization, model latency, throughput, and error rates.
  • Collaborate with ML Engineers, Data Scientists, and Product teams to turn model requirements into production-ready infrastructure.
  • Evaluate emerging AI infrastructure tools, frameworks, and hardware to improve performance and efficiency.
  • Mentor team members on AI infrastructure best practices and production ML systems.

Requirements

  • 5+ years of infrastructure engineering experience, including at least 2 years focused on AI/ML workloads in production.
  • Strong Kubernetes experience for GPU-accelerated workloads, including scheduling, resource management, and autoscaling.
  • Hands-on experience with model serving and inference optimization for computer vision and large language model workloads.
  • Solid understanding of LLM inference optimization techniques, including speculative decoding, batching, quantization, and model parallelism.
  • Experience provisioning and managing infrastructure for real-time AI systems, including WebRTC clusters and AI agent architectures.
  • Familiarity with real-time video/computer vision inference pipelines and low-latency continuous data processing.
  • Familiarity with speech-to-text and text-to-speech serving infrastructure for low-latency voice AI.
  • Experience with Infrastructure as Code tools such as Terraform and with GitOps methodologies.
  • Working knowledge of GPU infrastructure, including the NVIDIA CUDA ecosystem, multi-GPU setups, and GPU monitoring/profiling.
  • Strong Linux systems and networking fundamentals for latency-sensitive workloads.
  • Fluent in English, both written and oral.
  • Proactive, ownership-driven mindset with the ability to identify and resolve inference bottlenecks early.
  • Experience with LLM serving engines such as vLLM, SGLang, or LLM-D (preferred).
  • Experience with NVIDIA Triton Inference Server and TensorRT for real-time computer vision workloads (preferred).
  • Familiarity with NVIDIA Riva or similar STT/TTS platforms (preferred).
  • Experience with Istio or similar service mesh tools (preferred).
  • Experience with Kafka for event streaming (preferred).
  • Experience with Prometheus, AlertManager, and Grafana for observability (preferred).
  • Experience with Elasticsearch, Logstash, and Kibana for log management (preferred).
  • Experience with Vault for secrets management (preferred).
  • Experience with Redis, MySQL, and DNS management (preferred).
  • Experience provisioning infrastructure on AWS, Azure, or GCP (preferred).
  • Good knowledge of cloud networking, including VPCs, routing, NAT, and troubleshooting with tools like TCPdump (preferred).
  • Experience with WebRTC infrastructure and real-time media streaming (preferred).
  • Experience with Python, Go, or similar languages used in ML infrastructure tooling (preferred).
  • Familiarity with SCRUM methodology (preferred).

Benefits

  • €66,500 - €104,500 annual salary range, including base, variable, and equity.
  • Competitive compensation with potential bonus and stock option value.
  • Flexible remote or hybrid work policy.
  • Unlimited vacation and the ability to control your working hours remotely.
  • Health and well-being program, including digital therapist sessions.
  • Career development and growth opportunities.
  • Opportunity to work with a talented team on an innovative healthcare solution.
  • Fast-paced, stimulating environment with room for creativity.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Staff Software Engineer - Back End Ai (Gurugram based)

Agoda 10K-50K Consumer Services

Agoda is hiring an experienced Software Engineer to design and build mission-critical backend APIs and distributed systems that support millions of daily search requests on its global travel platform.

ActiveMQ Agile Apache Spark C# Cassandra Git Hadoop Java Kafka MongoDB Play Framework Puppet RabbitMQ Scala Scrum SQL TeamCity
1 hour, 5 minutes ago

Applied AI Engineer

Future 251-1K Hotels, Restaurants & Leisure

Future is hiring an Applied AI Engineer to build and ship production AI features for its digital personal training platform, improving the product experience and business outcomes.

AWS AWS CDK Datadog LLM OpenTelemetry Python Terraform
1 hour, 17 minutes ago

Principal IT Engineer

K2 Space Corporation 51-200 Defense and Space Manufacturing

K2 Space is hiring an IT Systems Architect/Engineer to own and scale the core IT foundation supporting its large satellite development, testing, and mission operations.

2 hours ago

Intermediate Software Engineer - Artificial Intelligence (AI)

Tucows 251-1K Diversified Telecommunication Services

Tucows Domains is hiring a remote Intermediate Software Engineer specializing in Artificial Intelligence to help build AI-powered systems for domain services and related tools.

Go Hugging Face LLM Machine Learning Python REST API TensorFlow
2 hours, 36 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers