AI Security - AI Platform Team Lead

25 minutes ago
Lead
Artificial Intelligence and Machine Learning
Cato Networks

Cato Networks

Cato Networks is the world's leading single vendor SASE platform that converges SD WAN, security, global backbone, and remote access into a global cloud-native service. Their robust platform optimizes and secures application access for all users and lo...

Diversified Telecommunication Services
251-1K
Founded 2015
$770M raised

Description

  • Build and lead the AI Platform team, including hiring, mentoring, architecture, technical direction, and execution.
  • Own the AI security runtime platform for high-throughput, low-latency inline security decisions across the global cloud and PoPs.
  • Design the orchestration layer for running GPU models, CPU heuristics, and security logic as one production engine.
  • Own production readiness, including observability, SLOs, autoscaling, reliability, rollout, rollback, and operational health.
  • Own the model lifecycle platform, including registry, versioning, deployment, monitoring, and safe production rollout.
  • Work closely with research and algorithm teams to productionize AI security models and algorithms at scale.
  • Define the long-term platform strategy for AI runtime and model serving at Cato.

Requirements

  • 3+ years of leadership experience as a team lead, tech lead, or engineering manager.
  • 3+ years of hands-on experience in AI inference, production ML infrastructure, model serving, or AI runtime platforms.
  • Strong experience with production inference technologies such as Triton, vLLM, CUDA, Kubernetes, Docker, PyTorch, ONNX, TensorRT, or similar.
  • 3+ years of experience with Go, or strong experience with a similar high-performance backend language such as C++, Rust, or Java.
  • Experience with performance optimization, scalability, observability, and SLO-driven production ownership.
  • Strong system design skills, especially around distributed systems, performance, reliability, and production infrastructure.
  • Experience with GPU optimization, GPU scheduling, GPU resource efficiency, quantization, runtime acceleration, or large-scale model serving.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Software Engineer III, Voice AI

Natera 1K-5K Pharmaceuticals

Natera is hiring a Software Engineer III to build and maintain a real-time Voice AI platform that supports automated patient calls and improves access to genetic testing information.

Agile API Gateway AWS AWS CDK Docker DynamoDB Express.js GitLab HIPAA Jest Kafka LLM Microservices MySQL NestJS Node.js OAuth Redis Twilio TypeScript WebSockets
10 minutes ago

Senior Full Stack Engineer (NLP)

Noodle 251-1K Diversified Consumer Services

Noodle is hiring a Senior Full Stack Engineer to help build and improve its Noodle Learning Platform, a direct-to-consumer product serving universities, learners, and corporate upskilling markets.

CSS Django Flask Flux HTML JavaScript Python React REST API Sass SQLAlchemy Wireframing
25 minutes ago

CX AI & Automation Lead

Remote 251-1K Professional Services

Remote is hiring a CX-focused technical programme lead to build and scale automation and AI capabilities that improve customer support operations across its globally distributed team.

REST API
25 minutes ago

Sr. AI Engineer, Platform Infrastructure, Special Programs

SpaceX 10K-50K Aerospace & Defense

SpaceX is hiring a Sr. AI Engineer, Platform Infrastructure, Special Programs to build deterministic tooling and deployment infrastructure for classified and high-security environments supporting teams working at public cloud, enterprise on-prem, and air-gapped sites.

Buildkite CI/CD GitHub Actions Go Helm Kubernetes Linux Pulumi Python Terraform
34 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers