dv01

dv01

dv01 provides a comprehensive data management, reporting, and analytics platform specifically designed for lending markets, enabling users to gain transparency and insights into loan-level data while streamlining asset-based finance operations.

IT Services
51-250
Founded 2014
$34M raised

Description

  • Design, build, and operate cloud-native infrastructure and platform tooling for AI development.
  • Own the DevOps and infrastructure foundations for MLOps and agentic systems, including CI/CD, observability, reliability, and cost management.
  • Build and maintain infrastructure for AI services such as LLM-backed APIs, MCP servers, and production agentic systems.
  • Enable secure runtime orchestration, tool access, and isolation boundaries for AI-driven workloads.
  • Apply MLOps practices to platform operations, including monitoring, alerting, anomaly detection, and incident response.
  • Define and implement governance, security, access control, deployment policies, and auditability for AI infrastructure.
  • Partner with security and compliance teams to align AI infrastructure with organizational and regulatory requirements.
  • Influence platform architecture and best practices as a technical leader across teams.
  • Mentor engineers and provide technical guidance to cross-functional partners.
  • Contribute to dv01’s AI and data roadmap and technical strategy.

Requirements

  • 8+ years of experience in cloud infrastructure, DevOps, or platform engineering roles.
  • 5+ years of MLOps experience is required.
  • Deep experience designing and operating distributed systems in production.
  • Experience with cloud-native infrastructure, including Kubernetes, containerized workloads, and infrastructure-as-code tools such as Terraform.
  • Hands-on experience supporting systems that host or run deep neural networks, including LLM runtimes such as vLLM or llama.cpp.
  • Experience with ML compiler stacks such as LLVM/MLIR and PyTorch-based production systems.
  • Strong understanding of infrastructure security, IAM, secrets management, and operational risk for AI-enabled systems.
  • Ability to lead technically, influence architecture and standards, and work effectively in ambiguous, cross-functional environments.
  • Experience designing and operating scalable benchmarking and evaluation frameworks for agentic AI systems, including LLM-as-a-judge approaches, is expected.
  • Nice to have: experience with Pulumi, GCP, Cloudflare, GHA, Harness, or Go.
  • Nice to have: experience supporting data engineering platforms or working with data warehousing and ETL/ELT tools or operations.

Benefits

  • Unlimited PTO.
  • $1,000 learning and development fund.
  • Remote-first, flexible work environment.
  • Comprehensive medical, dental, and vision insurance for employees and their families.
  • 401(k) retirement plan.
  • $138/month fitness stipend plus up to $1,650 per year through the Fitness Fund.
  • 16 weeks of 100% paid parental leave for primary caregivers and 4 weeks for secondary caregivers.
  • Salary range of $185,000 to $200,000.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Principal Software Engineer - (Platform & Applications - CloudApps)

Motional 1K-5K Automotive

Motional is seeking a Principal Engineer for Cloud Applications in Singapore to architect and scale fleet management platforms for autonomous vehicles as the company transitions to global commercial operations.

AWS Azure C++ CI/CD ClickHouse Embedded Systems GCP Go IoT Python React
13 minutes ago

Director, AI Platforms

SoFi 1K-5K Capital Markets

SoFi is seeking a Director, AI Platforms to build and lead internal AI and SDLC platform services that enable secure, scalable AI development and deployment across the company.

AWS CI/CD Kubernetes
13 minutes ago

Senior Machine Learning Engineer, Data Mining

Motional 1K-5K Automotive

Motional is hiring a Senior Machine Learning Engineer to build and deploy multimodal data mining systems that help autonomous vehicles uncover rare edge cases, model errors, and critical intelligence from large-scale sensor data.

AWS Azure CI/CD GCP Machine Learning MLOps Python PyTorch Reinforcement Learning System Design TensorFlow
13 minutes ago

Manager II, Machine Learning Engineering, Core Engineering

Pinterest 5K-10K Internet Software & Services

Pinterest is hiring a Machine Learning Engineer Engineering Manager II to lead the technical direction and execution of core recommendation and search systems that shape the experience for 500M+ monthly users.

Computer Vision Machine Learning NLP SQL
13 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers