RUNWARE

RUNWARE provides an affordable API that enables AI developers to efficiently run image, video, and custom generative AI models without the need for extensive infrastructure or machine learning expertise.

Industry: Internet Software & Services
Company size: 1-10 employees
Founded: 2023

Description

  • Build and maintain end-to-end tracking of inference time across the platform, both globally and per model.
  • Monitor how implementation changes affect total request latency and identify performance regressions.
  • Create automated alerts and historical trend reporting for errors, delays, bottlenecks, and regressions.
  • Build internal dashboards for engineering, product, and leadership teams.
  • Build client-facing usage dashboards covering requests, errors, success rates, performance, and model or API adoption.
  • Support clients in using analytics and visibility tools to debug integrations.
  • Implement metrics, logs, and traces to improve platform observability and scalability.
  • Work closely with DevOps and backend teams to improve system monitoring and observability.
  • Provide data insights that inform infrastructure decisions such as GPU allocation, autoscaling, caching, and batching.
  • Select, maintain, and support the tooling and data pipelines used for analytics and monitoring.

Requirements

  • Strong experience in data analytics, observability, or monitoring.
  • Hands-on experience with metrics, logging, and tracing frameworks such as Prometheus, Grafana, Datadog, New Relic, or similar tools.
  • Good understanding of backend systems and distributed architectures.
  • Ability to turn raw metrics into actionable insights.
  • Experience building dashboards for internal and external stakeholders.
  • Familiarity with AI model monitoring, including latency, throughput, error codes, and GPU utilization.
  • Experience with AI/ML infrastructure or inference pipelines is preferred.
  • Experience with GPUs is preferred.
  • Understanding of Python APIs, FastAPI, or Node.js environments is preferred.
  • Experience with high-throughput real-time systems is preferred.
  • Startup or scale-up experience is preferred.
  • Ability to work autonomously and own problems end to end.
  • Comfort collaborating with ML, backend, DevOps, and product teams.
  • UK right to work is required; visa sponsorship is not available.

Benefits

  • Remote-first work setup with the ability to work from home in eligible locations.
  • Flexible hours outside core collaboration blocks.
  • Generous paid time off, including vacation, sick days, and public holidays.
  • Meaningful stock options.
  • Paid family leave, including maternity, paternity, and caregiver time.
  • Twice-yearly company retreats in inspiring locations.
  • Built-in downtime after major release cycles to rest and recharge.
