Kitman Labs

Kitman Labs

Kitman Labs is the industry-leading sports analytics company that provides a holistic, unified view of individual and team performance. Their Performance Intelligence Platform uses artificial intelligence to optimize athlete performance and health, ser...

Internet Software & Services
11-50
Founded 2012
$77M raised

Description

  • Build and maintain BigQuery data models using Dataform and medallion architecture patterns.
  • Contribute to Looker dashboards and LookML models alongside senior engineers and analysts.
  • Write performant SQL for large-scale transformations in BigQuery.
  • Implement data quality checks with Dataform assertions and automated alerting.
  • Monitor pipeline health, data freshness, and anomalies to support warehouse observability.
  • Build and maintain Python data pipelines with testing, linting, and CI/CD integration.
  • Use Cloud Composer or Airflow to schedule and monitor data workflows.
  • Develop CDC and event-driven ingestion patterns using tools such as Datastream and Pub/Sub.
  • Containerise workloads with Docker for deployment on Cloud Run or similar GCP services.
  • Support data scientists by moving work from notebooks into production pipelines and preparing features for ML workloads.

Requirements

  • Experience writing complex, performant SQL against large datasets in BigQuery.
  • Experience with Dataform, or strong dbt experience with willingness to work in Dataform.
  • Strong Python development skills with clean, tested, linted code.
  • Comfortable using Git and CI/CD workflows.
  • Hands-on experience with BigQuery and broader familiarity with GCP services such as Cloud Storage, Cloud Run, Pub/Sub, and Datastream.
  • Hands-on experience with orchestration tools such as Cloud Composer, Airflow, or comparable tools.
  • Understanding of data modelling fundamentals, including dimensional modelling, Kimball principles, or medallion architecture.
  • Ability to containerise and deploy data workloads with Docker.
  • Ability to translate business requirements into data models and collaborate effectively with Analytics, Product, and Data Science stakeholders.
  • Pragmatic use of AI-assisted development tools to improve productivity and code quality.
  • Looker and LookML experience is preferred.
  • Familiarity with CDC concepts and tools such as Datastream or Debezium is preferred.
  • Exposure to ML frameworks or MLOps tooling such as scikit-learn, MLflow, or Vertex AI is preferred.
  • AWS experience, including Redshift, Glue, or RDS, is preferred.
  • Curiosity about sports performance data is preferred.

Benefits

  • Competitive salary.
  • Health insurance for employees and dependants.
  • Meaningful equity.
  • Pension plan.
  • Life cover.
  • Income protection.
  • Wellbeing benefits.
  • Remote work with occasional face-to-face gatherings recommended.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Senior Data Engineer

Prolific 51-250 Professional Services

Prolific is hiring a Senior Data Engineer to build scalable, privacy-conscious data solutions that support analytics, AI, and product teams across a modern cloud-native platform.

Apache Airflow dbt Java Kubernetes Python REST API Scala SQL Terraform
3 hours, 57 minutes ago

Sr. Data Engineer II (6682)

MetroStar 251-1K IT Services

MetroStar is hiring a Sr. Data Engineer II to work with a cross-functional team supporting mission operations for a classified customer by designing and operationalizing data and AI/ML pipelines.

AWS Databricks Kafka Python Snowflake SQL
4 hours, 27 minutes ago

Product Data Engineer

Anduril Industries 1K-5K Aerospace & Defense

Anduril Industries is hiring a Product Data Engineer to support configuration management across complex defense programs, maintaining accurate product data from requirements through manufacturing and sustainment.

Confluence JIRA
4 hours, 27 minutes ago

Sr. Data Ops Engineer

Samsara 1K-5K IT Services

Samsara is hiring a remote Data Operations Engineer in Canada to own the reliability and performance of its production data platform supporting integrations, pipelines, APIs, and data warehousing for business-critical analytics.

Agile AWS Azure CI/CD Databricks Datadog dbt GCP MySQL Oracle PostgreSQL Python Snowflake Splunk SQL
4 hours, 42 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers