Geotab

Geotab is a leading provider of GPS fleet tracking and management solutions, leveraging data analytics and machine learning to optimize fleet performance, enhance driver safety, and ensure regulatory compliance worldwide.

Road & Rail
1K-5K
Founded 2000

Description

  • Define and own the enterprise-wide observability architecture, including technical standards, reference architectures, and multi-year roadmaps.
  • Evaluate, select, and standardize observability tools to reduce tool sprawl and optimize total cost of ownership.
  • Design scalable data pipelines and storage strategies for ingesting and querying petabyte-scale telemetry data across metrics, traces, logs, and profiling.
  • Design Terraform modules and Helm charts for declarative observability infrastructure provisioning across multi-cloud environments.
  • Establish and enforce instrumentation standards using OpenTelemetry, including SDK guidelines, collector deployment patterns, and semantic conventions.
  • Define and champion SLO, SLI, and error-budget frameworks across engineering teams.
  • Serve as a senior escalation point during critical incidents to accelerate diagnosis and resolution.
  • Provide architectural mentorship and technical guidance to Observability Engineers and SRE team members.
  • Collaborate closely with SRE, platform engineering, application development, security, and compliance stakeholders.
  • Influence and drive technical direction across multiple teams and organizational boundaries.

Requirements

  • 5-8 years of experience in Observability Architecture, Site Reliability Engineering, or Platform/Infrastructure Engineering.
  • Post-secondary diploma or degree in Engineering, Computer Science, or a related field.
  • Mastery of the OpenTelemetry ecosystem and expert-level knowledge of Prometheus-compatible metrics systems such as VictoriaMetrics and Thanos.
  • Advanced experience with tracing systems such as Grafana Tempo and Jaeger, and log aggregation platforms such as Loki, Elasticsearch, and Google BigQuery.
  • Expert-level proficiency in cloud infrastructure (GCP strongly preferred) and Kubernetes architecture.
  • Strong software engineering skills in Go, Python, or similar languages for building cloud-native tooling.
  • Excellent communication skills with the ability to articulate technical architecture to executive audiences and influence across organizational boundaries.
  • Deep expertise in designing enterprise-scale observability platforms.
  • Preferred certifications: Google Cloud Professional Cloud Architect or Certified Kubernetes Administrator (CKA).
  • Ability to work in a fast-paced, evolving environment with willingness to take on new tasks and activities.

Benefits

  • Hiring range of $116,200 to $155,000 CAD annually.
  • Flexible working arrangements, including a hybrid working model.
  • Home office reimbursement program.
  • Baby bonus and parental leave top-up program.
  • Online learning and networking opportunities.
  • Electric vehicle purchase incentive program.
  • Competitive medical and dental benefits.
  • Retirement savings program.
