Tyk API Management

Tyk is a leading API Management Platform that enables interconnectivity between systems and devices through its fast, scalable, and open-source API Gateway, Analytics, Dev Portal, and Dashboard.

Internet Software & Services

Information Technology

51-250 (240)

Founded 2015

$40M raised

9 open positions

Links

View All Jobs

Site Reliability Engineer

3 months, 1 week ago

Canada

Full-time

Senior

Site Reliability Engineer (SRE)

DevOps and Infrastructure

AWS Azure DNS GCP Go Grafana Helm HTTP Kubernetes Linux Load Balancing Microservices MongoDB Penetration Testing Prometheus Rancher Redis TCP/IP Terraform TLS

Apply Now

Tyk API Management

Tyk is a leading API Management Platform that enables interconnectivity between systems and devices through its fast, scalable, and open-source API Gateway, Analytics, Dev Portal, and Dashboard.

Internet Software & Services

51-250

Founded 2015

$40M raised

View All Jobs 9

Description

Maintain Tyk Cloud availability and help define SLA/SLO/SI targets.
Identify reliability issues and work with the squad to resolve them.
Create and improve metrics and dashboards to monitor platform health.
Participate in the on-call rotation and serve as first-line incident management support.
Conduct post-incident analysis and help define response processes.
Automate common operational tasks and improve support workflows.
Document operational knowledge, SRE processes, and policies.
Support the expansion of the platform across multi-region and multi-cloud environments.
Recommend and implement ways to improve operational efficiency and reduce running costs without affecting service.
Assist with cloud penetration testing by coordinating with the provider and preparing technical details and environment setup.

Requirements

Experience launching and operating production-scale Kubernetes clusters.
Experience designing and operating infrastructure on AWS and other cloud providers.
Experience operating MongoDB or similar document databases.
Experience operating Redis or similar key-value storage clusters.
Experience administering Linux servers and maintaining distributed software.
Experience operating Prometheus, Grafana, and logging collection/analysis systems.
Strong collaboration skills and a proactive, energetic, innovative, change-oriented mindset.
Advanced knowledge of Kubernetes and containers, AWS/EKS, and Linux.
Proficient with Terraform and infrastructure as code, and Helm.
Familiarity with Go, monitoring tools such as Thanos, and networking concepts including subnets, routing, peering, load balancing, NAT, DNS, TCP/IP, HTTP, TLS, and UDP.
Availability to participate in the on-call rotation, including 16:00–4:00 UTC.
Nice to have: experience with GCP or Azure, bare metal infrastructure, API management, large-scale distributed storage, Rancher, CKA/CKAD/CKS certifications, or production software delivery in Go.

Benefits

Unlimited paid holiday.
Remote working from anywhere in the world.
Flexible working hours.
Employee share scheme.
Generous maternity and paternity leave.
Company retreats.
An inclusive, values-driven culture that emphasizes authenticity, respect, responsibility, independence, honesty, diversity, and inclusion.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Senior Site Reliability Engineer

Counterpart Health 51-200 hospital & health care

Counterpart Health is hiring a Senior Site Reliability and Infrastructure Engineer to support and evolve the technology platform behind its primary care tool and maintain reliable infrastructure for domestic and international workloads.

United States Full-time Senior Site Reliability Engineer (SRE)

$160k-$208k

AWS Azure CI/CD Containerd DNS Docker GCP Go gRPC Helm Kubernetes Linux Load Balancing Prometheus Python Shell Scripting TCP/IP

19 hours, 6 minutes ago

Apply

19 hours, 6 minutes ago

Senior Test Platform & Reliability Engineer - Star Trek Fleet Command

Scopely 1K-5K Internet Software & Services

Scopely is hiring a Senior Test Platform & Reliability Engineer in Ireland to build validation, reliability, and developer enablement platforms for Star Trek Fleet Command’s large-scale live-service backend systems.

Ireland Full-time Senior SDET (Software Development Engineer in Test) Site Reliability Engineer (SRE)

AWS Bash CI/CD Docker GitLab Go Python Terraform

19 hours, 21 minutes ago

Apply

19 hours, 21 minutes ago

Senior Software Engineer - Databases, SRE | Canada | Remote

Grafana 1K-5K IT Services

Grafana Labs is hiring a Senior Software Engineer for its remote SRE team to improve reliability and operability of Grafana Cloud database services for high-SLA customers across AWS, GCP, and Azure.

Canada Full-time Senior Site Reliability Engineer (SRE) Software Engineer

$108k-$130k

AWS Azure GCP Go Helm Java Kubernetes Linux Microservices Python Terraform

1 day, 18 hours ago

Apply

1 day, 18 hours ago

Senior Site Reliability Engineer

Semios 51-250 Food Products

Semios Group is hiring a Senior Site Reliability Engineer to help scale, secure, and improve the reliability of its global agricultural technology platform.

Canada Full-time Senior Site Reliability Engineer (SRE)

$140k-$160k

AWS Azure Bash Buildkite CI/CD Datadog Docker Envoy GCP Git GitHub GitHub Actions GitLab Go Jenkins Kubernetes Linux NATS New Relic Prometheus Python Ruby Splunk Terraform

1 day, 19 hours ago

Apply

1 day, 19 hours ago

Tyk API Management

Tags

Links

Site Reliability Engineer

Tyk API Management

Description

Requirements

Benefits

Similar Roles

Senior Site Reliability Engineer

Senior Test Platform & Reliability Engineer - Star Trek Fleet Command

Senior Software Engineer - Databases, SRE | Canada | Remote

Senior Site Reliability Engineer

You're on a roll! Sign up now to keep applying.