SPD Technology

SPD Technology specializes in custom software product development, with a focus on fintech and payment solutions, AI/ML solutions, data engineering, and cloud services that help businesses use technology for growth and innovation.

Internet Software & Services
Founded 2006

Description

  • Define and lead the performance and reliability strategy across products, focusing on critical end-to-end flows, endpoints, APIs, and production-like scenarios.
  • Collaborate with Engineering, Product, Architecture, DevOps/SRE, and QA teams to align non-functional requirements, acceptance criteria, and validation approaches.
  • Design and execute performance, scalability, load, and stress testing activities, including hands-on support where needed.
  • Define realistic production-like workloads and test scenarios based on traffic, concurrency, throughput, and key business flows.
  • Support the definition and validation of SLAs, SLOs, performance baselines, and release quality gates.
  • Establish and evolve cross-company performance testing standards, practices, frameworks, and tooling.
  • Identify bottlenecks across applications, APIs, databases, infrastructure, and integrations, and recommend optimizations.
  • Improve observability through dashboards, metrics, logs, monitoring, and reporting.
  • Provide technical direction and architectural guidance to performance testing engineers and other stakeholders while remaining hands-on.
  • Prepare reports on findings, risks, system limits, benchmark results, and production-readiness recommendations.
  • Lead resilience engineering efforts by introducing chaos experiments in Kubernetes and integrating them into CI/CD pipelines.

Requirements

  • Strong hands-on experience in performance testing, performance engineering, or non-functional testing.
  • Experience building, defining, or leading performance strategy for APIs, backend services, distributed systems, or high-load platforms.
  • Strong experience with load testing, stress testing, bottleneck analysis, baseline validation, performance reporting, and production-like test scenarios.
  • Good understanding of NFRs, including performance, scalability, reliability, and availability.
  • Experience with SLA/SLO-driven validation and performance quality gates.
  • Experience integrating performance checks into CI/CD pipelines.
  • Experience with quality gates in release processes.
  • Experience working with observability and monitoring tools, dashboards, metrics, logs, and reporting, ideally in Kubernetes-based environments.
  • Ability to operate as an architect-level hands-on specialist with broad system-level thinking.
  • Strong communication and stakeholder-management skills across Product, Architecture, DevOps/SRE, QA, and Engineering teams.
  • Experience with APM tools and distributed tracing is a plus.
  • Experience with microservices, cloud environments, and Kubernetes at scale is a plus.
  • Experience defining or standardizing performance practices across multiple teams or products is a plus.
  • Familiarity with AI/ML techniques for anomaly detection, performance insights, or predictive scaling is a plus.

Benefits

  • Fully remote work with a flexible working schedule.
  • Stable workload and stable income.
  • Provided laptop and licensed software.
  • Performance and merit reviews.
  • Personal development plans and individual learning opportunities.
  • Corporate library and public speaking support.
  • Company-wide tech and cultural events.
  • Referral bonus program and HR support.
