Sleek

Sleek

Sleek: The SMEs' go-to platform for company registration, compliance, and accounting services in Singapore, HK, AU, and the UK. Designed by entrepreneurs, for entrepreneurs, Sleek offers hassle-free solutions to start and manage businesses digitally.

Professional Services
251-1K
Founded 2017
$34M raised

Description

  • Architect, build, and scale infrastructure for Sleek’s next-generation platform and AI-powered capabilities.
  • Partner with Product, Engineering, and AI teams to define infrastructure strategy and delivery priorities.
  • Design resilient, secure, and scalable cloud architectures for production workloads.
  • Review existing cloud infrastructure and create a roadmap for reliability and scalability improvements.
  • Lead upgrades or redesigns of core platform components such as networking, containers, orchestration, and databases.
  • Improve incident response processes, SLIs, SLOs, and on-call readiness.
  • Build or refine pipelines for model hosting, embeddings, vector search, or related AI services when needed.
  • Enhance CI/CD, infrastructure automation, testing automation, and deployment tooling to reduce manual work.
  • Strengthen logging, monitoring, tracing, alerting, readiness checks, runbooks, and automated recovery paths.
  • Improve platform security through secrets management, access control, dependency monitoring, and hardened pipeline practices.

Requirements

  • 6+ years of progressive experience in Site Reliability Engineering (SRE).
  • 6+ years of hands-on experience across multi-cloud environments such as AWS, GCP, and Azure.
  • 6+ years of deep expertise in containerization and orchestration, including Kubernetes, EKS, or ECS.
  • 6+ years of experience with Infrastructure as Code tools such as Terraform, Pulumi, or CloudFormation.
  • Proven ability to design, build, and operate highly reliable, scalable production systems with zero-downtime deployment patterns such as Blue/Green, Canary, or progressive delivery.
  • Experience modernizing deployments with GitOps tools such as ArgoCD or Flux and building self-service developer platforms.
  • Experience implementing multi-cloud API gateways and edge routing solutions such as Kong, Traefik, Cloudflare, or multi-cluster ingress.
  • Strong background in platform security, including secrets management, IAM, runtime hardening, and WAFs.
  • Practical experience with observability tools such as Prometheus, OpenTelemetry, OpenSearch, ELK, or CloudWatch.
  • Experience supporting or deploying AI/ML workloads such as model inference, vector databases, or GPU workloads, or strong familiarity with their infrastructure needs.
  • Excellent communication and collaboration skills with the ability to explain complex infrastructure decisions clearly.
  • Familiarity with Node.js, NestJS, and Python is highly desirable.

Benefits

  • Fully remote role with work from home five days a week.
  • Flexible start times to accommodate personal or family needs, with proactive communication.
  • Ability to work fully remote from anywhere in the world for one month each year.
  • Competitive market salary.
  • Generous paid time off and holiday schedules.
  • Employee share ownership plan for eligible staff.
  • Access to internal and external training programmes.
  • Opportunity for significant autonomy, responsibility, and growth in a fast-moving environment.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Observability Architect

Geotab 1K-5K Road & Rail

Geotab is hiring an SRE Observability Architect to define and lead the observability architecture for its cloud platforms, with the goal of delivering scalable, cost-efficient, and highly reliable insight across distributed systems.

Elasticsearch GCP Go Grafana Helm Jaeger Kubernetes OpenTelemetry Prometheus Python Terraform
1 hour, 44 minutes ago

[Job 30278] SRE (DevOps)

CI&T 5K-10K Internet Software & Services

CI&T is hiring a senior SRE/DevOps to evolve the infrastructure behind critical digital products, with a focus on resilient multi-region AWS architecture and mobile delivery pipelines.

Android Ansible API Gateway AWS Bash CI/CD DynamoDB GitHub Actions GitLab CI Grafana iOS Jenkins Kubernetes Prometheus Python Secrets Management Terraform
1 hour, 59 minutes ago

Senior Manager, Engineering

Sumo Logic 251-1K Internet Software & Services

Sumo Logic is hiring a Senior Manager, Engineering for Application Security to lead global programs that improve product security, reliability, and operational efficiency across its cloud platform.

Agile AWS C++ Docker GCP Java Kafka Kubernetes OWASP Ruby Scala SIEM
1 day, 2 hours ago

Staff Software Engineer - Databases SRE | Sweden | Remote

Grafana 1K-5K IT Services

Grafana Labs is hiring a Staff Software Engineer, SRE to improve the reliability and scalability of Grafana Cloud’s database products for high-value customers across AWS, GCP, and Azure.

AWS Azure GCP Go Helm Java Kubernetes Linux Microservices Python Terraform
2 days, 1 hour ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers