Senior Site Reliability Engineer, Security & Compliance (L3)

1 month ago
Full-time
Senior
DevOps and Infrastructure
CoinGecko

CoinGecko

CoinGecko is a leading cryptocurrency ranking website offering a detailed evaluation of digital currencies based on various metrics.

IT Services
51-250
Founded 2014

Description

  • Review system architecture and software components with engineers and ensure consistent best practices across teams.
  • Own service reliability objectives, monitor operational metrics, and lead improvement plans to meet SLOs and SLAs.
  • Develop and maintain infrastructure tools, including infrastructure-as-code resources, to scale operations and increase team autonomy.
  • Manage, audit, and improve security controls to meet enterprise requirements and compliance standards.
  • Collaborate with legal and compliance teams to assess and manage overall risk.
  • Lead release planning activities such as canary and blue-green deployments, including test environment provisioning and ad hoc performance testing.
  • Lead incident response and post-mortems to resolve production issues, identify root causes, and prevent recurrence.
  • Develop and implement disaster recovery plans, including data recovery procedures and fault-injection simulations on production replicas.
  • Handle day-to-day operational tasks such as access onboarding/offboarding, configuration, patch management, and capacity planning.
  • Develop runbooks, documentation, and technical assets, and support periodic technical audits and cross-functional technical questions.

Requirements

  • 3 to 5 years of experience managing software deployments and production instrumentation in environments with defined SLAs and SLOs.
  • Strong knowledge of software delivery and DevOps principles.
  • Experience with cloud platforms such as AWS, Cloudflare, or GCP.
  • Experience with infrastructure-as-code tools such as Terraform or CloudFormation.
  • Strong programming and scripting skills in Python, Go, Ruby, or similar languages.
  • Bachelor’s degree in Computer Science, InfoSec, or a related field, or relevant professional certifications such as Certified DevOps Professional or AWS/GCP Solutions Architect Professional.
  • Ability to take substantial features from concept to shipping as a sole contributor.
  • Ability to work effectively on open-ended projects, evaluate multiple solutions independently, and dive deep into complex problems.
  • Strong problem-solving and communication skills, including producing structured, data-backed written analysis under pressure.
  • Experience supporting on-call rotations for 24x7 services, including troubleshooting, following runbooks, and escalating incidents.
  • Experience working in a growth-stage startup is preferred.
  • Experience building applications across different tech stacks is preferred.
  • Interest in decentralized technologies and cryptocurrency applications is preferred.

Benefits

  • Remote work flexibility, with optional office space in Malaysia and Singapore.
  • Comprehensive life and hospitalization insurance, including coverage for dependents.
  • Virtual share options, subject to terms and conditions.
  • Annual bonus, subject to terms and conditions.
  • Parking allowance on a claim basis.
  • Monthly meal allowance of RM600 or SGD400.
  • Annual learning allowance of USD500 on a claim basis.
  • Social activity allowance and an annual company offsite.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Site Reliability Engineer

Alpaca 51-250 Capital Markets

Alpaca is hiring a Site Reliability Engineer to keep its brokerage platform reliable and operable across cloud, Kubernetes, observability, messaging, and database systems, with a strong focus on PostgreSQL reliability on the trading-critical path.

DNS GitOps Go Kafka Kubernetes Linux Load Balancing PostgreSQL Python RabbitMQ Secrets Management TLS
1 hour, 29 minutes ago

Site Reliability Engineer

Kaseya 1K-5K IT Services

Kaseya is hiring a Site Reliability Engineer to own the reliability, automation, and production stability of its AWS-based services used by thousands of MSPs worldwide.

Ansible AWS Chef CloudFormation Datadog DevSecOps Elasticsearch Kibana Kubernetes MySQL PostgreSQL Puppet Secrets Management Serverless Terraform
5 hours, 29 minutes ago

SRE - DevOps Engineer - Argentina

Coderio 51-250 Internet Software & Services

Coderio is hiring a remote DevOps/SRE Engineer in Argentina to ensure the stability, scalability, and efficient operation of the infrastructure that supports its global digital solutions.

Argo CD CI/CD Flux GitHub Actions GitOps Helm Jenkins Kubernetes OpenShift Terraform
9 hours, 9 minutes ago

Senior Site Reliability Engineer

Cribl 251-1K IT Services

Cribl is hiring a Senior Site Reliability Engineer in Poland to help build and operate the telemetry infrastructure and observability platform that supports its cloud products and enterprise customers.

Ansible AWS Azure CI/CD Grafana JavaScript Kibana Linux New Relic Node.js PagerDuty Prometheus Splunk Terraform TypeScript
16 hours, 42 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers