AlphaSense

AlphaSense

AlphaSense develops an artificial intelligence-based search platform that enables investment and corporate professionals to quickly access and analyze extensive financial data and market insights from over 500 million documents, enhancing decision-maki...

Internet Software & Services
251-1K
Founded 2011
$770M raised

Description

  • Design and implement multi-region, multi-AZ AWS architectures that meet RTO and RPO targets.
  • Engineer active-active and active-passive failover patterns using Route 53, Global Accelerator, and CloudFront.
  • Build automated disaster recovery runbooks and playbooks using AWS Systems Manager Automation and Step Functions.
  • Implement chaos engineering practices with AWS Fault Injection Simulator to validate resiliency.
  • Architect cross-region replication strategies for S3, DynamoDB Global Tables, RDS, and Aurora Global.
  • Review Kubernetes-based workloads and ensure resilience through self-healing, auto-scaling, and multi-cluster or multi-region deployments.
  • Administer AWS Backup across core services with policy-based automation and backup replication.
  • Develop and automate data recovery testing procedures and restore drills to validate integrity and service-level targets.
  • Author and maintain Infrastructure as Code templates for BCP/DR components and automate DR testing pipelines through CI/CD.
  • Build monitoring, alerting, and incident response workflows, including dashboards, alerts, on-call participation, and post-incident reviews.

Requirements

  • 5+ years of experience in cloud infrastructure, SRE, or IT disaster recovery engineering roles.
  • 3+ years of hands-on AWS experience in production environments at scale.
  • Proven delivery of multi-region disaster recovery architectures with defined and tested RTO/RPO targets.
  • Strong scripting skills in Python, Bash, or PowerShell for automation and orchestration.
  • Experience with Infrastructure as Code tools such as Terraform and/or AWS CloudFormation.
  • Solid understanding of networking fundamentals including VPC, TGW, Direct Connect, VPN, and DNS failover.
  • Excellent written and verbal communication skills, with the ability to produce executive-level DR reports.
  • AWS Certified Solutions Architect – Professional or AWS Certified DevOps Engineer – Professional preferred.
  • AWS Certified Advanced Networking – Specialty preferred.
  • Experience with AWS Resilience Hub, CloudEndure/AWS Elastic Disaster Recovery, Kubernetes-based DR, or serverless DR patterns preferred.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Staff Operations Engineer

Mozilla 251-1K Internet Software & Services

Mozilla is hiring a Staff Operations Engineer to lead the design, reliability, and evolution of hybrid-cloud and workplace infrastructure across teams.

Ansible DNS Linux Puppet Python TCP/IP Unix
6 hours, 16 minutes ago

Principal Site Reliability Engineer (SRE)

Symmetrio Professional Services

Symmetrio is recruiting a Principal Site Reliability Engineer for a rapidly growing healthcare technology company to own the reliability, scalability, security, and performance of a mission-critical SaaS platform used by healthcare providers across the United States.

Active Directory AWS CI/CD Datadog Django Grafana Kubernetes Python Terraform Windows Server
6 hours, 32 minutes ago

Senior Infrastructure Security Engineer

Dropbox 1K-5K Internet Software & Services

Dropbox is hiring a Security Engineer to secure its AI and agentic infrastructure while helping protect products and users across cloud and on-prem environments.

Bash CI/CD CrowdStrike Go Java Kubernetes Linux LLM Node.js OAuth OpenID Connect OWASP Python Ruby Rust SIEM
6 hours, 32 minutes ago

Cloud Infrastructure Administrator II

Jenzabar 251-1K Internet Software & Services

Jenzabar is hiring a Cloud Infrastructure Administrator II to support cloud security operations, vulnerability remediation, and compliance efforts across its cloud environment.

AWS Azure Cloudflare CrowdStrike Cybersecurity GCP Kubernetes SIEM Terraform
6 hours, 46 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers