Alpaca

Alpaca is a developer-first API for stock and crypto trading, offering easy-to-use APIs for building apps and trading algorithms.

Capital Markets

Financials

51-250 employees (167)

Founded 2015

$87M raised

31 open positions

Site Reliability Engineer

1 month, 4 weeks ago

Europe

Full-time

Mid Level

Site Reliability Engineer (SRE)

DevOps and Infrastructure

DNS GitOps Go Kafka Kubernetes Linux Load Balancing PostgreSQL Python RabbitMQ Secrets Management TLS

Description

Operate production systems day to day, including on-call support, incident response, postmortems, and follow-up remediation.
Define and refine reliability practices, including SLIs, SLOs, and error budgets.
Improve observability across metrics, logs, traces, and alerting.
Ship infrastructure as code through a GitOps workflow for cloud resources and Kubernetes workloads.
Support and improve PostgreSQL performance, schema and migration review, online migrations, high availability, disaster recovery, and CDC pipelines.
Mentor engineers on reliability and database fundamentals through code review, design review, and pairing.
Collaborate with product teams to help ensure services operate within reliability objectives.

Requirements

4+ years of experience in SRE, DevOps, Platform/Infrastructure, or backend engineering with significant production operations ownership.
Hands-on experience operating production services on Kubernetes.
Experience shipping infrastructure as code in a GitOps workflow.
Solid working knowledge of PostgreSQL in production, including query plans, pg_stat_* views, indexing, schema trade-offs, and safe online migrations on non-trivial tables.
Cloud networking fundamentals, including VPCs, routing, L4/L7 load balancing, DNS, and TLS.
Comfort debugging cross-service connectivity issues.
Comfort with a modern observability stack and proficiency with Linux at the operator level.
Experience with incident response, structured debugging, and postmortems that drive change.
Working proficiency in Go or Python, along with strong written and verbal communication skills.
Genuine interest in databases and willingness to grow PostgreSQL/DBA expertise.
Deeper PostgreSQL experience with large clusters at OLTP load, connection pooling at scale, HA/DR ownership, or CDC pipelines is preferred.
Experience with typed SQL access layers in Go, such as pgx, gorm, or sqlc, is preferred.
Production experience with messaging systems at scale, such as RabbitMQ, Kafka, or Redpanda, is preferred.
Security and compliance experience in a regulated environment, including SOC 2, secrets management, and audit logging, is preferred.
Familiarity with trading, brokerage, or other regulated fintech domains is preferred.

Benefits

Competitive salary with stock options.
Health benefits.
One-time USD $500 new hire home-office setup stipend.
Monthly USD $150 stipend via a Brex Card.

Similar Roles

Senior Site Reliability Engineer

Counterpart Health 51-200 hospital & health care

Counterpart Health is hiring a Senior Site Reliability and Infrastructure Engineer to support and evolve the technology platform behind its primary care tool and maintain reliable infrastructure for domestic and international workloads.

United States Full-time Senior Site Reliability Engineer (SRE)

$160k-$208k

AWS Azure CI/CD Containerd DNS Docker GCP Go gRPC Helm Kubernetes Linux Load Balancing Prometheus Python Shell Scripting TCP/IP

16 hours, 6 minutes ago

Senior Test Platform & Reliability Engineer - Star Trek Fleet Command

Scopely 1K-5K Internet Software & Services

Scopely is hiring a Senior Test Platform & Reliability Engineer in Ireland to build validation, reliability, and developer enablement platforms for Star Trek Fleet Command’s large-scale live-service backend systems.

Ireland Full-time Senior SDET (Software Development Engineer in Test) Site Reliability Engineer (SRE)

AWS Bash CI/CD Docker GitLab Go Python Terraform

16 hours, 21 minutes ago

Senior Software Engineer - Databases, SRE | Canada | Remote

Grafana 1K-5K IT Services

Grafana Labs is hiring a Senior Software Engineer for its remote SRE team to improve reliability and operability of Grafana Cloud database services for high-SLA customers across AWS, GCP, and Azure.

Canada Full-time Senior Site Reliability Engineer (SRE) Software Engineer

$108k-$130k

AWS Azure GCP Go Helm Java Kubernetes Linux Microservices Python Terraform

1 day, 15 hours ago

Senior Site Reliability Engineer

Semios 51-250 Food Products

Semios Group is hiring a Senior Site Reliability Engineer to help scale, secure, and improve the reliability of its global agricultural technology platform.

Canada Full-time Senior Site Reliability Engineer (SRE)

$140k-$160k

AWS Azure Bash Buildkite CI/CD Datadog Docker Envoy GCP Git GitHub GitHub Actions GitLab Go Jenkins Kubernetes Linux NATS New Relic Prometheus Python Ruby Splunk Terraform

1 day, 16 hours ago
