Senior SRE Engineer

4 weeks, 1 day ago
Full-time
Senior
Software Development
Trustly

Trustly

Trustly specializes in developing and providing online payment solutions that leverage Open Banking technology to enhance payment processes, reduce costs, and streamline financial services for consumers, merchants, and banks.

Diversified Financial Services
251-1K
Founded 2008

Description

  • Architect, design, and implement strategies to ensure high availability, reliability, and fault tolerance of infrastructure and applications.
  • Lead incident response efforts, perform root cause analysis, implement preventative measures, and own post-incident follow-ups and remediation.
  • Monitor and observe production systems using automation tools to detect, triage, and resolve reliability issues.
  • Identify performance bottlenecks, conduct performance analysis, and optimize system and application performance.
  • Drive automation initiatives to remove manual toil by developing and maintaining tools, scripts, and frameworks for deployment, monitoring, and troubleshooting.
  • Generate regular reports on system reliability, uptime, and performance metrics and present findings, trends, and recommendations to management and stakeholders.
  • Collaborate with cross-functional teams to define SLIs/SLOs/error budgets, KPIs, and develop reporting frameworks to track system health and operational efficiency.
  • Support and maintain critical services running in AWS and on-premises, including system, security, and network monitoring and maintenance.

Requirements

  • Bachelor's degree in Computer Science or a related field.
  • Experience building SLIs, SLOs, and error budgets based on business rules.
  • IT project management experience.
  • Coding experience with Python, Java, Shell, Bash, or similar languages.
  • Experience supporting critical production services in the cloud (AWS) and on-premises environments.
  • Experience with network technologies and system, security, and network monitoring tools.
  • Detailed technical knowledge of databases and the Linux operating system, including standards and best practices for keeping services up and running.
  • Proactive approach to spotting problems, removing manual processes/toil using code, and fixing performance concerns programmatically.
  • Advanced English.
  • Ability to work remotely from Brazil (remote-first culture; position supports working from any city in Brazil).

Benefits

  • Bradesco health and dental plan for you and your dependents with no co-payment cost.
  • Life insurance with differentiated coverage.
  • Meal voucher and supermarket voucher.
  • Home office allowance and remote-first flexible hours (work from any city in Brazil).
  • Gympass access to physical activity spaces and online classes.
  • English program with online group classes and private teacher.
  • Welcome kit with Apple equipment (MacBook Pro, iPhone) and option to purchase equipment under internal criteria.
  • Annual discretionary bonus (annual premium) based on company KPIs and employee referral program rewards.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Senior Site Reliability Engineer

OfficeSpace Software 251-1K Internet Software & Services

OfficeSpace Software is hiring a Senior Site Reliability Engineer to own the performance, reliability, and cost efficiency of its production platform at scale while helping modernize operations with AI-assisted reliability engineering.

Ansible Apache Argo CD CI/CD Datadog GitOps Grafana Kubernetes Linux MariaDB Microservices MySQL Nginx PostgreSQL Prometheus Puppet Python Redis Ruby Ruby on Rails Sidekiq Terraform
50 minutes ago

Senior Database Reliability Engineer

Sezzle 251-1K Diversified Financial Services

Sezzle is hiring a Senior Database Reliability Engineer to design, build, and scale the shared database platform and reliability controls that support its applications across production and development environments.

AWS CI/CD Datadog Elasticsearch Encryption Git Go Grafana Helm Kubernetes Microservices MySQL New Relic OpenTelemetry PostgreSQL Prometheus Python React React Native REST API Secrets Management Terraform TypeScript
2 hours ago

Associate Site Reliability Engineer

Ivanti 1K-5K Internet Software & Services

Ivanti is hiring a Site Reliability Engineer to help operate and improve its cloud-based SaaS services through automation, observability, and reliable production support.

Ansible Apache AWS Azure Chef Docker Elasticsearch Git HAProxy InfluxDB Java Jenkins Kafka Kubernetes Linux MongoDB MySQL Nginx PostgreSQL PowerShell Python Redis Ruby Splunk Terraform
6 hours, 9 minutes ago

Senior Database Reliability Engineer

Sezzle 251-1K Diversified Financial Services

Sezzle is hiring a Senior Database Reliability Engineer to design and scale the database platform that supports its applications and improve reliability, safety, and developer experience across the company’s production systems.

AWS CI/CD Datadog Docker Elasticsearch Git GitLab Go Grafana GraphQL Helm Kubernetes Microservices MySQL New Relic OpenTelemetry PostgreSQL Prometheus Python React React Native REST API Terraform TypeScript
16 hours, 4 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers