Site Reliability Engineer, Tech Lead

16 hours, 13 minutes ago
Contract
Lead
DevOps and Infrastructure
Loadsmart

Loadsmart is a logistics solutions provider that uses technology to automate freight transportation and move freight more efficiently.

Air Freight & Logistics
251-1K
$346M raised

Description

  • Collaborate with and support the development team across engineering squads.
  • Design, deploy, and operate critical systems while balancing reliability, cost, and agility.
  • Drive reliability projects in partnership with engineering teams.
  • Troubleshoot system issues and perform root-cause analysis of operational incidents.
  • Own the platform’s Service Level Agreements and Service Level Objectives.
  • Provide infrastructure support during off-hours as needed.
  • Take ownership of software infrastructure projects.
  • Review code and specifications and participate in constructive feedback cycles.
  • Collect and analyze metrics, and communicate their business impact to the team.
  • Analyze, propose, and implement safer systems and processes.

Requirements

  • 1-3 years of experience leading reliability work across multiple engineering squads.
  • 5+ years of experience in Cloud Computing, SRE, or DevOps.
  • Proven experience collaborating with internal stakeholders across multiple engineering squads.
  • Strong project management skills with demonstrated ability to delegate and mentor team members.
  • Proficient in English, both written and spoken, for collaboration in an international team.
  • Detail-oriented with high initiative and self-motivation.
  • Strong understanding of software engineering principles and how systems work under the hood.
  • In-depth knowledge of modern networking and operating systems.
  • Experience with AWS and cloud environments, containers (Docker, Kubernetes), and DevOps engineering, including CI/CD pipelines and testing.
  • Familiarity with automation tools and provisioners such as Terraform, Ansible, or Chef.
  • Solid troubleshooting and system engineering experience in UNIX/Linux production environments.
  • Experience with monitoring, alerting, and incident management.
  • Proficiency in scripting languages such as Python and Bash.
  • Experience with or exposure to PostgreSQL and DBA responsibilities is a plus.

Benefits

  • Competitive base salary.
  • Extremely competitive equity package.
  • Flexible PTO through Loadie Time Off.
  • Unlimited PTO and sick days.
  • Remote work from anywhere in Brazil.
