Site Reliability Engineer

1 month, 4 weeks ago
Full-time
Senior
DevOps and Infrastructure
TextNow

TextNow

TextNow is a leading provider of free phone service, offering calling and texting through its app and SIM card. With a focus on affordability and innovation, TextNow is revolutionizing mobile phone service with cloud-based technology, providing users w...

Wireless Telecommunication Services
51-250
Founded 2009

Description

  • Design, build, and maintain scalable, resilient, highly available systems for TextNow’s infrastructure and services.
  • Develop and maintain infrastructure automation using Terraform, Ansible, and related tools.
  • Support cloud deployment, scaling, and operations for AWS-based systems.
  • Participate in an on-call rotation and respond to production incidents.
  • Troubleshoot issues, drive incident resolution, and reduce downtime.
  • Conduct post-mortems and implement corrective actions to improve reliability.
  • Implement and improve observability through logging, metrics, and monitoring solutions.
  • Collaborate with software engineers, DevOps, and product teams to improve reliability from development to production.
  • Identify opportunities to improve architecture, automation, and operational practices.
  • Contribute to the design and implementation of new SRE best practices.

Requirements

  • 5+ years of experience in an operationally focused role such as SRE, DevOps, or Infrastructure Engineering.
  • Deep understanding of reliability, scalability, and performance optimization.
  • Hands-on experience with AWS, GitHub, Terraform, Ansible, or similar tools.
  • Experience handling production incidents, performing root cause analysis, and implementing long-term fixes.
  • Strong focus on automation and scripting to reduce operational toil.
  • Experience building robust observability with logging, metrics, and monitoring tools.
  • Ability to work cross-functionally with engineers, product teams, and leadership.
  • Experience in a remote or distributed working environment is preferred.
  • Canada-based role with compensation listed in CAD and select USD markets.
  • Applicants must be eligible to work in the relevant hiring location.

Benefits

  • Competitive pay with a stated salary range of $113,400 - $162,000 CAD.
  • Employee stock options.
  • Unlimited vacation and 12 paid holidays per year.
  • Flexible work arrangements, including work-from-home, remote, or office access.
  • Health, dental, and vision benefits.
  • Short-term and long-term disability coverage.
  • $750 annual wellness benefit or healthcare spending account.
  • RRSP matching in Canada or 401(k) in the USA.
  • Parental leave for eligible employees.
  • Learning and development opportunities.
  • Free phone service.
  • Strong work-life blend.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Staff Operations Engineer

Mozilla 251-1K Internet Software & Services

Mozilla is hiring a Staff Operations Engineer to lead the design, reliability, and evolution of hybrid-cloud and workplace infrastructure across teams.

Ansible DNS Linux Puppet Python TCP/IP Unix
7 hours, 22 minutes ago

Principal Site Reliability Engineer (SRE)

Symmetrio Professional Services

Symmetrio is recruiting a Principal Site Reliability Engineer for a rapidly growing healthcare technology company to own the reliability, scalability, security, and performance of a mission-critical SaaS platform used by healthcare providers across the United States.

Active Directory AWS CI/CD Datadog Django Grafana Kubernetes Python Terraform Windows Server
7 hours, 38 minutes ago

Performance Test Engineer Lead

PartnerOne 51-250 Media

An enterprise performance engineering role at a cloud-focused organization, responsible for validating the scalability, stability, and production readiness of distributed systems across Azure and hybrid environments.

Azure CI/CD Kubernetes PowerShell
7 hours, 53 minutes ago

Site Reliability Engineer

MLabs 11-50 Internet Software & Services

Remote UK-hours Site Reliability Engineering role at a financial technology company, focused on automating and operating the infrastructure that supports global integration services for financial institutions.

Active Directory Ansible AWS CI/CD GCP OAuth PostgreSQL SAML
8 hours, 8 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers