TrueML

TrueML develops innovative financial technology solutions that enhance customer experience and aim to improve the financial health of consumers by addressing their unique needs and preferences.

Industry: Internet Software & Services
Company size: 51-250 employees
Founded: 2013

Description

  • Define and execute the long-term strategy for Infrastructure as Code, CI/CD, and cloud-native architecture.
  • Lead the design and implementation of self-service internal platforms that reduce developer friction and improve deployment velocity.
  • Own cloud spend for AWS, including cost optimization initiatives and vendor contract negotiations.
  • Ensure infrastructure meets high availability and disaster recovery requirements across multiple regions.
  • Oversee monitoring, logging, and distributed tracing, using AIOps to shift from reactive to predictive operations.
  • Integrate automated vulnerability scanning, secret management, and compliance checks into build pipelines.
  • Serve as the escalation point for major production outages and lead blameless post-mortems focused on systemic improvement.
  • Write and review code in Python, Go, or Bash to automate operational tasks and integrations.
  • Develop and maintain Terraform-based infrastructure provisioning and complex CI/CD workflows.
  • Troubleshoot and optimize Kubernetes, EKS, networking, scaling, IAM policies, and incident-response integrations across the DevOps toolchain.
  • Recruit, mentor, and develop a DevOps team while setting goals, conducting performance reviews, and partnering with other engineering leaders.

Requirements

  • Bachelor's degree in Computer Science, Engineering, or a related technical field, or equivalent practical experience.
  • 10+ years of experience in DevOps, Site Reliability Engineering (SRE), or Software Engineering.
  • 5+ years of experience managing engineers.
  • Expert-level AWS experience, including multi-region and high-availability deployments.
  • Advanced experience with Kubernetes and Docker in production environments.
  • Proficiency with Terraform for infrastructure automation.
  • Experience designing and maintaining CI/CD pipelines using GitHub Actions, GitLab CI, Jenkins, ArgoCD, or Atlantis.
  • Strong scripting ability in Python, Go, or Bash.
  • Experience with monitoring, observability, and tracing tools such as Datadog or Observe, plus SRE concepts like SLIs, SLOs, and error budgets.
  • Experience serving as Incident Commander for high-severity outages and using blameless post-mortems.
  • Ability to influence executive leadership and collaborate cross-functionally with Product, Engineering, and Security.
  • Experience integrating AI-assisted productivity tools such as Cline or GitHub Copilot into engineering workflows.
  • AWS or Kubernetes certifications are a plus, though hands-on production experience carries more weight.
  • Experience leading organizational platform migrations is a plus.
  • Open source contributions or experience at high-velocity, product-driven technology companies is a plus.

Benefits

  • Competitive salary of $150,000 to $220,000 per year.
  • Fully remote role in the USA.
  • Opportunity to work on mission-critical, product-oriented infrastructure and platform engineering.
  • Leadership role with direct influence over architecture, tooling, and reliability strategy.
