Smarsh

Smarsh provides cloud-based archiving and compliance solutions that help organizations in regulated and litigious industries manage the risks associated with their electronic communications across more than 80 channels.

IT Services
251-1K
Founded 2001
$44M raised

Description

  • Own Kubernetes platform operations, including cluster health, workload deployments, scaling, and incident response.
  • Design, implement, and operate infrastructure automation using Ansible, Terraform, and GitOps workflows.
  • Lead migration projects that move on-premises workloads toward cloud-native platform services.
  • Build and maintain CI/CD pipelines for infrastructure and application delivery.
  • Improve observability through dashboards, alert tuning, and SLO/SLA definition using Datadog, Splunk, and ELK.
  • Participate in the on-call rotation and respond to P1/P2 incidents.
  • Support security and compliance needs, including patch management, access controls, and audit readiness.
  • Contribute to runbooks and operational documentation for owned systems.
  • Collaborate with adjacent platform teams on the build and adoption of a shared platform.

Requirements

  • 4–7 years of experience in platform engineering, SRE, or infrastructure engineering roles.
  • Strong hands-on experience with Kubernetes, including cluster operations, Helm, and workload troubleshooting.
  • Proficiency with infrastructure-as-code tooling, specifically Ansible and/or Terraform in production environments.
  • Strong Linux systems administration skills, preferably Ubuntu.
  • Experience with GitOps workflows and CI/CD pipelines at scale.
  • Experience with VMware vSphere in a production environment.
  • Demonstrated ability to self-direct and drive projects to completion with minimal oversight.
  • Strong communication skills with cross-functional stakeholders.
  • Experience with Datadog, Splunk, or ELK for dashboards, monitors, and log management is preferred.
  • Familiarity with compliance-sensitive or regulated industry infrastructure is preferred.
  • Experience with Argo CD, Flux, or similar GitOps continuous delivery tooling is preferred.
  • Familiarity with Jenkins or Concourse for CI/CD pipeline management is preferred.
  • Familiarity with VMware vSphere Kubernetes Service (VKS) or other VMware-native Kubernetes platforms is preferred.
  • Python scripting for automation and tooling is preferred.
  • Prior experience in an on-call rotation with a defined SLA structure is preferred.
  • Experience with cloud infrastructure, especially AWS, is beneficial as cloud responsibilities expand.

Benefits

  • Base salary range of $120,000 to $160,000 per year.
  • Bonus programs may be available and will be discussed during the recruiting process.
  • Local cost of living is considered in offer determination.
  • Remote work arrangement.
  • Opportunity to work on a major platform modernization effort.
  • Chance to take on expanding cloud infrastructure responsibilities.
  • Work in a collaborative, global organization that values learning and development.
