Smarsh

Smarsh provides cloud-based archiving and compliance solutions that help organizations in regulated and litigious industries manage the risks associated with their electronic communications across more than 80 channels.

IT Services
251-1K
Founded 2001
$44M raised

Description

  • Collect, analyze, and interpret small and large datasets to generate insights for statistical and machine learning methods.
  • Lead the design, training, and deployment of NLP and transformer-based models for financial surveillance and supervisory use cases.
  • Develop machine learning models and analytics using established workflows while identifying opportunities for optimization and improvement.
  • Perform data annotation, quality review, exploratory data analysis, and model fail-state analysis.
  • Contribute to model governance, documentation, and explainability frameworks aligned with internal and regulatory AI standards.
  • Provide guidance to clients and prospects on machine learning model and analytics fine-tuning and development processes.
  • Mentor junior team members on model development and exploratory data analysis.
  • Work with product managers to translate project requirements into technical tasks and team workflows.
  • Collaborate with data scientists, researchers, engineers, and business leaders across end-to-end data science initiatives.
  • Continue self-directed professional development in data science and applied machine learning.

Requirements

  • Strong understanding of financial markets, compliance, surveillance, supervision, or regulatory technology.
  • Experience with data science and machine/deep learning frameworks and tools such as scikit-learn, H2O, Keras, PyTorch, TensorFlow, pandas, numpy, caret, or tidyverse.
  • Command of data science and statistics principles, including regression, Bayesian methods, time series, clustering, precision/recall, AUROC, and exploratory data analysis.
  • Strong knowledge of programming concepts such as split-apply-combine, data structures, and object-oriented programming.
  • Solid statistics knowledge, including hypothesis testing, ANOVA, and chi-square tests.
  • Knowledge of NLP transfer learning, including word embeddings, models such as BERT, SBERT, and GPT variants, and the Hugging Face ecosystem.
  • Experience with NLP toolkits such as NLTK, spaCy, or Nvidia NeMo.
  • Knowledge of microservices architecture and continuous delivery concepts in machine learning, including Helm, Docker, and Kubernetes.
  • Familiarity with deep learning techniques for NLP and with LLM tools such as Ollama and LangChain.
  • Excellent verbal and written communication skills.
  • Proven ability to collaborate effectively on cross-functional teams.
  • Master’s or PhD in Computer Science, Applied Math, Statistics, or a scientific field preferred.
  • Familiarity with cloud platforms such as AWS, GCP, or Azure preferred.
  • Experience with automated supervision, surveillance, or compliance tools preferred.

Benefits

  • Base salary range of $166,000 to $214,000 per year.
  • Bonus programs discussed during the recruiting process.
  • Compensation determined by factors including experience, education, location, specialty, training, and internal equity.
  • Work from Atlanta, New York, or remotely anywhere in the U.S.
