Data Scientist/Machine Learning Engineer

1 day, 12 hours ago
Full-time
Mid Level
Software Development

Sumble

Sumble provides AI-powered account intelligence for enterprise sales teams, helping users understand team structures, reporting lines, tech stacks, and other account signals to drive pipeline.

Technology, Information and Internet
11-50
$38M raised

Description

  • Finetune small language models for data quality and enrichment workflows.
  • Improve the quality of existing data using scalable methods and validation techniques.
  • Verify and correct entity relationships such as company URLs, headquarters addresses, and parent-subsidiary mappings.
  • Add new signals by scrubbing, matching, normalizing, and aligning them to the existing ontology.
  • Push data quality solutions into production and support them in data pipelines and backend systems.
  • Use techniques such as LLM validation, SERP checks, and cross-source triangulation to improve accuracy.
  • Work with growing sets of data sources, machine learning models, and large-scale data operations.
  • Contribute to systems that support efficient analytics and a strong product-led growth experience.

Requirements

  • Must be located within Americas time zones.
  • Experience working with small language models, LLMs, or machine learning-based data workflows is implied by the role.
  • Familiarity with Python and backend or data pipeline environments.
  • Experience with data cleaning, matching, normalization, or ontology mapping is relevant for the role.
  • Knowledge of ML/data tooling such as PyTorch, Hugging Face, Gemma models, LoRA, or vLLM is a plus.
  • Experience with FastAPI, React, Typescript, PostgreSQL, DuckDB, or Google Cloud Platform is a plus.
  • Ability to work on production systems and operational data infrastructure.
  • Experience in environments handling noisy datasets and multiple data sources is a plus.

Benefits

  • Medical, dental, and vision coverage for US employees.
  • 401(k) plan for US employees.
  • Target of 4 weeks of PTO.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Senior Data Scientist, Platform - Identity/Algorithms

Airbnb 5K-10K Hotels, Restaurants & Leisure

Airbnb is hiring a Full-Stack Data Scientist for its Identity Data Science team to help improve identity verification and fraud detection systems that protect trust across its global platform.

Computer Vision Feature Engineering LLM Python R SQL
40 minutes ago

Senior Machine Learning Engineer

Clover Health 251-1K Insurance

Counterpart Health is hiring a Senior Machine Learning Engineer to build and improve ML, NLP, and LLM systems that support primary care workflows and help deliver better patient outcomes at lower cost.

Feature Engineering LLM Machine Learning NLP NumPy Pandas Python PyTorch Scikit-learn TensorFlow
56 minutes ago

Senior ML-Engineer, Finance

Fundraise Up 51-250 Capital Markets

Fundraise Up is hiring a Senior ML Engineer in Finance to build and own an end-to-end client intelligence system that enriches prospect data, scores potential clients, and integrates results into the sales pipeline.

Apache Airflow CatBoost CI/CD ClickHouse Docker FastAPI Git Grafana Linux LLM MLflow MLOps MongoDB NLP Pandas Python Redis Salesforce SQL
1 hour, 10 minutes ago

Senior Machine Learning Engineer - ML Planner

Motional 1K-5K Automotive

Motional is hiring an ML engineer to help develop, evaluate, and deploy models for scene understanding, behavior prediction, and planning in its autonomous robotaxi stack.

C++ Computer Vision Deep Learning Machine Learning Neural Networks Python PyTorch
1 hour, 10 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers