Toptal

Toptal

Toptal is a curated talent marketplace connecting freelance software developers, designers, finance experts, product managers, and project managers with businesses globally. With a focus on top 3% talent in software engineering, design, and finance, To...

Construction & Engineering
5K-10K
Founded 2010
$4M raised

Description

  • Act as the founding Data Scientist and define the data science strategy, tools, frameworks, and best practices for the product.
  • Design and build Task Mining and Process Mining solutions that surface workflows, patterns, bottlenecks, and optimization opportunities.
  • Design, develop, and deploy machine learning systems and data pipelines for structured, unstructured, event, and interaction data at scale.
  • Build predictive and pattern-discovery solutions using supervised learning, unsupervised learning, representation learning, sequence modeling, and GenAI or LLM approaches where appropriate.
  • Establish dataset construction, labeling, evaluation, monitoring, feedback loops, and human-in-the-loop review foundations.
  • Own projects end to end from problem framing and experimentation through production deployment and iteration.
  • Collaborate with engineering on data instrumentation, pipeline design, deployment, and integration of production-ready services.
  • Communicate findings, tradeoffs, and technical concepts clearly to technical and business stakeholders.

Requirements

  • 5+ years of professional experience in Data Science, Machine Learning, or Applied ML roles.
  • Experience operating as the sole or lead Data Scientist on a product or team, owning problems end to end without senior DS supervision.
  • Strong experience with supervised and unsupervised ML, modern ML/data tooling, and selecting the right approach for the problem.
  • Practical familiarity with representation learning, sequence modeling, Transformers, LLMs, or GenAI systems for product use cases.
  • Experience handling large-scale structured, unstructured, event, or interaction datasets.
  • Advanced proficiency in Python and SQL.
  • Hands-on experience with tools such as PyTorch, scikit-learn, pandas or Polars, experiment tracking, and production ML workflows.
  • Experience deploying ML models, data pipelines, or intelligent systems into production.
  • Familiarity with Task Mining, Process Mining, event-log analysis, behavioral analytics, workflow automation, or adjacent domains.
  • Advanced degree in Computer Science, Data Science, AI, Statistics, Mathematics, or a related field is a plus; equivalent practical experience is strongly valued.
  • Previous experience as a first or early Data Scientist at a startup or new product line is a plus.
  • Direct experience with LLMs and Generative AI applications, especially evaluation, structured outputs, semantic labeling, summarization, or human-in-the-loop workflows is a plus.
  • Experience working with privacy-sensitive behavioral, productivity, or user-interaction data is a plus.
  • Experience with product experimentation, causal inference, or measuring the impact of workflow or process interventions is a plus.
  • Knowledge of MLOps and distributed processing frameworks such as Spark is a plus.
  • Experience with cloud environments, especially GCP, is a plus.
  • Resumes and communication must be submitted in English.
  • This is a remote role with no visa sponsorship or visa assistance provided.

Benefits

  • Remote position with eligibility to work from anywhere in Europe or South America.
  • Join Toptal’s fully remote global workforce.
  • Work on a new product in a founding role with high ownership and influence.
  • Opportunity to shape the data science function from the ground up.
  • Access to a support structure designed to encourage innovation, social interaction, and fun.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Director, Biostatistics

Definium Therapeutics 51-250 Health Care Providers & Services

Definium Therapeutics is seeking a Director of Biostatistics to lead statistical support for its clinical development pipeline, regulatory filings, and commercialization efforts.

SAP Statistics
5 minutes ago

Health Science Research Intern

OURA 251-1K Health Care Providers & Services

Oura is hiring a remote U.S. Health Science Research Intern to support clinical and real-world evidence research by contributing to study design, documentation, and data-driven insights for its Health Science team.

Python R SQL
5 minutes ago

Lead AIML

Weekday 11-50 Construction & Engineering

Nomiso is hiring a senior AIML Developer in India to apply AI and machine learning to customer-facing products and operational challenges for a remote, full-time role.

Apache Spark AWS Docker Git GitLab Go Jenkins Kubernetes Machine Learning Microservices Neural Networks NLP Python PyTorch R Random Forest Scala TensorFlow
5 minutes ago

Senior Data Scientist, Guest Travel Insurance (Algorithms)

Airbnb 5K-10K Hotels, Restaurants & Leisure

Airbnb is hiring a Data Scientist for AirCover to develop machine learning systems that personalize guest insurance coverage and messaging across the booking and travel journey.

Apache Airflow Computer Vision Deep Learning LLM Machine Learning Python PyTorch Reinforcement Learning SQL TensorFlow
35 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers