Elastic

Elastic

Elastic is a leading platform for search-powered solutions, providing real-time insights and making data usable for developers and enterprises worldwide.

Internet Software & Services
1K-5K
Founded 2010

Description

  • Build and maintain the golden customer dataset by unifying GTM signals into a governed source of truth.
  • Design and operate enrichment pipelines, deduplication, entity resolution, and validation systems.
  • Prepare structured and unstructured data for AI workflows such as account research, lead scoring, churn signals, and CSM briefings.
  • Work with the RevOps Data Science team on chunking, embedding strategy, metadata design, and source integration.
  • Implement monitoring, drift detection, and lineage tracking to protect data quality before issues reach forecasts or sellers.
  • Define standards for how RevTech prepares data for AI consumption.
  • Document schemas, pipelines, and data contracts used by downstream teams.
  • Partner with RevOps analysts, GTM systems teams, and the field organization on customer data use cases.
  • Help onboard new data sources faster while keeping the dataset accurate and governed.

Requirements

  • 3+ years of experience building production pipelines that feed ML or LLM-based systems.
  • Experience working with CRM data at scale, including accounts, contacts, opportunities, and leads.
  • Understanding of entity resolution and deduplication challenges in GTM data.
  • Experience preparing data for RAG, embeddings, and AI agents.
  • Knowledge of chunking strategies, metadata enrichment, and embedding model selection.
  • Experience using LLMs for extraction, classification, and normalization, with ability to evaluate results.
  • Strong Python skills and senior-level SQL experience.
  • Experience with cloud infrastructure such as AWS, Azure, or GCP, plus orchestration tools like Airflow or Dagster.
  • Working knowledge of Elasticsearch, vector search, and ESRE, or a strong interest in building with them.
  • Bachelor's or Master's degree in Computer Science, Data Engineering, or a related field (preferred).
  • Prior experience on a RevOps, GTM Systems, or Marketing Operations engineering team (preferred).

Benefits

  • Competitive pay based on the work you do here, not your previous salary.
  • Health coverage for you and your family in many locations.
  • Flexible locations and schedules for many roles.
  • Generous number of vacation days each year.
  • Employer match of up to $2,000, or local currency equivalent, for financial donations and service.
  • Up to 40 hours each year for volunteer projects.
  • Minimum of 16 weeks of parental leave.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

DATA ENGINEER I

Inter 51-250 Banks

The Inter is hiring a Data Engineering professional to join a high-performance technical team working on data engineering solutions for its global financial super app.

Apache Airflow Apache Spark AWS BDD CI/CD Git Kafka Kubernetes Pytest Python SQL
8 hours, 54 minutes ago

Senior FDE Data Engineer

DefenseUnicorns 51-250 Internet Software & Services

Defense Unicorns is hiring a forward-deployed data engineer to deploy and operate a classified DoD data platform in customer environments and help move it into production use.

Apache Airflow Argo CD AWS Azure ClickHouse Dagster Flink Flux GCP GitOps Helm Kafka Kubernetes Linux OAuth OpenID Connect Oracle PostgreSQL Pulumi REST API SOAP SQL SQL Server Terraform TLS Trino
9 hours, 9 minutes ago

Senior Data Engineer

Orion Innovation 1K-5K IT Services

Orion Innovation is seeking a Senior Data Engineer to design and build scalable cloud-native data platforms for its Center of Innovation, supporting enterprise analytics and data-driven applications.

Apache Airflow Apache Spark AWS Azure CI/CD Databricks Docker GCP Git Python Snowflake SQL
9 hours, 9 minutes ago

Senior Data Engineer

Karbon 51-250 Diversified Financial Services

Karbon is hiring a Senior Data Engineer to design and own the cloud data platform foundations behind AI-driven analytics, real-time insights, and multi-tenant data experiences for its global accounting software business.

ArangoDB Azure CI/CD Databricks Datadog dbt HIPAA Kafka MLOps Neo4j Python Scala SQL Terraform
9 hours, 23 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers