Plume

Plume provides virtual gender-affirming healthcare services specifically designed for the transgender and gender nonconforming community, offering accessible and supportive care through a mobile platform.

Family Services

Consumer Discretionary

51-250 (150)

Founded 2019

$38M raised

2 open positions

Senior Data Engineer (Data + Applied AI)

2 months, 1 week ago

United States

Full-time

Senior

Data Engineer

Software Development

Apache Airflow, Apache Spark, dbt, GitHub Actions, HIPAA, Kafka, Kubernetes, Looker, MLOps, Pandas, Power BI, Python, SQL, Tableau

Description

Build and maintain production-grade data pipelines in cloud data warehouses such as BigQuery or equivalent.
Design and develop dbt models across bronze, silver, and gold layers with testing, documentation, and incremental loading.
Create and optimize Airflow DAGs for scheduling, dependencies, monitoring, error handling, and alerting.
Implement dimensional data models and data mart structures that support clinical BI and ML feature consumption.
Build dashboards and visualizations in Looker or equivalent BI tools in collaboration with cross-functional stakeholders.
Integrate healthcare data from EHRs, Stripe, third-party APIs, and application databases into a unified data platform.
Apply HIPAA-compliant data handling practices, including PHI/PII masking, tokenization, audit logging, and access controls.
Architect and implement RAG pipelines, including document ingestion, chunking, embedding generation, and retrieval.
Support MLOps workflows, including training pipeline maintenance, deployment support, monitoring, and retraining triggers.
Review teammates’ pull requests, provide technical feedback, and uphold engineering standards.
Collaborate with product managers to define requirements and deliver reliable data and AI products.
Monitor and triage pipeline and data quality failures, escalating architectural issues when needed.
Document pipeline designs, data models, and technical decisions to support governance and lineage tracking.
Evaluate new tools and frameworks through hands-on prototyping and technical assessment.

Requirements

5+ years of hands-on experience in data engineering, analytics engineering, or a closely related role.
2+ years of experience in the healthcare industry with knowledge of healthcare data standards, clinical workflows, regulated data environments, and domain-specific reporting.
Working knowledge of HIPAA, including PHI/PII classification, data masking, audit logging, and access control requirements.
Production experience with at least one major cloud data warehouse: BigQuery, Snowflake, or Redshift.
Strong hands-on experience with dbt Core or dbt Cloud, including incremental models, tests, documentation, and multi-environment workflows.
Deep experience with Apache Airflow for orchestration, including DAG design, scheduling, monitoring, and failure handling.
Demonstrated knowledge of dimensional modeling, including star and snowflake schemas, SCD Type 1/2, and fact/dimension table design.
Hands-on experience delivering dashboards and reports in an enterprise BI tool such as Looker, Power BI, Tableau, or Qlik.
Proficiency in Python for data pipelines, API integrations, and automation, including Pandas, PySpark, or similar.
Practical exposure to RAG pipeline development and LLM integration using LangChain, LangGraph, or LlamaIndex.
Hands-on exposure to MLOps concepts, including model deployment, monitoring, and retraining workflows.
Knowledge of CI/CD tooling for data and AI workloads, such as GitHub Actions or dbt Cloud CI.
Strong understanding of data quality and governance principles, including lineage, access controls, data contracts, and automated testing.
Experience with data governance tools such as OpenMetadata.
Excellent written and verbal communication skills and the ability to collaborate across engineering, analytics, and clinical teams.
Ability to work independently while keeping leadership informed of progress, blockers, and risks.
Experience with real-time or streaming data pipelines using Kafka, Kinesis, or Pub/Sub is preferred.
Knowledge of vector databases such as Pinecone, Weaviate, FAISS, or Chroma is preferred.
Familiarity with responsible AI principles, including bias detection and model explainability in healthcare, is preferred.
Experience with data observability tools such as Monte Carlo, Bigeye, or Soda is preferred.
Familiarity with data lakehouse patterns such as Delta Lake, Iceberg, or Apache Hudi is preferred.
Experience working toward or maintaining SOC 2 or HITRUST certification is preferred.
Familiarity with semantic layer tools such as Looker LookML or dbt Semantic Layer is preferred.
Experience with population health, revenue cycle, or clinical quality reporting datasets is preferred.
Exposure to Kubernetes or containerized ML workloads is preferred.
Must be legally authorized to work in the USA and reside in the USA.

Benefits

$158,000 - $168,000 annual salary.
Ground-floor equity (Series B).
Free medical, dental, and vision coverage starting the first of the month after full-time start.
Unlimited PTO.
11 paid holidays plus a company shutdown for one week in December.
401(k) retirement plan.
Free Plume and BetterHelp subscriptions.

Similar Roles

Data Conversion Software Engineer

Career TEAM 251-1K Professional Services

Career Team is hiring a Data Conversion Software Engineer to build data transformation and integration software for government-funded workforce development programs across the United States.

United States Full-time Mid Level Data Engineer Software Engineer

$60k-$100k

Agile, Angular, CI/CD, Docker, Express.js, JavaScript, JSON, MongoDB, NestJS, Next.js, Node.js, React, Scrum, TypeScript, XML

15 hours, 28 minutes ago

Sr. Associate Data Platform - Remote

TWO95 International 51-250 Internet Software & Services

Sr. Associate Data Platform is a contract role with a Los Angeles-based team supporting Adobe analytics and data platform implementation work across on-site and remote locations.

United States Contract Senior Data Engineer

CSS, Digital Marketing, HTML, JavaScript, jQuery, Vue.js

15 hours, 43 minutes ago

Freelance Data Scraping Engineer (Python)

Mindrift.ai: Be the “I” in AI Internet Software & Services

Mindrift is hiring a part-time remote Python Data Scraping Engineer for the Tendem project to deliver accurate, structured data extraction and processing within a hybrid AI-plus-human workflow.

Kuwait Qatar Saudi Arabia Turkey Israel Jordan Part-time Junior Backend Engineer Data Engineer

Up to $37k

AJAX, GitHub, JavaScript, JSON, LLM, Python, Selenium

15 hours, 43 minutes ago

Freelance Data Scraping Engineer (Python)

Mindrift.ai: Be the “I” in AI Internet Software & Services

Mindrift is hiring a part-time freelance Python Data Scraping Engineer for the Tendem project to build and oversee data extraction workflows in a hybrid AI + human environment.

United States Part-time Mid Level Backend Engineer Data Engineer

AJAX, GitHub, JavaScript, JSON, LLM, Python, Selenium

15 hours, 43 minutes ago

