Plume

Plume

Plume provides virtual gender-affirming healthcare services specifically designed for the transgender and gender nonconforming community, offering accessible and supportive care through a mobile platform.

Family Services
51-250
Founded 2019
$38M raised

Description

  • Build and maintain production-grade data pipelines in cloud data warehouses such as BigQuery or equivalent.
  • Design and develop dbt models across bronze, silver, and gold layers with testing, documentation, and incremental loading.
  • Create and optimize Airflow DAGs for scheduling, dependencies, monitoring, error handling, and alerting.
  • Implement dimensional data models and data mart structures that support clinical BI and ML feature consumption.
  • Build dashboards and visualizations in Looker or equivalent BI tools in collaboration with cross-functional stakeholders.
  • Integrate healthcare data from EHRs, Stripe, third-party APIs, and application databases into a unified data platform.
  • Apply HIPAA-compliant data handling practices, including PHI/PII masking, tokenization, audit logging, and access controls.
  • Architect and implement RAG pipelines, including document ingestion, chunking, embedding generation, and retrieval.
  • Support MLOps workflows, including training pipeline maintenance, deployment support, monitoring, and retraining triggers.
  • Code review teammates’ pull requests, provide technical feedback, and uphold engineering standards.
  • Collaborate with product managers to define requirements and deliver reliable data and AI products.
  • Monitor and triage pipeline and data quality failures, escalating architectural issues when needed.
  • Document pipeline designs, data models, and technical decisions to support governance and lineage tracking.
  • Evaluate new tools and frameworks through hands-on prototyping and technical assessment.

Requirements

  • 5+ years of hands-on experience in data engineering, analytics engineering, or a closely related role.
  • 2+ years of experience in the healthcare industry with knowledge of healthcare data standards, clinical workflows, regulated data environments, and domain-specific reporting.
  • Working knowledge of HIPAA, including PHI/PII classification, data masking, audit logging, and access control requirements.
  • Production experience with at least one major cloud data warehouse: BigQuery, Snowflake, or Redshift.
  • Strong hands-on experience with dbt Core or dbt Cloud, including incremental models, tests, documentation, and multi-environment workflows.
  • Deep experience with Apache Airflow for orchestration, including DAG design, scheduling, monitoring, and failure handling.
  • Demonstrated knowledge of dimensional modeling, including star and snowflake schemas, SCD Type 1/2, and fact/dimension table design.
  • Hands-on experience delivering dashboards and reports in an enterprise BI tool such as Looker, Power BI, Tableau, or Qlik.
  • Proficiency in Python for data pipelines, API integrations, and automation, including Pandas, PySpark, or similar.
  • Practical exposure to RAG pipeline development and LLM integration using LangChain, LangGraph, or LlamaIndex.
  • Hands-on exposure to MLOps concepts, including model deployment, monitoring, and retraining workflows.
  • Knowledge of CI/CD tooling for data and AI workloads, such as GitHub Actions or dbt Cloud CI.
  • Strong understanding of data quality and governance principles, including lineage, access controls, data contracts, and automated testing.
  • Experience with data governance tools such as OpenMetadata.
  • Excellent written and verbal communication skills and the ability to collaborate across engineering, analytics, and clinical teams.
  • Ability to work independently while keeping leadership informed of progress, blockers, and risks.
  • Experience with real-time or streaming data pipelines using Kafka, Kinesis, or Pub/Sub is preferred.
  • Knowledge of vector databases such as Pinecone, Weaviate, FAISS, or Chroma is preferred.
  • Familiarity with responsible AI principles, including bias detection and model explainability in healthcare, is preferred.
  • Experience with data observability tools such as Monte Carlo, Bigeye, or Soda is preferred.
  • Familiarity with data lakehouse patterns such as Delta Lake, Iceberg, or Apache Hudi is preferred.
  • Experience working toward or maintaining SOC2 or HITRUST certification is preferred.
  • Familiarity with semantic layer tools such as Looker LookML or dbt Semantic Layer is preferred.
  • Experience with population health, revenue cycle, or clinical quality reporting datasets is preferred.
  • Exposure to Kubernetes or containerized ML workloads is preferred.
  • Must be legally authorized to work in the USA and reside in the USA.

Benefits

  • $158,000 - $168,000 annual salary.
  • Ground-floor equity (Series B).
  • Free medical, dental, and vision coverage starting the first of the month after full-time start.
  • Unlimited PTO.
  • 11 paid holidays plus a company shutdown for one week in December.
  • 401(k) retirement plan.
  • Free Plume and BetterHelp subscriptions.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Data Conversion Software Engineer

Career TEAM 251-1K Professional Services

Career Team is hiring a Data Conversion Software Engineer to build data transformation and integration software for government-funded workforce development programs across the United States.

Agile Angular CI/CD Docker Express.js JavaScript JSON MongoDB NestJS Next.js Node.js React Scrum TypeScript XML
15 hours, 28 minutes ago

Sr. Associate Data Platform - Remote

TWO95 International 51-250 Internet Software & Services

Sr. Associate Data Platform is a contract role with a Los Angeles-based team supporting Adobe analytics and data platform implementation work across on-site and remote locations.

CSS Digital Marketing HTML JavaScript jQuery Vue.js
15 hours, 43 minutes ago

Freelance Data Scraping Engineer (Python)

Mindrift.ai: Be the “I” in AI Internet Software & Services

Mindrift is hiring a part-time remote Python Data Scraping Engineer for the Tendem project to deliver accurate, structured data extraction and processing within a hybrid AI-plus-human workflow.

AJAX GitHub JavaScript JSON LLM Python Selenium
15 hours, 43 minutes ago

Freelance Data Scraping Engineer (Python)

Mindrift.ai: Be the “I” in AI Internet Software & Services

Mindrift is hiring a part-time freelance Python Data Scraping Engineer for the Tendem project to build and oversee data extraction workflows in a hybrid AI + human environment.

AJAX GitHub JavaScript JSON LLM Python Selenium
15 hours, 43 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers