C the Signs

C the Signs

C the Signs is a cutting-edge cancer prediction system that uses artificial intelligence to enhance early detection and survival rates. The company is dedicated to reducing healthcare disparities by accelerating early cancer detection, improving patien...

Professional Services
51-250
Founded 2017

Description

  • Lead the design and evolution of the cloud-native data platform on Google Cloud Platform.
  • Architect robust, scalable ETL/ELT pipelines and orchestration workflows for clinical and operational data.
  • Build ingestion pipelines for HL7, FHIR, DICOM, and custom healthcare data formats.
  • Develop streaming and batch dataflows to support AI/ML development and model training.
  • Design and maintain BigQuery datasets, warehouse structures, and semantic layers.
  • Establish data engineering best practices, coding standards, and architectural patterns.
  • Implement data quality checks, automated testing, monitoring, lineage, and documentation practices.
  • Ensure HIPAA compliance and proper handling of PHI/PII across pipelines and cloud environments.
  • Collaborate with product, analytics, clinical informatics, machine learning, engineering, and security teams.
  • Provide technical direction for multi-cloud integrations and support AWS-based partner systems when needed.
  • Assist with recruiting and mentoring junior data engineers.

Requirements

  • 7+ years of data engineering experience.
  • 2–3+ years in a lead or senior technical role.
  • Deep hands-on expertise in Google Cloud Platform, especially BigQuery, Healthcare API, Cloud Storage, Pub/Sub, and Cloud Run/Functions.
  • Strong proficiency with dbt (Core or Cloud).
  • Strong proficiency with Airflow (Cloud Composer or self-managed).
  • Strong proficiency with Python and advanced SQL, preferably BigQuery SQL.
  • Hands-on experience with healthcare standards including FHIR (R4/US Core), HL7 v2/v3, DICOM, C-CDA, and X12.
  • Strong understanding of PHI handling, HIPAA compliance, and healthcare interoperability.
  • Preferred experience with AWS services such as Redshift, Lambda, S3, Glue, Kinesis, Athena, API Gateway, and Step Functions.
  • Experience building or maintaining multi-cloud pipelines bridging GCP and AWS.
  • Background with Dataflow/Beam or other stream processing frameworks.
  • Experience with EHR integrations, claims processing, HIEs, or clinical data networks.
  • Familiarity with ML-enabled data pipelines or feature engineering in healthcare contexts.

Benefits

  • Competitive salary and benefits package.
  • Flexible working arrangements, including remote or hybrid options.
  • Opportunity to work on life-changing AI technology that directly impacts patient outcomes.
  • Mission-driven work focused on improving health equity and saving lives.
  • Continuous learning opportunities with access to the latest tools and advancements in AI and healthcare.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Mid-Level Data Engineer | CARE

Wellhub 1-10 Gas Utilities

Wellhub is hiring a Brazil-based remote Data Engineer for its CARE (Coach on Artificial Intelligence for Real Engagement) team to build cloud-based data solutions that support AI-powered wellness engagement products.

Apache Airflow Kafka
24 minutes ago

Data Engineer (Python)

Orcrist Technologies Internet Software & Services

Orcrist is seeking a Data Engineer to prototype and validate new data pipelines and connectors for its Kubernetes-based intelligence platform, with the goal of producing adoptable blueprints for productization.

Hive Kafka Kubernetes PostgreSQL Python SQL Trino
54 minutes ago

Data Engineer I

Capital Rx 251-1K Health Care Providers & Services

Judi Health is hiring a Data Engineer I (Analytics Engineering) to build analytics-ready datasets, trusted metrics, and scalable data models for Capital Rx’s healthcare data platform.

CI/CD dbt Git PostgreSQL Python Snowflake SQL
54 minutes ago

Data Engineer (Remote-US)

DataKind 51-250 Diversified Consumer Services

DataKind is hiring a remote Data Engineer to support higher education institutions by building and maintaining the UDTS data platform and partnering directly with clients to improve student outcomes.

Databricks GCP JavaScript Python SQL
1 hour, 9 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers