Data & Semantic Model Architect

1 month, 3 weeks ago
Full-time
Senior
Data Science and Analytics
TetraScience

TetraScience is the only vendor-neutral, open, cloud-native platform purpose-built for science, providing next-generation lab data automation and scientific data management to accelerate scientific discovery and improve human life.

Biotechnology
51-250
Founded 2019
$99M raised

Description

  • Design and own the Common Data Models (CDMs) and the Exchange Layer that provide a standardized semantic foundation for scientific data across customers.
  • Move the platform from bespoke, one-off mappings to a reusable, interoperable exchange layer that supports cross-customer data flow.
  • Define and create data contracts and standardized definitions that Forward Deployed Engineers and Architects rely on for rapid, reliable integrations.
  • Balance global standardization and local extensibility by defining immutable core model aspects and permissible extensions.
  • Translate high-level business goals into concrete data modeling strategies that shorten time-to-insight for scientific use cases.
  • Design and implement complex ontologies, taxonomies, and semantic relationships linking domain entities (e.g., ELN entities to assay results).
  • Collaborate with Engineering to integrate models into software systems, ensuring ontologies support query performance and system scalability.
  • Establish governance for data quality, versioning, enforcement, and evolution of data contracts to ensure downstream consumer trust.
  • Partner with Scientific Business Analysts to convert ambiguous scientific requirements into rigorous, machine-readable data structures.
  • Architect models to ensure data is FAIR (Findable, Accessible, Interoperable, Reusable) and suitable for downstream AI/ML applications.

Requirements

  • 7+ years of experience in data architecture, informatics, or technical product leadership, especially within life sciences, healthcare, or manufacturing technology, or equivalent demonstrable experience unifying complex, multidomain data models and semantic layers.
  • Direct, hands-on experience implementing and extending Common Data Model frameworks such as HL7 FHIR, OMOP (OHDSI), Allotrope, or CDISC, with an understanding of their strengths and limitations for biopharma R&D.
  • Proven mastery of terminology standardization and semantic curation, including semantic mapping, aggregation, and value set creation across vocabularies and instance data.
  • Experience designing and enforcing data contracts in microservices or platform environments, including versioning and governance approaches.
  • Hands-on expertise with semantic web standards such as RDF, OWL, SHACL, and SPARQL, and with labeled property graph (LPG) concepts.
  • Demonstrated experience building data platforms or exchange layers that prioritize standardization and reusability across multiple customers.
  • Architectural versatility to move between high-level system design and low-level entity-relationship modeling.
  • Strong technical background with the ability to read code, understand API contracts, and discuss database internals and query performance trade-offs.
  • Bachelor's or Master's degree in a relevant field (e.g., Medical Informatics, Computer Science, Bioinformatics, Physics).
