Innodata

Innodata

Innodata Inc. is a global leader in data engineering, offering end-to-end AI solutions and platforms for businesses worldwide, combining AI and human expertise to solve complex data challenges.

IT Services
1K-5K
Founded 1988

Description

  • Design and implement data solutions on GCP using BigQuery, Cloud Storage, Dataflow, Pub/Sub, and Looker/BI.
  • Build ETL scripts in SQL and Python to extract, clean, and transform structured and unstructured data from ERP, procurement, logistics, and facility management systems.
  • Develop and optimize data pipelines for ingestion, transformation, and loading into enterprise data lakes and warehouses.
  • Build end-to-end data and BI solutions across extraction, storage, transformation, and visualization layers.
  • Partner with supply chain, real estate, and AI/ML teams to provide pipelines for AI use cases such as RAG ingestion, copilot integration, and multi-agent workflows.
  • Ensure data governance, lineage, and compliance across supply chain datasets.
  • Continuously improve query performance, ETL processes, and pipeline reliability.

Requirements

  • Advanced proficiency in SQL, including complex queries and optimization.
  • Advanced proficiency in Python for data engineering, scripting, and APIs.
  • Experience building ETL/ELT pipelines that operate on structured and unstructured data sources.
  • Knowledge of enterprise data warehouse and data lake architectures.
  • Exposure to data pipelines for AI/ML, including vector DB ingestion, embeddings, RAG pipelines, copilots, and agents.
  • Strong hands-on expertise with GCP services such as BigQuery, Dataflow, Pub/Sub, Cloud Storage, and Looker/BI or similar tools.
  • Familiarity with supply chain or data center operations data is a strong plus.
  • Experience with ML engineering, data visualization tools such as Looker, Tableau, or Power BI, and MLOps practices is a bonus.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Senior AI & Data Engineer

Orion Innovation 1K-5K IT Services

Orion Innovation is seeking a Web Delivery Lead to drive end-to-end delivery of AEM-based global websites for its digital engagement platforms, with a focus on reliable launches, governance, and AI-enabled web operations.

Agile AWS Azure CDN CI/CD Computer Vision DNS Docker GCP GPT GraphQL Hugging Face Java JavaScript Kubernetes LLM NLP Python PyTorch SEO TensorFlow
3 hours, 4 minutes ago

Data Engineer

Pavago IT Services

Pavago is hiring a remote Data Engineer to build and maintain cloud-based data pipelines, warehouses, and analytics datasets that support reporting, automation, and business intelligence.

Apache Airflow AWS CI/CD Dagster dbt Docker GCP GitHub Actions GitLab CI Jenkins Kafka Kubernetes Looker Luigi Power BI Prefect Python Scala Snowflake SQL Tableau Terraform
3 hours, 34 minutes ago

Senior Data Engineer - Surveillance & Interoperability

inventYOU 1-10 Internet Software & Services

Senior Data Engineer - Surveillance & Interoperability at inventYOU, responsible for designing and delivering large-scale data integration and interoperability solutions that support secure information exchange, surveillance, monitoring, and analytics across complex systems.

AWS Azure GCP Python REST API SQL
3 hours, 49 minutes ago

Social Media Data Crawler

Social Media Data Crawler at an unspecified company to build and manage automated tools for extracting and analyzing data from major social platforms in support of marketing, research, and analytics initiatives.

Apache Spark CSS Hadoop HTML Instagram API Java JavaScript NLP Python
3 hours, 49 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers