Welo Global

Welo Global specializes in providing enterprise localization, AI training data, and multilingual content solutions, leveraging a portfolio of specialized global brands to meet diverse customer needs in the AI and language services industry.

Professional Services
Founded 1997

Description

  • Design, build, and maintain ETL pipelines across application databases, cloud data warehouses, third-party APIs, and object storage.
  • Architect new data solutions that support production-level APIs and AI/ML product features.
  • Sync application data into analytics-ready warehouses and maintain reliable data flows.
  • Investigate and resolve data integrity issues such as missing data, incorrect mappings, duplicates, and schema mismatches.
  • Automate end-to-end data ingestion, transformation, and processing to improve efficiency and reliability.
  • Develop and maintain complex datasets with consistent engineering standards across systems.
  • Partner with product managers, research scientists, QA, and platform engineers to translate requirements into data engineering work with clear acceptance criteria and documentation.
  • Ensure data security, quality, and compliance standards are met across pipelines and processes.
  • Mentor junior engineers, conduct peer reviews, and help improve team processes and tooling.
  • Research, prototype, and implement new tools or architectures for novel data challenges.

Requirements

  • Bachelor’s or Master’s degree in Computer Science, Engineering, or a related field.
  • 4 to 6 years of experience in data engineering or a related field, with a focus on AI/ML.
  • Strong Python skills, including experience with SQLAlchemy and migration tools such as Alembic.
  • Advanced SQL experience across transactional databases such as PostgreSQL and columnar warehouses.
  • Experience with AWS Redshift is strongly preferred.
  • Proficiency with dbt and with cloud platforms such as AWS, Azure, or GCP.
  • Hands-on experience with AWS services including Redshift, S3, Lambda, and IAM.
  • Strong Git and GitHub skills, including branching strategies, pull requests, and code review workflows.
  • Experience building scalable, fault-tolerant data pipelines.
  • Experience mentoring junior engineers or leading small teams is preferred.
  • Excellent problem-solving, analytical, communication, and collaboration skills.
  • Comfort working in ambiguous, fast-moving environments and taking ownership from investigation through production deployment.
