Data Engineer (Azure) - Remote, Latin America

1 hour, 21 minutes ago
Full-time
Mid Level
Software Development
Bluelight Consulting

Bluelight Consulting

Bluelight Consulting is a leading Nearshore Software Development company that provides access to highly skilled Nearshore Tech Talent in the US timezone. They offer services such as Nearshore Staffing, Cloud consulting, Cloud migration strategies, and ...

Internet Software & Services
11-50
Founded 2015

Description

  • Develop and maintain ETL processes using Python (PySpark) in Azure Synapse Analytics notebooks and pipelines.
  • Design and build data warehouse structures using star schemas, facts, dimensions, and MPP SQL pools.
  • Extract and integrate data from REST APIs, SQL database tables, and CSV files.
  • Design, optimize, and scale Azure Synapse notebooks and pipelines for performance.
  • Apply data fabric concepts such as data lakes, lakehouses, delta lakes, and data cataloging.
  • Collaborate with data architects to create data models and schemas aligned with business requirements.
  • Implement data quality checks and validation processes to ensure accuracy and consistency.
  • Monitor ETL jobs, troubleshoot issues, and resolve performance bottlenecks to meet SLAs.
  • Maintain documentation for data flows, transformations, and ETL processes.
  • Work with cross-functional stakeholders and ensure data security, governance, and privacy compliance.

Requirements

  • Bachelor’s degree in Computer Science, Information Technology, or a related field, or equivalent work experience.
  • Certification related to data engineering or data science, such as Azure Data Engineer, is a plus.
  • Proven experience in ETL data engineering using Python (PySpark).
  • Experience extracting, transforming, and loading data from REST APIs, SQL tables, and CSV files.
  • Proficiency with Azure Synapse Analytics, including Notebooks, Pipelines, Linked Services, and Azure Key Vault.
  • Ability to write complex SQL queries and optimize query performance.
  • Experience working with both SparkSQL and MS SQL.
  • Knowledge of data integration best practices and tools.
  • Experience with version control systems such as Git and Azure DevOps.
  • Strong problem-solving, analytical, and communication skills with the ability to work in a collaborative, fast-paced environment.
  • Familiarity with big data technologies, machine learning, and data analysis is preferred.
  • Experience with data visualization tools such as Power BI or Tableau is a plus.
  • Experience with Agile methodologies is a plus.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Principal Data Engineer - PerfectScale by DoiT

Zendesk 5K-10K Professional Services

DoiT is hiring a Principal Data Engineer to help shape and build the PerfectScale Kubernetes optimization platform through large-scale backend systems and data pipelines in a remote EMEA setting.

Apache Spark AWS Azure CI/CD ClickHouse dbt Docker GCP GitOps Go Java Kubernetes Microservices PostgreSQL Python Rust Trino
6 minutes ago

Senior Staff Software Engineer - Data Platform

Marqeta 251-1K Diversified Financial Services

Marqeta is hiring a software engineer to build and operate the core data platform underpinning its data and ML organization, with ownership of the lakehouse, streaming ingestion, and the abstractions other teams use to run pipelines and publish datasets.

Apache Airflow Apache Spark AWS Go Java Kafka Microservices Python
6 minutes ago

[Job - 29379] Senior Data Developer (IA/ AWS), Brazil

CI&T 5K-10K Internet Software & Services

CI&T is hiring a Senior Data Developer in Brazil to build scalable AWS-based data ingestion and orchestration solutions for large volumes of data in a remote/hybrid work context.

Apache Spark AWS Datadog Generative AI Kafka Python Serverless
21 minutes ago

Specialist Solutions Architect - Data Engineering & Observability

Databricks 1K-5K IT Services

Databricks is hiring a Specialist Solutions Architect for Data Engineering & Observability to lead customer-facing cloud data architecture work and help customers plan, evaluate, and implement production data and analytics workloads on the Databricks Data Intelligence Platform.

Apache Spark AWS Azure Elasticsearch GCP Hadoop Java Kafka Python Scala Sentinel SIEM Splunk SQL
21 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers