Azure Data Engineer - Remote, Latin America

3 hours, 15 minutes ago
Full-time
Mid Level
Software Development
Bluelight Consulting

Bluelight Consulting

Bluelight Consulting is a leading Nearshore Software Development company that provides access to highly skilled Nearshore Tech Talent in the US timezone. They offer services such as Nearshore Staffing, Cloud consulting, Cloud migration strategies, and ...

Internet Software & Services
11-50
Founded 2015

Description

  • Develop and maintain ETL processes using Python (PySpark) in Azure Synapse Analytics notebooks and pipelines.
  • Design and build data warehousing structures using star schemas, facts, and dimensions in an MPP SQL pool.
  • Extract data from REST APIs, SQL database tables, and CSV files.
  • Design and optimize Azure Synapse Analytics notebooks and pipelines for scalability and performance.
  • Contribute to data fabric initiatives including data lakes, lakehouses, delta lakes, and data cataloging.
  • Collaborate with data architects to create data models and schemas aligned with business requirements.
  • Implement data quality checks and validation processes to ensure accuracy and consistency.
  • Identify and resolve performance bottlenecks and monitor ETL jobs to ensure reliability and SLA compliance.
  • Maintain documentation for ETL processes, data flows, and transformations.
  • Work closely with cross-functional teams on data requirements, support, security, and compliance needs.

Requirements

  • Bachelor’s degree in Computer Science, Information Technology, or a related field, or equivalent work experience.
  • Experience in ETL data engineering with strong expertise in Python (PySpark).
  • Experience extracting, transforming, and loading data from REST APIs, SQL database tables, and CSV files.
  • Proficiency with Azure Synapse Analytics resources including Notebooks, Pipelines, Linked Services, and Azure Key Vault.
  • Ability to write complex SQL queries, optimize query performance, and work with SparkSQL and MS SQL.
  • Knowledge of data integration best practices and tools.
  • Experience with version control systems such as Git and Azure DevOps.
  • Strong problem-solving and analytical skills with excellent attention to detail.
  • Excellent verbal and written communication skills and ability to work in a team with shifting priorities.
  • Certification related to data engineering or data science, such as Azure Data Engineer, is a plus.
  • Familiarity with big data technologies, machine learning, and data analysis is preferred.
  • Experience with data visualization tools such as Power BI or Tableau and Agile methodologies is a plus.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Senior Data Engineer

qode Internet Software & Services

Senior Data Engineer role focused on building and optimizing data engineering and analytics pipelines using Informatica and PySpark in a client-facing, cross-functional environment.

Agile Apache Spark AWS Hadoop HDFS Hive JIRA Kafka Machine Learning MapReduce Python
32 minutes ago

Data Engineer II (SAS)

eClinical Solutions 251-1K Professional Services

The Data Engineer II at eClinical Solutions will work with clients to configure and support the elluminate platform, deliver technical consulting, and contribute to clinical data and analytics projects.

Apache Spark AWS Azure C# DB2 HTML Java .NET Oracle Python R REST API SQL SQL Server Tableau Teradata
1 hour, 41 minutes ago

Senior Software Engineer - Data Platform

Samsara 1K-5K IT Services

Samsara is hiring a Senior Software Engineer I for its Data Platform team to build and operate the core data infrastructure that powers analytics, AI, product, and operational use cases across the company.

Apache Airflow Apache Spark AWS CloudFormation Dagster Databricks Docker Go Java Kubernetes Microservices Prefect Python Scala Terraform
3 hours, 26 minutes ago

Software Engineer II, Big Data

tvScientific 11-50 Media

tvScientific is hiring a Data Engineer to build and evolve the company’s AWS-based data infrastructure and pipelines that support its CTV advertising platform and data-heavy operations.

Apache Spark AWS Machine Learning Scala SQL
5 hours, 44 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers