Data Engineer (Azure) - Remote, Latin America

1 hour, 45 minutes ago
Full-time
Mid Level
Software Development
Bluelight Consulting

Bluelight Consulting

Bluelight Consulting is a leading Nearshore Software Development company that provides access to highly skilled Nearshore Tech Talent in the US timezone. They offer services such as Nearshore Staffing, Cloud consulting, Cloud migration strategies, and ...

Internet Software & Services
11-50
Founded 2015

Description

  • Develop and maintain ETL processes using Python (PySpark) in Azure Synapse Analytics notebooks and pipelines.
  • Design and build data warehouse structures using star schemas, facts, and dimensions in an MPP SQL pool.
  • Extract and integrate data from REST APIs, SQL database tables, and CSV files.
  • Design and optimize Azure Synapse notebooks and pipelines for scalability and performance.
  • Contribute to data fabric capabilities including data lakes, lakehouses, delta lakes, and data cataloging.
  • Collaborate with data architects to create data models and schemas that align with business needs.
  • Implement data quality checks and validation processes to ensure accurate and consistent data.
  • Identify and resolve performance bottlenecks and troubleshoot ETL jobs to meet SLAs.
  • Maintain documentation for ETL processes, data flows, and transformations.
  • Work with cross-functional teams to understand requirements and support data-related initiatives.
  • Ensure data security, governance, and privacy compliance.

Requirements

  • Bachelor’s degree in Computer Science, Information Technology, or a related field, or equivalent work experience.
  • Certifications related to data engineering or data science, such as Azure Data Engineer, are a plus.
  • Proven experience in ETL data engineering using Python (PySpark).
  • Experience extracting, transforming, and loading data from REST APIs, SQL database tables, and CSV files.
  • Proficiency with Azure Synapse Analytics, including Notebooks, Pipelines, Linked Services, and Azure Key Vault.
  • Ability to write complex SQL queries and optimize query performance.
  • Experience with both SparkSQL and MS SQL.
  • Knowledge of data integration best practices and tools.
  • Experience with version control systems such as Git and Azure DevOps.
  • Strong problem-solving and analytical skills with attention to detail.
  • Excellent verbal and written communication skills and ability to work in a team with shifting priorities.
  • Familiarity with big data technologies, machine learning, and data analysis is preferred.
  • Experience with data visualization tools such as Power BI or Tableau is a plus.
  • Experience with Agile methodologies is a plus.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Backend / Data Software Engineer (Generalist)

Tenchi Security 11-50 Internet Software & Services

Tenchi is seeking a versatile Backend Software Engineer to help build its TPCRM SaaS platform by developing backend services and shaping the data infrastructure that ingests, processes, and stores security-related data.

Apache Spark AWS CI/CD Docker FastAPI Flask Git MySQL PostgreSQL Python SQL
2 hours, 20 minutes ago

Data Engineer Mid-Level

Valtech 5K-10K Professional Services

Valtech is hiring a Data Engineer in Colombia to build and support data solutions for client projects and customer experience innovation.

Agile AWS Azure GCP Metabase SQL Statistics
4 hours, 21 minutes ago

Sr. Associate, Data Quality Engineering

Puck 1-10 Internet Software & Services

Fortitude Re is hiring a Data Quality Engineer to design and maintain automated data quality controls for reinsurance data across governance, reporting, and compliance workflows.

AWS Azure CI/CD GCP Pandas Python SQL
7 hours, 5 minutes ago

Senior Data Engineer

qode Internet Software & Services

Senior Data Engineer role focused on building and optimizing data engineering and analytics pipelines using Informatica and PySpark in a client-facing, cross-functional environment.

Agile Apache Spark AWS Hadoop HDFS Hive JIRA Kafka Machine Learning MapReduce Python
7 hours, 20 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers