Data Engineer, Azure - Remote, Latin America

1 hour, 54 minutes ago
Full-time
Mid Level
Software Development
Bluelight Consulting

Bluelight Consulting

Bluelight Consulting is a leading Nearshore Software Development company that provides access to highly skilled Nearshore Tech Talent in the US timezone. They offer services such as Nearshore Staffing, Cloud consulting, Cloud migration strategies, and ...

Internet Software & Services
11-50
Founded 2015

Description

  • Develop and maintain ETL data engineering processes using Python and PySpark in Azure Synapse Analytics notebooks and pipelines.
  • Design and build data storage structures in an MPP SQL pool using data warehousing concepts such as star schemas, facts, and dimensions.
  • Extract and transform data from REST APIs, SQL database tables, and CSV files.
  • Design and optimize Azure Synapse Analytics notebooks and pipelines for scalability and performance.
  • Contribute to data fabric initiatives, including data lakes, lakehouses, delta lakes, and data cataloging.
  • Collaborate with data architects to create data models and schemas aligned with business requirements.
  • Implement data quality checks and validation processes to ensure accurate and consistent data.
  • Identify and resolve performance bottlenecks and optimize ETL jobs to meet SLAs.
  • Monitor ETL jobs, diagnose issues, and implement fixes to improve pipeline reliability.
  • Maintain documentation for ETL processes, data flows, and transformations, and work with cross-functional teams on data initiatives.

Requirements

  • Bachelor’s degree in Computer Science, Information Technology, or a related field, or equivalent work experience.
  • Certification related to data engineering or data science, such as Azure Data Engineer, is a plus.
  • Proven experience in ETL data engineering with strong Python and PySpark experience.
  • Experience extracting, transforming, and loading data from REST APIs, SQL database tables, and CSV files.
  • Proficiency with Azure Synapse Analytics resources, including Notebooks, Pipelines, Linked Services, and Azure Key Vault.
  • Ability to write complex SQL queries and optimize query performance using SparkSQL and MS SQL.
  • Knowledge of data integration best practices and tools.
  • Experience with version control systems such as Git and Azure DevOps.
  • Strong problem-solving and analytical skills with attention to detail.
  • Excellent verbal and written communication skills and the ability to work collaboratively in a team with shifting priorities.
  • Familiarity with big data technologies, machine learning, and data analysis is preferred.
  • Experience with data visualization tools such as Power BI or Tableau, and Agile methodologies is a plus.

Benefits

  • Competitive salary and bonuses, including performance-based salary increases.
  • Generous paid time off policy.
  • Flexible working hours.
  • Remote work.
  • Continuing education, training, and conference opportunities.
  • Company-sponsored coursework, exams, and certifications.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Senior Data Engineer

Wavicle Data Solutions 251-1K IT Services

Wavicle Data Solutions is hiring a Senior Data Engineer in Oak Brook, IL to design and deliver cloud-based data integration and pipeline solutions for client transformation initiatives.

Apache Spark AWS Azure Cassandra Databricks GCP Hadoop MongoDB Oracle Python Scala Snowflake SQL SQL Server Teradata
54 minutes ago

Data Engineer, Encounters Intelligence

AcuityMD 51-250 Health Care Providers & Services

AcuityMD is hiring a Data Engineer, Encounters Intelligence to help evolve its core healthcare data assets and deliver data products that improve access to medical technology for MedTech customers.

Dagster dbt GCP LLM Machine Learning Python
1 hour, 31 minutes ago

Software Engineer – Data Pipelines / ETL / MLOps

ALTEN Technology 251-1K Construction & Engineering

ALTEN Technology USA is hiring a Software Engineer to build and support data pipelines, ETL, and MLOps systems that help production machine learning models run reliably at scale.

Apache Airflow Apache Spark GCP Kafka Kubeflow MLflow MLOps Python Scala Snowflake SQL
3 hours, 17 minutes ago

Data Engineer

qode Internet Software & Services

CoderPush is hiring a Data Engineer for its Data Platform team to build scalable data pipelines and support analytics and data-driven decision-making across the organization.

Apache Airflow Apache Spark AWS Databricks dbt Docker EC2 GitHub Actions Kubernetes Python SQL Terraform
4 hours, 11 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers