Azure Data Engineer - Remote, Latin America

3 hours, 36 minutes ago
Full-time
Mid Level
Software Development
Bluelight Consulting

Bluelight Consulting

Bluelight Consulting is a leading Nearshore Software Development company that provides access to highly skilled Nearshore Tech Talent in the US timezone. They offer services such as Nearshore Staffing, Cloud consulting, Cloud migration strategies, and ...

Internet Software & Services
11-50
Founded 2015

Description

  • Develop and maintain ETL processes using Python (PySpark) in Azure Synapse Analytics Notebooks and Pipelines.
  • Design and build data storage structures using data warehousing concepts such as star schemas, facts, and dimensions in an MPP SQL pool.
  • Extract data from REST APIs, SQL database tables, and CSV files.
  • Design and optimize Azure Synapse notebooks and pipelines for scalability and performance.
  • Contribute to data fabric initiatives, including data lakes, lakehouses, delta lakes, and data cataloging.
  • Collaborate with data architects to create data models and schemas aligned to business needs.
  • Implement data quality checks and validation processes to ensure data accuracy and consistency.
  • Identify performance bottlenecks and troubleshoot ETL jobs to meet SLAs.
  • Maintain documentation for ETL processes, data flows, and transformations.
  • Work with cross-functional teams to gather requirements and support data-related initiatives.
  • Ensure data security and compliance with governance and privacy standards.

Requirements

  • Bachelor’s degree in Computer Science, Information Technology, or a related field, or equivalent work experience.
  • Certification related to data engineering or data science, such as Azure Data Engineer, is a plus.
  • Proven experience in ETL data engineering using Python (PySpark).
  • Experience extracting, transforming, and loading data from REST APIs, SQL database tables, and CSV files.
  • Proficiency with Azure Synapse Analytics, including Notebooks, Pipelines, Linked Services, and Azure Key Vault.
  • Ability to write complex SQL queries and optimize query performance.
  • Experience working with both SparkSQL and MS SQL.
  • Knowledge of data integration best practices and tools.
  • Experience with version control systems such as Git and Azure DevOps.
  • Strong problem-solving, analytical, and attention-to-detail skills.
  • Excellent verbal and written communication skills and ability to work in a team with shifting priorities.
  • Familiarity with big data technologies, machine learning, and data analysis is preferred.
  • Experience with data visualization tools such as Power BI or Tableau is a plus.
  • Experience with Agile methodologies is a plus.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Software Engineer II, Big Data

tvScientific 11-50 Media

tvScientific is hiring a Data Engineer to build and evolve the company’s AWS-based data infrastructure and pipelines that support its CTV advertising platform and data-heavy operations.

Apache Spark AWS Machine Learning Scala SQL
1 hour, 39 minutes ago

Data Engineer

A Data Engineer on the Professional Services team at a B2B sales intelligence platform will work directly on customer implementations for wholesale distributors to build data workflows, configure systems, and resolve data issues.

Apache Airflow Asana ERP GCP Git GitHub Actions HubSpot JSON Kubernetes Looker Python REST API Salesforce SFTP SQL
2 hours, 13 minutes ago

Senior Data Engineer

Kaseya 1K-5K IT Services

Kaseya is hiring a Senior Data Engineer to build and operate large-scale data platforms that support analytics, automation, and customer-facing experiences across its products.

AWS Azure CI/CD CloudFormation Flink GCP Go Java Kafka Machine Learning Microservices Python Rust Terraform
2 hours, 55 minutes ago

Data Engineer

Lighthouse Hotels, Restaurants & Leisure

Lighthouse is hiring a Data Engineer in Barcelona to build and improve large-scale data ingestion and transformation pipelines that power its hospitality intelligence platform.

GCP Kubernetes Machine Learning Python
6 hours, 5 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers