Bluelight Consulting

Bluelight Consulting

Bluelight Consulting is a leading Nearshore Software Development company that provides access to highly skilled Nearshore Tech Talent in the US timezone. They offer services such as Nearshore Staffing, Cloud consulting, Cloud migration strategies, and ...

Internet Software & Services
11-50
Founded 2015

Description

  • Develop and maintain ETL processes using Python (PySpark) in Azure Synapse Analytics notebooks and pipelines.
  • Design and build data storage structures using data warehousing concepts such as star schemas, facts, and dimensions.
  • Extract data from REST APIs, SQL database tables, and CSV files.
  • Design and optimize Azure Synapse Analytics notebooks and pipelines for scalability and performance.
  • Contribute to data fabric initiatives including data lakes, lakehouses, delta lakes, and data cataloging.
  • Collaborate with data architects to create data models and schemas aligned with business requirements.
  • Implement data quality checks and validation processes to ensure accuracy and consistency.
  • Identify and resolve performance bottlenecks and monitor ETL jobs to maintain reliability and SLA compliance.
  • Maintain documentation for ETL processes, data flows, and transformations.
  • Work with cross-functional teams on data requirements, support data initiatives, and ensure security and compliance with governance and privacy standards.

Requirements

  • Bachelor’s degree in Computer Science, Information Technology, or a related field, or equivalent work experience.
  • Certification related to data engineering or data science, such as Azure Data Engineer, is a plus.
  • Proven experience in ETL data engineering with strong Python (PySpark) experience.
  • Experience extracting, transforming, and loading data from REST APIs, SQL database tables, and CSV files.
  • Proficiency with Azure Synapse Analytics resources, including Notebooks, Pipelines, Linked Services, and Azure Key Vault.
  • Ability to write complex SQL queries and optimize query performance.
  • Experience working with both SparkSQL and MS SQL.
  • Knowledge of data integration best practices and tools.
  • Experience with version control systems such as Git and Azure DevOps.
  • Strong problem-solving, analytical, and communication skills with attention to detail.
  • Familiarity with big data technologies, machine learning, and data analysis is preferred.
  • Experience with data visualization tools such as Power BI or Tableau is a plus.
  • Experience with Agile methodologies is a plus.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Senior AI & Data Engineer

Orion Innovation 1K-5K IT Services

Orion Innovation is seeking a Web Delivery Lead to drive end-to-end delivery of AEM-based global websites for its digital engagement platforms, with a focus on reliable launches, governance, and AI-enabled web operations.

Agile AWS Azure CDN CI/CD Computer Vision DNS Docker GCP GPT GraphQL Hugging Face Java JavaScript Kubernetes LLM NLP Python PyTorch SEO TensorFlow
17 hours, 52 minutes ago

Data Engineer

Innodata 1K-5K IT Services

Innodata is seeking a Data Engineer to build enterprise data warehouses, data lakes, and pipelines that support data center supply chain and real estate operations, while enabling AI-driven analytics and workflow automation.

AWS ERP GCP Looker MLOps Power BI Python SQL Tableau
18 hours, 7 minutes ago

Data Engineer

Pavago IT Services

Pavago is hiring a remote Data Engineer to build and maintain cloud-based data pipelines, warehouses, and analytics datasets that support reporting, automation, and business intelligence.

Apache Airflow AWS CI/CD Dagster dbt Docker GCP GitHub Actions GitLab CI Jenkins Kafka Kubernetes Looker Luigi Power BI Prefect Python Scala Snowflake SQL Tableau Terraform
18 hours, 22 minutes ago

Senior Data Engineer - Surveillance & Interoperability

inventYOU 1-10 Internet Software & Services

Senior Data Engineer - Surveillance & Interoperability at inventYOU, responsible for designing and delivering large-scale data integration and interoperability solutions that support secure information exchange, surveillance, monitoring, and analytics across complex systems.

AWS Azure GCP Python REST API SQL
18 hours, 37 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers