AWS Data Engineer (Associate)

1 hour, 59 minutes ago
Full-time
Junior
Software Development
Mactores

Mactores

Mactores provides data analytics solutions and services that empower businesses to enhance their operational efficiency and value through automation and advanced technologies such as AI, Big Data, and cloud computing.

IT Services
51-250
Founded 2003

Description

  • Write efficient code using PySpark and Amazon Glue.
  • Write SQL queries in Amazon Athena and Amazon Redshift.
  • Explore new technologies and techniques to solve business problems creatively.
  • Collaborate with engineering and business teams to build data products and services.
  • Work with business leads, analysts, and data scientists to understand the business domain.
  • Engage with fellow engineers to develop data solutions that support better decision-making.
  • Deliver projects collaboratively as part of the team.
  • Manage customer updates and communication in a timely manner.

Requirements

  • 1 to 3 years of experience with Apache Spark, PySpark, and Amazon Glue.
  • 2+ years of experience writing ETL jobs using PySpark and SparkSQL.
  • 2+ years of experience writing SQL queries and stored procedures.
  • Deep understanding of the Spark DataFrame API and its transformation functions.
  • Experience with Spark 2.7+.
  • Prior experience working with AWS EMR and Apache Airflow is preferred.
  • AWS Certified Big Data – Specialty, Cloudera Certified Big Data Engineer, or Hortonworks Certified Big Data Engineer certification is preferred.
  • Understanding of DataOps engineering is preferred.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

AWS&Snowflake Data Engineer

Lingaro 5K-10K IT Services

AWS & Snowflake Data Engineer for CC Data Engineering & Management in Poland, focused on remote data engineering and management work.

AWS Snowflake
14 minutes ago

Data Engineer

Kpler 251-1K Professional Services

Kpler is hiring a Data Engineer in Germany to build and maintain core ship-tracking datasets and the APIs and data pipelines that power them in a remote engineering environment.

Agile Apache Airflow Apache Spark AWS CI/CD Docker FastAPI Git Gradle Kafka Kubernetes Maven Microservices Python REST API Scala Terraform
14 minutes ago

GCP Data Engineer- Senior Consultant

Lingaro 5K-10K IT Services

Senior GCP Data Engineer role at KK in India, focused on building and optimizing cloud data platforms and pipelines to support reliable data processing and analytics.

Apache Airflow GCP Python SQL
29 minutes ago

Senior Data Engineer

Nova 1K-5K Professional Services

Senior Data Engineer at a remote U.S.-based engineering team focused on maintaining and improving data warehouse infrastructure, pipelines, and reporting to deliver reliable data solutions and actionable insights.

Apache Airflow Apache Spark AWS Azure CI/CD Databricks GCP Git Power BI Python Snowflake SQL Tableau
44 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers