Data Engineer - Web Scraping (4-month contract)

5 hours, 9 minutes ago
Contract
Junior
Software Development

CXG

CXG provides customer sentiment analysis and consulting services to help businesses enhance customer experiences and drive growth.

Description

  • Maintain and manage website scraping configurations using Python.
  • Monitor scraping configurations for errors, crashes, and potential blockages.
  • Review retrieved data to detect anomalies and data-quality problems.
  • Coordinate with stakeholders to clarify scraping task requirements and report issues.
  • Prepare and share periodic reports on scraping activities with stakeholders.
  • Develop pipelines to ingest data into the Datalake and perform required transformations.
  • Build, test, and maintain tasks and projects related to data engineering workflows.
  • Optimize data pipelines for performance and efficiency.

Requirements

  • Minimum 2 years of experience in a similar data engineering role.
  • Proven experience designing and implementing scalable data architectures.
  • Strong experience with ETL processes, data modeling, and data warehousing.
  • Hands-on experience with Airflow and/or dbt is required; experience with both is preferred.
  • Expertise in relational and NoSQL database technologies.
  • Knowledge of cloud platforms, particularly Azure.
  • Solid understanding of data security measures and compliance standards.
  • Excellent Python skills for data engineering and automation.
  • Experience with version control systems such as Git.
  • Experience with Terraform for infrastructure management.
  • Strong collaboration skills to work closely with data scientists and analysts.
  • Strong academic background in a relevant field.
  • Fluent in English; French is a plus.


Similar Roles

Senior DataOps Engineer- Remote US

Smile Digital Health 251-1K IT Services

Smile Digital Health is hiring a Senior DataOps Engineer to own analytics infrastructure and data processing environments for its remote U.S. clinical intelligence product team.

Ansible Apache Airflow Apache Spark AWS Azure CI/CD Databricks GCP GitHub Actions GitLab CI Java Jenkins Kubernetes Linux Prefect Python Scala Terraform
36 minutes ago

Staff Data Engineer

CookUnity 251-1K Hotels, Restaurants & Leisure

CookUnity is hiring a Data Engineer to help rebuild and scale the company’s B2C data foundation by designing production-ready pipelines and data systems for a rapidly growing food marketplace.

Apache Spark Flink Java Kafka Kubernetes Python Scala Snowflake SQL
3 hours ago

Data Engineering Team Lead (Agentic Search)

Nebius 51-250 Internet Software & Services

Nebius is seeking a Data Engineering Team Lead to own the data platform supporting its agent-native search product, spanning ingestion, warehouse architecture, analytics, and trustworthy datasets for product and business decisions.

Apache Airflow Apache Spark AWS dbt GCP Kafka MapReduce Python Snowflake SQL
4 hours, 4 minutes ago

GCP Data Architect

66degrees 251-1K IT Services

66degrees is seeking an experienced Data Architect to design, develop, and maintain Google Cloud-based data architecture that turns enterprise data into scalable, reliable business value.

Apache Spark dbt GCP Hadoop Python SQL
4 hours, 8 minutes ago
