Sizanid Staffing

Global staffing and recruitment agency offering AI-powered staffing solutions across multiple industries and regions, including permanent and temporary staffing, executive search, recruitment process outsourcing, global mobility staffing, international/remote staffing, and workforce training and AI upskilling.

Staffing & Recruiting

Description

  • Design, develop, and maintain scalable data collection architectures and workflows.
  • Automate data extraction from web sources, APIs, and data feeds.
  • Implement web scraping approaches while following legal and ethical standards.
  • Collaborate with data analysts and data scientists to define data requirements and deliver usable datasets.
  • Perform data cleansing, transformation, and validation to maintain data integrity and accuracy.
  • Monitor and troubleshoot data pipelines and resolve issues or discrepancies promptly.
  • Document workflows, processes, and data sources to support visibility and reproducibility.
  • Stay current on trends, tools, and technologies in data collection and web automation.
  • Participate in code reviews and contribute to continuous improvement in data engineering practices.

Requirements

  • Bachelor’s degree in Computer Science, Data Science, or a related field.
  • Proven experience as a Data Engineer, Data Scientist, or in a similar data collection and automation role.
  • Strong programming skills in Python, Java, or Scala.
  • Experience with web scraping tools such as Beautiful Soup, Scrapy, or Selenium, or similar frameworks.
  • Proficiency in SQL and experience with MySQL, PostgreSQL, or NoSQL databases.
  • Familiarity with cloud platforms such as AWS, Google Cloud, or Azure.
  • Experience with data processing tools such as Apache Spark or Apache Airflow.
  • Understanding of data governance, data quality, and data management principles.
  • Excellent analytical and problem-solving skills with strong attention to detail.
  • Strong communication skills for collaboration with technical and non-technical teams.
  • Experience with ETL processes and data integration tools, preferred.
  • Knowledge of data visualization tools such as Tableau or Power BI, a plus.
  • Familiarity with Git, preferred.
  • Experience working in Agile development environments, preferred.

Benefits

  • Full-time, freelance, and contract roles available.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

AI Data Engineer

Influur 11-50 Media

Influur is hiring an AI Data Engineer in New York/remote to own the full data-to-agent pipeline behind its autonomous viral marketing system for influencer campaigns.

AWS GCP LLM Python
3 hours, 26 minutes ago

Senior Data Engineer

Zencore Group 11-50 Internet Software & Services

Zencore is hiring a Senior Data Engineer in its LATAM Data & Analytics team to help customers modernize and migrate data platforms on Google Cloud through hands-on pipeline engineering and advisory work.

Apache Airflow Apache Spark CI/CD Databricks GCP MLOps Oracle Python Snowflake SQL
4 hours, 11 minutes ago

Data Observability Consultant - Dynatrace

Lingaro 5K-10K IT Services

Dynatrace India’s Consulting and Advisory Data Consulting Practice is hiring a remote Data Observability Consultant to support data-focused consulting work.

4 hours, 26 minutes ago

Senior Data Engineer

Lodgify 251-1K Internet Software & Services

Lodgify is hiring a Senior Data Engineer in Barcelona to build and optimize the company’s modern data platform that powers data-driven decisions across its vacation rental business.

Apache Airflow AWS Azure dbt GCP JavaScript Machine Learning Python SQL
4 hours, 26 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers