Senior Python Data Scraping Engineer (Freelance)

1 hour, 54 minutes ago
Part-time
Senior
Software Development
Mindrift.ai: Be the “I” in AI

Mindrift.ai: Be the “I” in AI

Join 10,000+ experts earning $15-50/hr training AI models remotely. Flexible freelance work, weekly payments. No AI experience required. Apply in 5 minutes.

Internet Software & Services

Description

  • Own end-to-end data extraction workflows across complex websites and deliver structured datasets.
  • Use Apify, OpenRouter, and custom workflows to accelerate data collection, validation, and task execution.
  • Extract data reliably from dynamic and interactive web sources, including JavaScript-rendered content.
  • Enforce data quality standards through validation checks, cross-source consistency controls, and formatting verification.
  • Scale scraping operations for large datasets using batching or parallelization.
  • Monitor failures and maintain scraping stability when websites change structure slightly.
  • Troubleshoot technical issues independently and adapt scraping approaches as needed.
  • Collaborate with AI-assisted workflows while applying critical thinking, domain expertise, and quality control.

Requirements

  • 5+ years of relevant experience in data engineering, web scraping, automation, or software development.
  • Bachelor’s or Master’s degree in Engineering, Applied Mathematics, Computer Science, or a related technical field is a plus.
  • Strong technical foundation with practical experience in scripting, automation, and AI-assisted workflows.
  • Strong Python web scraping experience with BeautifulSoup, Selenium, or similar tools.
  • Experience handling dynamic content such as JS, AJAX, infinite scroll, and APIs via proxies.
  • Proven ability to extract data from complex structures such as hierarchies, archived pages, and inconsistent HTML.
  • Solid background in data cleaning, normalization, and validation, delivering structured datasets in CSV, JSON, or Google Sheets.
  • Experience handling anti-bot mechanisms and dynamic site structures at scale.
  • Experience with cloud infrastructure such as AWS and containerization with Docker in real workflows.
  • Hands-on experience with LLM frameworks such as LangChain, OpenRouter, or similar.
  • Upper-intermediate English proficiency (B2) or above.
  • A GitHub link is a plus.

Benefits

  • Part-time remote freelance opportunity.
  • Estimated workload of around 10–20 hours per week during active project phases.
  • Compensation of up to $45 per hour equivalent, depending on contribution level and pace.
  • Opportunity to work on AI projects through Mindrift's specialist platform.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Senior Python Developer

SPD Technology Internet Software & Services

SPD Technology is hiring a Senior Python Developer to own the API and async pipeline for an AI-powered legal content platform that helps law firms research, draft, and submit Chambers USA rankings.

API Gateway AWS Celery GitHub Actions Microservices OpenAPI PostgreSQL Pulumi Python REST API Terraform
1 hour, 9 minutes ago

Principal / Staff Software Engineer - Backend, MLOps and Cloud Infrastructure

Nacre Capital 11-50 Capital Markets

Seed-X is hiring a Principal / Staff Software Engineer to lead backend, MLOps, and cloud platform architecture for its AI-driven AgTech systems that analyze seed quality at scale.

AWS CI/CD Datadog DNS Docker Grafana Kubernetes Microservices MLOps OpenSearch Prometheus Python SQL
1 hour, 9 minutes ago

Data Engineer: Integration Migration

Enroute 51-250 Internet Software & Services

Enroute is seeking a Data Engineer to support a migration of data integrations to Azure Data Factory, with a focus on rebuilding pipelines, managing data movement, and stabilizing the new Azure-based platform.

CI/CD Oracle Snowflake SQL
1 hour, 9 minutes ago

Senior Data Engineer

MediaRadar 51-250 Media

MediaRadar is seeking a Senior Data Engineer to lead a distributed team while architecting and modernizing its data delivery platform as it moves from a vendor-heavy stack to a flexible, open-source, AWS-based architecture.

Apache Airflow AWS ClickHouse Dagster dbt Docker Kafka Kubernetes PostgreSQL Python SQL SQL Server
1 hour, 24 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers