Senior Python Data Scraping Engineer (Freelance)

1 hour, 55 minutes ago
Part-time
Senior
Software Development
Mindrift.ai: Be the “I” in AI

Mindrift.ai: Be the “I” in AI

Join 10,000+ experts earning $15-50/hr training AI models remotely. Flexible freelance work, weekly payments. No AI experience required. Apply in 5 minutes.

Internet Software & Services

Description

  • Own end-to-end data extraction workflows across complex websites and deliver structured datasets with complete coverage and accuracy.
  • Use Apify, OpenRouter, and custom workflows to accelerate data collection, validation, and task execution.
  • Ensure reliable extraction from dynamic and interactive web sources, including JavaScript-rendered content and changing site behavior.
  • Apply data quality checks, cross-source consistency controls, and formatting verification before delivery.
  • Scale scraping operations for large datasets using batching or parallelization while monitoring failures and stability.
  • Adapt scraping approaches to handle complex site structures, such as hierarchies, archived pages, and inconsistent HTML.
  • Handle anti-bot mechanisms and maintain scraping performance as websites change.
  • Work independently to troubleshoot issues and complete tasks within project requirements.

Requirements

  • 5+ years of relevant experience in data engineering, web scraping, automation, or software development.
  • Strong Python web scraping experience with BeautifulSoup, Selenium, or similar tools.
  • Experience extracting data from dynamic content such as JavaScript, AJAX, infinite scroll, and APIs via proxies.
  • Proven ability to work with complex structures, including hierarchies, archived pages, and inconsistent HTML.
  • Solid background in data cleaning, normalization, and validation, with delivery of structured datasets in CSV, JSON, or Google Sheets.
  • Demonstrated experience handling anti-bot mechanisms and dynamic site structures at scale.
  • Experience with cloud infrastructure such as AWS or equivalent.
  • Experience with containerization tools such as Docker.
  • Hands-on experience with LLM frameworks such as LangChain, OpenRouter, or similar for automation tasks.
  • English proficiency at upper-intermediate (B2) level or above.
  • Bachelor’s or Master’s degree in Engineering, Applied Mathematics, Computer Science, or a related technical field is a plus.
  • A GitHub link is a plus.

Benefits

  • Part-time remote freelance opportunity.
  • Up to $45 per hour equivalent, depending on level and pace of contribution.
  • Estimated workload of around 10–20 hours per week during active project phases.
  • Opportunity to work on a specialized AI + human hybrid workflow.
  • Access to internal tools such as Apify and OpenRouter as part of the project.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Senior Python Developer

SPD Technology Internet Software & Services

SPD Technology is hiring a Senior Python Developer to own the API and async pipeline for an AI-powered legal content platform that helps law firms research, draft, and submit Chambers USA rankings.

API Gateway AWS Celery GitHub Actions Microservices OpenAPI PostgreSQL Pulumi Python REST API Terraform
1 hour, 10 minutes ago

Principal / Staff Software Engineer - Backend, MLOps and Cloud Infrastructure

Nacre Capital 11-50 Capital Markets

Seed-X is hiring a Principal / Staff Software Engineer to lead backend, MLOps, and cloud platform architecture for its AI-driven AgTech systems that analyze seed quality at scale.

AWS CI/CD Datadog DNS Docker Grafana Kubernetes Microservices MLOps OpenSearch Prometheus Python SQL
1 hour, 10 minutes ago

Data Engineer: Integration Migration

Enroute 51-250 Internet Software & Services

Enroute is seeking a Data Engineer to support a migration of data integrations to Azure Data Factory, with a focus on rebuilding pipelines, managing data movement, and stabilizing the new Azure-based platform.

CI/CD Oracle Snowflake SQL
1 hour, 10 minutes ago

Senior Data Engineer

MediaRadar 51-250 Media

MediaRadar is seeking a Senior Data Engineer to lead a distributed team while architecting and modernizing its data delivery platform as it moves from a vendor-heavy stack to a flexible, open-source, AWS-based architecture.

Apache Airflow AWS ClickHouse Dagster dbt Docker Kafka Kubernetes PostgreSQL Python SQL SQL Server
1 hour, 25 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers