Senior Python Data Scraping Engineer (Freelance)

1 hour, 27 minutes ago
Part-time
Senior
Software Development
Mindrift.ai: Be the “I” in AI

Mindrift.ai: Be the “I” in AI

Join 10,000+ experts earning $15-50/hr training AI models remotely. Flexible freelance work, weekly payments. No AI experience required. Apply in 5 minutes.

Internet Software & Services

Description

  • Own end-to-end data extraction workflows across complex websites to deliver complete, accurate structured datasets.
  • Use internal tools such as Apify and OpenRouter, along with custom workflows, to speed up data collection, validation, and task execution.
  • Ensure reliable extraction from dynamic and interactive web sources, including JavaScript-rendered content and changing site behavior.
  • Enforce data quality standards through validation checks, cross-source consistency reviews, formatting adherence, and systematic verification before delivery.
  • Scale scraping operations for large datasets using batching or parallelization while monitoring failures and maintaining stability against minor site changes.
  • Clean, normalize, and validate scraped data into structured outputs such as CSV, JSON, or Google Sheets.
  • Troubleshoot extraction challenges independently and adapt approaches for non-trivial scraping problems.

Requirements

  • 5+ years of relevant experience in data engineering, web scraping, automation, or software development.
  • Strong experience in Python web scraping using BeautifulSoup, Selenium, or similar tools.
  • Experience extracting data from dynamic content such as JavaScript, AJAX, infinite scroll, and APIs via proxies.
  • Proven ability to work with complex website structures, including hierarchies, archived pages, and inconsistent HTML.
  • Solid background in data cleaning, normalization, and validation with structured datasets such as CSV, JSON, or Google Sheets.
  • Demonstrated experience handling anti-bot mechanisms and dynamic site structures at scale.
  • Experience with cloud infrastructure such as AWS or equivalent, and containerization with Docker in real workflows.
  • Hands-on experience with LLM frameworks such as LangChain, OpenRouter, or similar automation tools.
  • Bachelor’s or Master’s degree in Engineering, Applied Mathematics, Computer Science, or a related technical field is a plus.
  • English proficiency at Upper-intermediate (B2) level or above.
  • A GitHub link is a plus.
  • Methodical, detail-oriented, and able to work independently.

Benefits

  • Part-time remote freelance opportunity.
  • Estimated workload of around 10–20 hours per week during active project phases.
  • Compensation of up to $40 per hour equivalent, depending on contribution level and pace.
  • Opportunity to work on AI projects within a hybrid AI + human system.
  • Access to internal tools such as Apify and OpenRouter for project work.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Senior Python Developer

SPD Technology Internet Software & Services

SPD Technology is hiring a Senior Python Developer to own the API and async pipeline for an AI-powered legal content platform that helps law firms research, draft, and submit Chambers USA rankings.

API Gateway AWS Celery GitHub Actions Microservices OpenAPI PostgreSQL Pulumi Python REST API Terraform
1 hour, 12 minutes ago

Principal / Staff Software Engineer - Backend, MLOps and Cloud Infrastructure

Nacre Capital 11-50 Capital Markets

Seed-X is hiring a Principal / Staff Software Engineer to lead backend, MLOps, and cloud platform architecture for its AI-driven AgTech systems that analyze seed quality at scale.

AWS CI/CD Datadog DNS Docker Grafana Kubernetes Microservices MLOps OpenSearch Prometheus Python SQL
1 hour, 12 minutes ago

Data Engineer: Integration Migration

Enroute 51-250 Internet Software & Services

Enroute is seeking a Data Engineer to support a migration of data integrations to Azure Data Factory, with a focus on rebuilding pipelines, managing data movement, and stabilizing the new Azure-based platform.

CI/CD Oracle Snowflake SQL
1 hour, 12 minutes ago

Senior Data Engineer

MediaRadar 51-250 Media

MediaRadar is seeking a Senior Data Engineer to lead a distributed team while architecting and modernizing its data delivery platform as it moves from a vendor-heavy stack to a flexible, open-source, AWS-based architecture.

Apache Airflow AWS ClickHouse Dagster dbt Docker Kafka Kubernetes PostgreSQL Python SQL SQL Server
1 hour, 27 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers