Senior Python Data Scraping Engineer (Freelance)

1 hour, 39 minutes ago
Part-time
Senior
Software Development
Mindrift.ai: Be the “I” in AI

Mindrift.ai: Be the “I” in AI

Join 10,000+ experts earning $15-50/hr training AI models remotely. Flexible freelance work, weekly payments. No AI experience required. Apply in 5 minutes.

Internet Software & Services

Description

  • Own end-to-end data extraction workflows across complex websites and deliver structured datasets.
  • Use internal tools such as Apify and OpenRouter, along with custom workflows, to speed up data collection, validation, and execution.
  • Extract data reliably from dynamic and interactive web sources, including JavaScript-rendered content and changing site behavior.
  • Apply validation checks, cross-source consistency controls, and formatting verification to ensure high data quality before delivery.
  • Scale scraping operations for large datasets using batching or parallelization.
  • Monitor failures and maintain scraping stability when sites change in minor ways.
  • Collaborate with AI agents by providing critical thinking, domain expertise, and quality control.
  • Systematically collect, structure, and validate data from diverse sources while working independently.

Requirements

  • At least 5+ years of relevant experience in data engineering, web scraping, automation, or software development.
  • Bachelor’s or Master’s degree in Engineering, Applied Mathematics, Computer Science, or a related technical field is a plus.
  • Strong technical foundation with practical experience in scripting, automation, and AI-assisted workflows.
  • Strong experience in Python web scraping using BeautifulSoup, Selenium, or similar tools.
  • Experience handling dynamic content such as JS, AJAX, infinite scroll, and APIs via proxies.
  • Proven ability to extract data from complex structures such as hierarchies, archived pages, and inconsistent HTML.
  • Solid background in data cleaning, normalization, and validation, with experience delivering structured datasets in CSV, JSON, or Google Sheets.
  • Demonstrated experience handling anti-bot mechanisms and dynamic site structures at scale.
  • Experience with cloud infrastructure such as AWS or equivalent and containerization with Docker.
  • Hands-on experience with LLM frameworks such as LangChain, OpenRouter, or similar tools.
  • Upper-intermediate English proficiency (B2) or above.
  • A GitHub link is a plus.

Benefits

  • Part-time remote freelance opportunity.
  • Estimated workload of around 10–20 hours per week during active project phases.
  • Compensation of up to $45 per hour equivalent, depending on level and pace of contribution.
  • Opportunity to work on specialized AI projects with a hybrid AI + human system.
  • Access to provided tools such as Apify and OpenRouter for the project.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Senior Python Developer

SPD Technology Internet Software & Services

SPD Technology is hiring a Senior Python Developer to own the API and async pipeline for an AI-powered legal content platform that helps law firms research, draft, and submit Chambers USA rankings.

API Gateway AWS Celery GitHub Actions Microservices OpenAPI PostgreSQL Pulumi Python REST API Terraform
1 hour, 9 minutes ago

Principal / Staff Software Engineer - Backend, MLOps and Cloud Infrastructure

Nacre Capital 11-50 Capital Markets

Seed-X is hiring a Principal / Staff Software Engineer to lead backend, MLOps, and cloud platform architecture for its AI-driven AgTech systems that analyze seed quality at scale.

AWS CI/CD Datadog DNS Docker Grafana Kubernetes Microservices MLOps OpenSearch Prometheus Python SQL
1 hour, 9 minutes ago

Data Engineer: Integration Migration

Enroute 51-250 Internet Software & Services

Enroute is seeking a Data Engineer to support a migration of data integrations to Azure Data Factory, with a focus on rebuilding pipelines, managing data movement, and stabilizing the new Azure-based platform.

CI/CD Oracle Snowflake SQL
1 hour, 9 minutes ago

Senior Data Engineer

MediaRadar 51-250 Media

MediaRadar is seeking a Senior Data Engineer to lead a distributed team while architecting and modernizing its data delivery platform as it moves from a vendor-heavy stack to a flexible, open-source, AWS-based architecture.

Apache Airflow AWS ClickHouse Dagster dbt Docker Kafka Kubernetes PostgreSQL Python SQL SQL Server
1 hour, 24 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers