Senior Python Data Scraping Engineer (Freelance)

2 hours, 24 minutes ago
Part-time
Senior
Software Development
Mindrift.ai: Be the “I” in AI

Join 10,000+ experts earning $15-50/hr training AI models remotely. Flexible freelance work, weekly payments. No AI experience required. Apply in 5 minutes.

Internet Software & Services

Description

  • Own end-to-end data extraction workflows across complex websites to ensure complete coverage, accuracy, and reliable delivery of structured datasets.
  • Use internal tools such as Apify and OpenRouter, along with custom workflows, to accelerate data collection, validation, and task execution.
  • Extract data reliably from dynamic and interactive web sources, including JavaScript-rendered content and changing site behavior.
  • Apply data quality controls through validation checks, cross-source consistency checks, formatting adherence, and systematic verification before delivery.
  • Scale scraping operations for large datasets using batching or parallelization while monitoring failures and maintaining stability against minor site structure changes.

Requirements

  • 5+ years of relevant experience in data engineering, web scraping, automation, or software development.
  • Bachelor’s or Master’s degree in Engineering, Applied Mathematics, Computer Science, or a related technical field is a plus.
  • Strong technical foundation with practical experience in scripting, automation, and AI-assisted workflows.
  • Strong Python web scraping experience with BeautifulSoup, Selenium, or similar tools, including handling dynamic content (JavaScript, AJAX, infinite scroll) and accessing APIs through proxies.
  • Proven ability to extract data from complex structures such as hierarchies, archived pages, and inconsistent HTML.
  • Solid background in data cleaning, normalization, and validation, with experience delivering structured datasets in CSV, JSON, or Google Sheets.
  • Demonstrated experience handling anti-bot mechanisms and dynamic site structures at scale.
  • Experience with cloud infrastructure such as AWS or equivalent, and containerization with Docker in real workflows.
  • Hands-on experience with LLM frameworks and services such as LangChain, OpenRouter, or similar, applied to automation tasks.
  • English proficiency at Upper-intermediate (B2) level or above.
  • A GitHub link is a plus.

Benefits

  • Part-time remote freelance opportunity.
  • Estimated workload of around 10–20 hours per week during active project phases.
  • Compensation of up to $40 per hour equivalent, depending on level and pace of contribution.
