Senior Python Data Scraping Engineer (Freelance)

3 hours ago
Part-time
Senior
Software Development
Mindrift.ai: Be the “I” in AI

Join 10,000+ experts earning $15-50/hr training AI models remotely. Flexible freelance work, weekly payments. No AI experience required. Apply in 5 minutes.

Internet Software & Services

Description

  • Own end-to-end data extraction workflows across complex websites and deliver structured datasets.
  • Use internal tools such as Apify and OpenRouter, along with custom workflows, to accelerate data collection and task execution.
  • Extract data reliably from dynamic and interactive web sources, including JavaScript-rendered content and changing site behavior.
  • Validate extracted data through quality checks, cross-source consistency controls, and verification against formatting specifications.
  • Scale scraping operations for large datasets using batching or parallelization.
  • Monitor scraping failures and maintain stability when minor site structure changes occur.
  • Apply systematic verification to ensure complete coverage, accuracy, and reliable delivery of results.
  • Collaborate with Tendem Agents in a hybrid AI + human workflow while providing critical thinking, domain expertise, and quality control.

Requirements

  • At least 5 years of relevant experience in data engineering, web scraping, automation, or software development.
  • Bachelor’s or Master’s degree in Engineering, Applied Mathematics, Computer Science, or a related technical field is preferred.
  • Strong experience in Python web scraping using tools such as BeautifulSoup, Selenium, or similar.
  • Experience scraping dynamic content, including JS-rendered pages, AJAX requests, and infinite scroll, and accessing APIs via proxies.
  • Proven ability to extract data from complex structures, including hierarchies, archived pages, and inconsistent HTML.
  • Solid background in data cleaning, normalization, and validation, with delivery of structured datasets such as CSV, JSON, or Google Sheets.
  • Demonstrated experience handling anti-bot mechanisms and dynamic site structures at scale.
  • Experience with cloud infrastructure such as AWS or equivalent, and containerization with Docker.
  • Hands-on experience with LLM frameworks and tooling such as LangChain, OpenRouter, or similar for automation tasks.
  • Upper-intermediate English proficiency (B2) or above is required.
  • A link to your GitHub profile is a plus.

Benefits

  • Part-time freelance remote work.
  • Compensation of up to $45 per hour equivalent, depending on contribution level and pace.
  • Estimated workload of around 10–20 hours per week during active phases.
  • Opportunity to work on AI projects with major tech innovators through the Mindrift platform.
  • Access to internal tools such as Apify and OpenRouter for the project.
