MLabs

MLabs is a Haskell, Rust, blockchain, and AI consultancy specializing in mission-critical software development and cross-team collaboration for the fintech, blockchain, and information technology sectors.

Industry: Internet Software & Services
Company size: 11-50
Founded: 2018

Description

  • Write, test, and refine code to extract data from online sources with high reliability and efficiency.
  • Handle complex data retrieval tasks, including pagination and dynamic content loaded via AJAX.
  • Clean and format scraped data to meet quality standards for downstream analysis and processing.
  • Store and manage scraped data in databases, optimizing for access speed and long-term integrity.
  • Monitor scraping processes and infrastructure to identify, troubleshoot, and resolve issues.
  • Help scale access to high-quality public web data across a distributed crawling environment.
  • Gather and analyze data to improve scraping outcomes and pipeline performance.

Requirements

  • Demonstrated ability to extract data from complex websites with minimal supervision, supported by a portfolio of past projects.
  • Advanced skills in Python or JavaScript.
  • Experience with web scraping libraries and frameworks such as BeautifulSoup, Scrapy, or Selenium.
  • Strong knowledge of asynchronous programming, multithreading, and distributed scraping architectures.
  • In-depth knowledge of HTML, CSS, JavaScript, and the Document Object Model (DOM).
  • Experience with NoSQL databases such as MongoDB or Cassandra, including efficient storage design.
  • Experience deploying and managing large-scale scraping jobs on AWS, Google Cloud, or Azure.
  • Ability to apply machine learning algorithms for data cleaning, categorization, or predictive analysis is preferred.
  • Active participation in relevant open-source projects is preferred.

Benefits

  • Competitive salary of $75,000 to $150,000.
  • Comprehensive benefits and equity package.
  • Remote work with a 6-hour overlap with EST.
  • Opportunity to work on AI development and web-scale knowledge graph creation.
  • Low-ego, high-autonomy, rapid-execution team culture.
  • Commitment to equality, accessibility, and reasonable workplace adjustments.
