Social Media Data Crawler

5 hours, 14 minutes ago
Full-time
Mid Level
Data Science and Analytics
Hire Premium Sales & Support Global Team - CrewBloom

Hire Premium Sales & Support Global Team - CrewBloom

CrewBloom is a matchmaking platform for sales and support talent, connecting high-growth companies with highly vetted professionals globally. With a focus on reducing hiring costs by up to 70% and ensuring compliance, we facilitate quick and efficient ...

Professional Services
51-250
Founded 2016

Description

  • Develop and maintain automated scripts and tools for extracting data from social media platforms such as LinkedIn, Facebook, X, and Instagram.
  • Ensure crawling and data extraction activities comply with each platform's terms of service and privacy policies.
  • Monitor crawling systems continuously and troubleshoot issues to maintain accurate, consistent data collection.
  • Analyze collected data to identify trends, insights, and patterns aligned with business goals.
  • Collaborate with data scientists, marketing teams, and research teams to deliver relevant data for their initiatives.
  • Optimize crawling systems for performance, scalability, and adaptability to platform changes.
  • Stay current with changes in social media platforms and APIs and adjust crawling strategies accordingly.

Requirements

  • Bachelor’s degree in Computer Science, Information Technology, or a related field.
  • Proven experience in web crawling, scraping, or data extraction from social media.
  • Proficiency in programming languages such as Python, Java, or similar.
  • Familiarity with social media APIs, including LinkedIn API, Facebook Graph API, Twitter API, and Instagram API.
  • Strong understanding of HTML, CSS, and JavaScript.
  • Knowledge of data privacy laws and ethical considerations related to data scraping.
  • Excellent problem-solving skills with keen attention to detail.
  • Ability to work independently and collaboratively within a team.
  • Experience with big data technologies such as Hadoop or Spark (preferred).
  • Familiarity with data visualization and reporting tools (preferred).
  • Understanding of natural language processing (NLP) techniques (preferred).
  • Primary internet connection with a minimum speed of 15 Mbps.
  • Backup internet connection with at least 10 Mbps and the ability to support work during a power outage.
  • Desktop or laptop with at least an Intel Core i5 (8th generation or newer), Intel Core i3 (10th generation or newer), AMD Ryzen 5, or equivalent processor.
  • Minimum of 8 GB RAM.
  • Backup device that meets or exceeds the performance of an Intel Core i3 processor and is functional during power interruptions.
  • Functioning webcam.
  • Noise-canceling USB headset.
  • Quiet, dedicated home office space.
  • Smartphone for communication and verification purposes.

Benefits

  • Fun, inclusive, and innovative team culture that supports professional growth.
  • Daily opportunities to learn, innovate, and make an impact.
  • Strong career growth opportunities and access to resources for advancement.
  • High-energy, engaging, fast-paced work environment.
  • Flexible remote work from home or any location of your choice.
  • Improved work-life balance with no stressful commute.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Senior AI & Data Engineer

Orion Innovation 1K-5K IT Services

Orion Innovation is seeking a Web Delivery Lead to drive end-to-end delivery of AEM-based global websites for its digital engagement platforms, with a focus on reliable launches, governance, and AI-enabled web operations.

Agile AWS Azure CDN CI/CD Computer Vision DNS Docker GCP GPT GraphQL Hugging Face Java JavaScript Kubernetes LLM NLP Python PyTorch SEO TensorFlow
4 hours, 29 minutes ago

Data Engineer

Innodata 1K-5K IT Services

Innodata is seeking a Data Engineer to build enterprise data warehouses, data lakes, and pipelines that support data center supply chain and real estate operations, while enabling AI-driven analytics and workflow automation.

AWS ERP GCP Looker MLOps Power BI Python SQL Tableau
4 hours, 44 minutes ago

Data Engineer

Pavago IT Services

Pavago is hiring a remote Data Engineer to build and maintain cloud-based data pipelines, warehouses, and analytics datasets that support reporting, automation, and business intelligence.

Apache Airflow AWS CI/CD Dagster dbt Docker GCP GitHub Actions GitLab CI Jenkins Kafka Kubernetes Looker Luigi Power BI Prefect Python Scala Snowflake SQL Tableau Terraform
4 hours, 59 minutes ago

Senior Data Engineer - Surveillance & Interoperability

inventYOU 1-10 Internet Software & Services

Senior Data Engineer - Surveillance & Interoperability at inventYOU, responsible for designing and delivering large-scale data integration and interoperability solutions that support secure information exchange, surveillance, monitoring, and analytics across complex systems.

AWS Azure GCP Python REST API SQL
5 hours, 14 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers