Data Engineering Intern

6 days, 21 hours ago
Part-time
Entry Level
Software Development
DomainTools

DomainTools is the go-to for Internet intelligence, empowering security analysts with essential tools to assess threats, prevent attacks, and investigate criminal activity.

IT Services
51-250
Founded 2004

Description

  • Assist with data hygiene projects and documentation to maintain data integrity.
  • Clean and prepare data to keep machine learning pipelines accurate and reliable.
  • Automate processes to improve development and data operations efficiency.
  • Update and maintain code to stay compatible with evolving data sources.
  • Develop and improve tools to monitor data health.
  • Support the R&D team with ad-hoc tasks and special projects as needed.
  • Help the team explore new ways to use datasets.
  • Provide development and operational support for production machine learning pipelines.

Requirements

  • Strong organizational skills and the ability to manage tasks and schedules effectively.
  • Strong attention to detail for spotting data inconsistencies and producing high-quality documentation.
  • Clear and precise communication skills for collaborating with a remote team.
  • Familiarity with Python and Git.
  • Ideally familiar with Spark or PySpark.
  • Knowledge of computer networks, including DNS, domain names, and IP addresses, is preferred.
  • Ability to work 5-15 hours per week, depending on availability.
  • Availability for regular check-ins during US/Eastern business hours.
  • Experience working with remote teams is preferred.

Benefits

  • Unpaid internship offered for academic credit only.
  • Hands-on learning experience in a real-world educational environment.
  • Close supervision and mentorship from existing staff.
  • Opportunity to gain experience in internet security, machine learning, and production development patterns.
  • No guarantee of a paid position after the internship.
  • The internship is designed to support the intern's education and training.
