Tech Holding

Tech Holding

Tech Holding: California's #1 website design company offering full-service technology consulting with expertise in software management, AI, and security.

Internet Software & Services
51-250
Founded 2016

Description

  • Design, deploy, and scale large-scale ML and data processing pipelines across cloud infrastructure.
  • Build systems to ingest, process, and serve 250,000+ hours of multimodal data including video, audio, and metadata.
  • Architect and optimize GPU-based compute environments for distributed training and inference.
  • Develop high-throughput backend systems for video ingestion from desktop and mobile platforms.
  • Implement distributed processing workflows with job scheduling, fault tolerance, and resource allocation.
  • Design and build human-in-the-loop and automated annotation systems to support data quality and scalability.
  • Translate ML and multimodal research into scalable, production-grade cloud architectures.
  • Optimize pipelines for performance, reliability, and cost efficiency across compute, storage, and networking layers.
  • Collaborate with ML, data, and engineering teams to deliver end-to-end data workflows.

Requirements

  • 5+ years of experience in data engineering, ML pipelines, or distributed systems.
  • Strong experience building scalable data pipelines for large datasets, preferably video and audio data.
  • Hands-on experience with cloud platforms such as AWS, Azure, or GCP.
  • Experience working with GPU-based environments and distributed computing.
  • Strong programming skills in Python, Scala, or similar languages.
  • Experience with data processing frameworks such as Spark, Ray, Kafka, Airflow, or similar tools.
  • Understanding of ML workflows, training pipelines, and inference systems.
  • Experience designing fault-tolerant, high-availability systems.
  • Strong knowledge of data storage systems including data lakes, object storage, and distributed file systems.
  • Ability to handle high-throughput, large-scale data ingestion and processing.
  • Experience with multimodal AI systems involving video, audio, and NLP, preferred.
  • Familiarity with annotation tools and data labeling workflows, preferred.
  • Experience with containerization and orchestration tools such as Docker and Kubernetes, preferred.
  • Knowledge of cost optimization strategies for large-scale cloud workloads, preferred.

Benefits

  • Equal Opportunity Employer commitment with a diverse and inclusive workplace.
  • Accommodation provided during the application process for candidates who need it.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Data Engineer, Azure - Remote, Latin America

Bluelight Consulting 11-50 Internet Software & Services

Bluelight is hiring a remote Data Engineer, Azure in Latin America to build and optimize data pipelines and warehousing solutions for client projects in a growing software consultancy.

Agile Apache Spark Azure Git Machine Learning Power BI Python REST API SQL SQL Server Tableau
20 minutes ago

Senior Machine Learning Engineer, Advertiser Growth

Unity 5K-10K Internet Software & Services

Unity is hiring a senior software engineer on the Advertiser Growth team to build the systems that power ad marketplace scaling, financial integrity, and experimentation at massive scale.

Apache Spark Flink Generative AI Go Java Kafka LLM Scala
37 minutes ago

Data Engineer (Remote)

Evio Beauty 11-50 Consumer Goods

Evio is hiring an experienced Data Engineer to build and maintain the AWS-native data platform that powers pharmacy innovation and improved patient outcomes.

AWS Python SQL
52 minutes ago

2026-0061 Ballistic Missile Defence (BMD) Feasibility Study - TUE 19 May

EMW 51-250 Internet Software & Services

NCIA is seeking a contractor team to conduct a multi-year feasibility study on applying AI and machine learning to ballistic missile defence use cases, including research, prototyping, and knowledge transfer across NATO environments.

Agile AWS Azure CI/CD Deep Learning Docker Git Java JavaScript Kubernetes LLM Machine Learning Microservices Python PyTorch Reinforcement Learning REST API Scikit-learn Spring Boot TensorFlow Transformers TypeScript
52 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers