Software Engineer, Data Infrastructure & Acquisition - Bloomington, IN, USA

4 days, 22 hours ago
Full-time
Senior
Software Development
Speechify

Speechify is a top-rated text-to-speech AI app with voice cloning and dubbing features, serving over 10 million users, including students and professionals.

Internet Software & Services
51-250 employees
Founded 2017

Description

  • Find new sources of audio data and integrate them into the ingestion pipeline.
  • Operate and extend cloud infrastructure for the ingestion pipeline on GCP using Terraform.
  • Collaborate with scientists to improve the cost, throughput, and quality of dataset generation.
  • Work with AI team members and Speechify leadership to define the dataset roadmap for next-generation products.
  • Support data collection efforts that enable model training operations at petabyte scale.
  • Help build richer datasets at larger scale and lower cost for future models.

Requirements

  • BS, MS, or PhD in Computer Science or a related field.
  • 5+ years of industry experience in software development.
  • Proficiency with Bash and Python scripting in Linux environments.
  • Proficiency in Docker and infrastructure-as-code concepts.
  • Professional experience with at least one major cloud provider, preferably GCP.
  • Experience with web crawlers and large-scale data processing workflows is a plus.
  • Ability to handle multiple tasks and adapt to changing priorities.
  • Strong written and verbal communication skills.

Benefits

  • Competitive United States base salary of $140,000-$200,000 plus bonus and equity, depending on experience.
  • Opportunity to work in a fully distributed, 100% remote environment.
  • Fast-growing, entrepreneurial team with a laid-back, asynchronous culture.
  • Hands-off management that lets you focus on your work.
  • Opportunity to make a significant impact on a life-changing product used by millions.
  • Work on products that support people with learning differences and accessibility needs.
  • Opportunity to build in the rapidly growing intersection of artificial intelligence and audio.
