Research Scientist, Text-to-Speech

2 months, 1 week ago
Contract
Senior
Software Development
Oddin.gg

Oddin.gg

Oddin.gg is a top B2B esports betting provider, offering cutting-edge products like odds feeds and risk management to maximize profitability for partners worldwide.

Hotels, Restaurants & Leisure
51-250

Description

  • Research and train state-of-the-art text-to-speech models for realistic and emotional voice generation.
  • Experiment with different model architectures and datasets to improve TTS quality and inference speed.
  • Select the best-performing approaches and bring them into production.
  • Stay current with the latest research in speech synthesis and generative AI.
  • Generate new ideas for model and product improvements based on research developments.
  • Collaborate closely with a team of three TTS researchers.
  • Work with engineering and hardware teams to support deployment and production readiness.

Requirements

  • Experience training text-to-speech or voice cloning models.
  • Strong knowledge of transformers, diffusion models, and GANs.
  • Understanding of human speech and audio processing, including sampling, spectrograms, and vocoders.
  • Proficiency in Python and key ML libraries such as PyTorch and Hugging Face Transformers.
  • Ability to read research papers, implement approaches, and keep up with current literature.
  • Strong machine learning fundamentals and critical thinking skills.
  • Familiarity with modern speech synthesis models such as GPT-based, flow matching, Vevo, StyleTTS, IndexTTS, or MaskGCT is a plus.
  • Contributions to open-source AI tools or research publications in speech processing are a plus.
  • Familiarity with AWS or similar compute clusters is a plus.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Senior Engineer

Motional 1K-5K Automotive

Motional is seeking a researcher to help improve autonomous driving models by mining challenging scenarios, analyzing model behavior, and supporting continuous learning workflows for driverless vehicle development.

Agile AWS CI/CD Deep Learning Looker Machine Learning MongoDB MySQL PostgreSQL Python Redash SQL SQLite
6 hours, 58 minutes ago

Principal Software Engineer - Vector Search - Elasticsearch

Elastic 1K-5K Internet Software & Services

Elastic is hiring a Principal Software Engineer for its globally distributed Elasticsearch Search team to lead work on vector similarity search and help advance search relevance in Elasticsearch.

Cassandra CI/CD Elasticsearch Git GitHub Java Lucene MongoDB PostgreSQL Solr
6 hours, 58 minutes ago

Applied Research Scientist, LLM Evaluation & Post-Training

Innodata 1K-5K IT Services

Innodata is hiring an Applied Research Scientist to advance LLM and multimodal evaluation and post-training methods that improve model quality through rigorous research, experimentation, and customer-facing technical guidance.

Hugging Face LLM Machine Learning Python PyTorch Statistics TensorFlow
1 day, 6 hours ago

Lead Signal Processing Researcher

STR 251-1K Aerospace & Defense

STR’s Sensors Division is hiring a Lead Signal Processing Researcher to lead development and integration of advanced signal processing, optimization, and machine learning solutions for electronic warfare and sensor systems supporting national security missions.

C C++ Machine Learning MATLAB Python
2 days, 5 hours ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers