Disco Narrator - Data Scraping
To train an AI Text-To-Speech (TTS) model, we’ll need to obtain a Labelled Dataset with two things:
- Clean audio files, containing only the voice we’re cloning
- The dialogue transcript (text) for each audio file
To train an AI Text-To-Speech (TTS) model, we’ll need to obtain a Labelled Dataset with two things:
The default shell prompt (for Ubuntu 20) looks like this:
| |
Memoization is a part of the standard toolkit for, “things I can use to solve the algorithm question in my next job interview”. Most of the time, I like to use functools.cache for this:
