Limited Keyword Speech Synthesis Leveraging Transfer Learning
- yarn install
- yarn start
The app requires microphone permission, which the browser will prompt for.
The samples that make up the datasets are stored in ArrayBuffer (raw byte) form. To see the actual samples (and to add more training data), upload a dataset file (found in /datasets) in the "Dataset IO" section of https://storage.googleapis.com/tfjs-speech-model-test/2019-01-03a/dist/index.html.
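As a rough illustration of what "ArrayBuffer/byte form" means, the sketch below round-trips a few numbers through an ArrayBuffer using a typed-array view. The sample values and variable names are hypothetical, and the real dataset files may carry extra structure (labels, headers) that is not shown here; this only demonstrates the general relationship between raw bytes and readable numbers.

```javascript
// Hypothetical sample: four float32 values standing in for audio data.
const original = new Float32Array([0.25, -0.5, 0.75, 1.0]);

// The "byte form": an ArrayBuffer holding the raw bytes (4 bytes per float32).
const bytes = original.buffer;

// A typed-array view interprets the same bytes as numbers again.
const decoded = new Float32Array(bytes);

console.log(bytes.byteLength, Array.from(decoded));
```

This is why the dataset files are not human-readable on their own: the bytes only become meaningful once a viewer interprets them with the right numeric type.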
This project takes a labelled dataset of word samples as its primary input.
Transfer learning is then used to retrain the model on this dataset of entirely new words.
The TensorFlow transfer learning API docs describe this process in more depth:
The modification process involves removing the top (output) dense layer of the original model and keeping the "base" of the model. Due to its previous training, the base can be used as a good feature extractor for any data similar to the original training data. The removed dense layer is replaced with a new dense layer configured specifically for the new dataset.
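The idea in the passage above can be sketched in plain JavaScript. This is a toy illustration under stated assumptions, not the TensorFlow.js implementation: `baseFeatures` is a hypothetical stand-in for the frozen pretrained base, and `createHead` / `trainHead` are hypothetical helpers that build and train a new dense (softmax) layer for the new word labels. Only the head's weights change during training.

```javascript
// Frozen "base": stands in for the pretrained layers. Nothing here is trained.
function baseFeatures(input) {
  const mean = input.reduce((s, v) => s + v, 0) / input.length;
  const peak = Math.max(...input.map(Math.abs));
  return [mean, peak, 1]; // trailing 1 acts as a bias term
}

function softmax(logits) {
  const max = Math.max(...logits);
  const exps = logits.map(v => Math.exp(v - max));
  const sum = exps.reduce((s, v) => s + v, 0);
  return exps.map(v => v / sum);
}

// New dense head: one weight row per new label, initialised to zero.
function createHead(numFeatures, numLabels) {
  return Array.from({length: numLabels}, () => new Array(numFeatures).fill(0));
}

function predict(head, input) {
  const f = baseFeatures(input);
  return softmax(head.map(row => row.reduce((s, w, i) => s + w * f[i], 0)));
}

// Train only the head with stochastic gradient descent on cross-entropy loss.
function trainHead(head, examples, epochs = 200, learningRate = 0.5) {
  for (let e = 0; e < epochs; e++) {
    for (const {input, label} of examples) {
      const f = baseFeatures(input);
      const probs = predict(head, input);
      head.forEach((row, k) => {
        const grad = probs[k] - (k === label ? 1 : 0);
        row.forEach((_, i) => { row[i] -= learningRate * grad * f[i]; });
      });
    }
  }
  return head;
}

// Hypothetical labelled dataset of two new "words".
const labels = ['red', 'green'];
const examples = [
  {input: [0.9, 0.8, 1.0], label: 0},
  {input: [1.0, 0.7, 0.9], label: 0},
  {input: [-0.9, -0.8, -1.0], label: 1},
  {input: [-1.0, -0.9, -0.7], label: 1},
];

const head = trainHead(createHead(3, labels.length), examples);
const probs = predict(head, [0.8, 0.9, 0.7]);
const predicted = labels[probs.indexOf(Math.max(...probs))];
console.log(predicted);
```

Because the base is reused unchanged, only a small number of weights need to be learned, which is why a handful of samples per word can be enough to retrain.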
You may upload your own pre-defined set of speech commands via this syntax
This project integrates an in-browser JavaScript IDE to allow for simple JavaScript programming.