1. Select Your Word List: 0-20 mins
- Default: No changes need to be made.
0 min
- Custom: Add or remove words as you please. However keep in mind that to achieve decent performance from the engine, you need to train
Count^2
samples, whereCount
represents the number of words in your word list.20 mins
2. Record Audio Samples: 1week
The number of samples depend on the user but generally the more the merrier. Sample means an audio .wav
file consisted of three spoken words.
Hint
- Default Wordlist: At least 5K samples.
Start recording by pressing space
.
Info
When the recording starts couple of words will be shown up on the screen. The default setting is 3 words and you have 3 seconds to say them. The time will be shown up on the screen. For best performance try to say the words not close to the edges, so give a small pause after start and try to finish up 200ms before time up.
2. Training The Model: 1hr
-
Train your model by Pressing
T
. -
Wait for
Generate Enn Sample
dialog to show up, this could take a while. SelectNo
in the two following dialogs.Hint
Generate Enn Samples is only required for training neural network. If you are a newcomer you can skip this for now. Next dialog is also only used in advanced mode. So you should be fine skipping both of them as mentioned. To learn more about these two features please checkout Advanced Training.
-
Congrats! Your model is now ready.
More Info
A detailed description of Gym functionalities is discussed in the Gym User Guide.