Full Training - Benjamin-ASR

Full Training

1. Select Your Word List: 0-20 mins

Default: No changes need to be made. 0 min
Custom: Add or remove words as you please. However keep in mind that to achieve decent performance from the engine, you need to train Count^2 samples, where Count represents the number of words in your word list. 20 mins

2. Record Audio Samples: 1week

The number of samples depend on the user but generally the more the merrier. Sample means an audio .wav file consisted of three spoken words.

Hint

Default Wordlist: At least 5K samples.

Start recording by pressing space.

Info

When the recording starts couple of words will be shown up on the screen. The default setting is 3 words and you have 3 seconds to say them. The time will be shown up on the screen. For best performance try to say the words not close to the edges, so give a small pause after start and try to finish up 200ms before time up.

Record

2. Training The Model: 1hr

Train your model by Pressing T.
Wait for Generate Enn Sample dialog to show up, this could take a while. Select No in the two following dialogs.

Hint

Generate Enn Samples is only required for training neural network. If you are a newcomer you can skip this for now. Next dialog is also only used in advanced mode. So you should be fine skipping both of them as mentioned. To learn more about these two features please checkout Advanced Training.
Congrats! Your model is now ready.

More Info

A detailed description of Gym functionalities is discussed in the Gym User Guide.