1. Select Your Word List: 0-20 mins

  • Default: No changes need to be made. 0 min
  • Custom: Add or remove words as you please. However keep in mind that to achieve decent performance from the engine, you need to train Count^2 samples, where Count represents the number of words in your word list. 20 mins

2. Record Audio Samples: 1week

The number of samples depend on the user but generally the more the merrier. Sample means an audio .wav file consisted of three spoken words.

Hint

  • Default Wordlist: At least 5K samples.

Start recording by pressing space.

Info

When the recording starts couple of words will be shown up on the screen. The default setting is 3 words and you have 3 seconds to say them. The time will be shown up on the screen. For best performance try to say the words not close to the edges, so give a small pause after start and try to finish up 200ms before time up.

Record

2. Training The Model: 1hr

  1. Train your model by Pressing T.

    Train

  2. Wait for Generate Enn Sample dialog to show up, this could take a while. Select No in the two following dialogs.

    Hint

    Generate Enn Samples is only required for training neural network. If you are a newcomer you can skip this for now. Next dialog is also only used in advanced mode. So you should be fine skipping both of them as mentioned. To learn more about these two features please checkout Advanced Training.

    GenENN

  3. Congrats! Your model is now ready.

More Info

A detailed description of Gym functionalities is discussed in the Gym User Guide.