
Changes between Version 7 and Version 8 of HMMVoiceCreationMary4.0

Timestamp: 09/23/09 16:53:24
Author: marcela_charfuelan
Comment: --

HMMVoiceCreationMary4.0

-                      v7
+                      v8
 III) Training of HMM models[[BR]]
 IV)  Adding a new HMM voice in the Mary system.[[BR]]
-V)   Creating other voice in German or English (''to train a HMM voice with another speech database'').[[BR]]
+V)   Creating other voice in German (''to train a HMM voice with another speech database'').[[BR]]
 VI) (''NEW'') Creating other voice in a language different from German or English (US).
 …
 - EHMM for automatic labeling, available with festvox-2.1 link: http://festvox.org/download.html [[BR]]
-The HTS demo for MARY 4.0 beta, includes a shell script "check_programs.sh" that will help you to check if all the previous
-programs are installed, otherwise it will suggest how they can be installed.
+The HTS demo for MARY 4.0 beta includes a shell script, "check_programs.sh", that helps you check whether all the programs listed above are installed.
 …
 Where to start? There are three options:
-'''a-''' If you would like to try the HTS-SLT-demo (slt voice) for MARY 4.0 from scratch:[[BR]]
-Download the HTS-demo_CMU-ARCTIC-SLT_for_MARY-4.0-beta.tar.gz (92MB) and un-zip, un-tar it in a directory where you will
-train the SLT voice.[[BR]]
-'''b-''' If you have already created a unit selection voice for MARY, with the SLT data, and want to build a HMM-based voice for that:[[BR]]
-Copy the openmary/lib/hts/HTS-demo_for_MARY-4.0-beta.tar.gz (112K) in your unit selection voice creation directory and un-zip, un-tar the file.[[BR]]
-If you have already created a unit selection voice for this data, most probably you have already created phonefeatures, phonelab and a mary/features.txt file for that, so you can run steps 1-3 and skip steps 4-11.
+'''a-''' If you would like to try the HTS-demo_CMU-ARCTIC-SLT for MARY 4.0 beta from scratch:[[BR]]
+Download the HTS-demo_CMU-ARCTIC-SLT_for_MARY-4.0-beta.tar.gz (92MB), unpack the file and go to that directory:
+{{{
+   tar -zxvf HTS-demo_CMU-ARCTIC-SLT_for_MARY-4.0-beta.tar.gz
+   cd HTS-demo_CMU-ARCTIC-SLT_for_MARY-4.0-beta
+}}}
+'''b-''' If you have already created a unit selection voice for MARY with the CMU-ARCTIC-SLT data and want to build a HMM-based voice for it,
+copy the openmary/lib/hts/HTS-demo_for_MARY-4.0-beta.tar.gz (112K) into your unit selection voice creation directory and unpack the file:
+{{{
+   tar -zxvf HTS-demo_for_MARY-4.0-beta.tar.gz
+}}}
+If you have already created a unit selection voice for this data, you have most probably already created the phonefeatures, phonelab and mary/features.txt files, so you can run steps 1-3, skip steps 4-11 and continue with section III (Training of HMM models).
 '''c-''' If you want to create a HMM voice in another language, please see section V or VI below.[[BR]]
 …
 ./check_programs.sh
 }}}
-This is a simple shell script that will check first, if the programs are available in the PATH and report what is missing. You can provide the paths where you install the programs if they are not found in the PATH. The script will suggest how to install missing programs.
+This is a simple shell script that checks which programs are available in the PATH and reports what is missing. If the required programs are not found in the PATH, you can provide the paths where you have installed them. The script also checks the minimal requirements for programs and versions and suggests how to install missing programs.
+If all the necessary programs are installed correctly, you can continue with step 2.
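The kind of PATH check described above can be sketched as follows. This is only an illustration and not the contents of check_programs.sh; the function name `find_missing` is hypothetical, and the program names you pass to it are up to you.

```shell
#!/bin/sh
# Hypothetical sketch: report which of the given programs cannot be found in PATH.
# This is NOT the check_programs.sh script shipped with the HTS demo.
find_missing() {
    missing=""
    for prog in "$@"; do
        # command -v succeeds only if the shell can resolve the name
        if ! command -v "$prog" >/dev/null 2>&1; then
            missing="$missing $prog"
        fi
    done
    # Print the missing names separated by spaces, without the leading blank
    echo "${missing# }"
}
```

Calling `find_missing` with the names of the required programs prints the ones that still need to be installed, or an empty line if all of them were found.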
 '''2-''' Run the VoiceImport program
-First of all you need to set your MARY_BASE directory and then run the program:  [[BR]]
+If you have installed MARY 4.0 beta and the voicebuilding component, you can start the VoiceImport program from '''Applications -> OpenMary -> Voice import tools'''.[[BR]]
+Otherwise, you can run it from a terminal in your working directory (the directory where you have unpacked the HTS demo for MARY 4.0 beta). First set your MARY_BASE directory, then run the voiceimport.jar program:
 {{{
    export MARY_BASE="/dir/to/openmary"
 …
 }}}
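Before launching the tool, it can help to confirm that MARY_BASE is set and points to an existing directory. A minimal sketch; the function name `check_mary_base` is hypothetical and not part of MARY.

```shell
#!/bin/sh
# Hypothetical helper (not part of MARY): verify the MARY_BASE variable.
check_mary_base() {
    # Fail if the variable is unset or empty
    if [ -z "${MARY_BASE:-}" ]; then
        echo "MARY_BASE is not set" >&2
        return 1
    fi
    # Fail if it does not name an existing directory
    if [ ! -d "$MARY_BASE" ]; then
        echo "MARY_BASE does not point to a directory: $MARY_BASE" >&2
        return 1
    fi
    return 0
}
```

Running `check_mary_base` right after the `export` line returns a non-zero status and prints a message on standard error if the path is wrong.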
-When starting the voiceimport tools please provide information for:
+When starting the voiceimport tools, go to your working directory (the directory where you have unpacked the HTS demo for MARY 4.0 beta) and provide information for:
 {{{
   db.gender    = female
   db.locale    = en_US
   db.marybase  = /path-to-MARY_BASE/ or /path-to-openmary/
-  db.voicename = hsmm-slt
+  db.voicename = slt-hsmm
 }}}
 If you are not familiar or have problems with the VoiceImport program, please read the instructions in the Voice Import Tools
 …
 }}}
-'''3-''' Run the HMMVoiceDataPreparation of the HMM Voice Trainer group to check if text, wav and data/raw files are available and in the correct paths.
-If just data/raw is provided, the program will do the conversion.  If no text files are available but data/utts in festival format, the program will do the conversion as well.
+'''3-''' Run the HMMVoiceDataPreparation component of the HMM Voice Trainer group to check that the text and wav (or raw) files are available and in the correct paths.
+If only raw files are provided, the program will convert them. If no text files are available but utts in festival format are, the program will convert those as well.
 '''4-''' Run the AllophonesExtractor of the Automatic Labeling group to create the '''prompt_allophones''' directory required in the next step. This component requires the MARY server. [[BR]]
 …
 '''
-=== V) Creating other voice in German or English. ===
+=== V) Creating other voice in German ===
 '''
 …
   * a wav or raw directory with the speech files you will use for training the German voice. [[BR]]
   * transcriptions of the files, one text file per speech file, or transcriptions in festival format if available. [[BR]]
-then copy the openmary/lib/hts/HTS-demo_for_MARY-4.0-beta.tar.gz file in the directory where you have your wav and transcription data and un-zip, un-tar the file.[[BR]]
+Then copy the openmary/lib/hts/HTS-demo_for_MARY-4.0-beta.tar.gz file into the directory where you have your wav and transcription data and unpack the file:
+{{{
+   tar -zxvf HTS-demo_for_MARY-4.0-beta.tar.gz
+}}}
 Once you have unpacked the HTS demo for MARY 4.0 beta, follow the instructions as normal from step 1. Provide general settings for:
 …
-- If you have already created a German unit selection voice for MARY and want to build a HMM-based voice for that:[[BR]]
-copy the openmary/lib/hts/HTS-demo_for_MARY-4.0-beta.tar.gz (112K) in your unit selection voice creation directory and un-zip, un-tar the file.[[BR]]
-If you have already created a unit selection voice for German, most probably you have already created phonefeatures, phonelab and a mary/features.txt file for that, so you can run steps 1-3 and skip steps 4-11.
+- If you have already created a German unit selection voice for MARY and want to build a HMM-based voice for it, copy the openmary/lib/hts/HTS-demo_for_MARY-4.0-beta.tar.gz (112K) into your unit selection voice creation directory and unpack the file:
+{{{
+   tar -zxvf HTS-demo_for_MARY-4.0-beta.tar.gz
+}}}
+If you have already created a unit selection voice for German, you have most probably already created the phonefeatures, phonelab and mary/features.txt files, so you can run steps 1-3, skip steps 4-11 and continue with section III (Training of HMM models).
 …
 - '''Phoneme set''':  contained in MARY_BASE/lib/modules/xx/lexicon/allophones.xx.xml , where xx corresponds to the new language.
-- After creating the minimal components, follow the steps in section V.
+- After creating the minimal components, you will need wav files (in a wav directory) and the corresponding transcriptions (one file per wav file, in a text directory). [[BR]]
+Then copy the openmary/lib/hts/HTS-demo_for_MARY-4.0-beta.tar.gz file into the directory where you have your wav and transcription data and unpack the file:
+{{{
+   tar -zxvf HTS-demo_for_MARY-4.0-beta.tar.gz
+}}}
+Once you have unpacked the HTS demo for MARY 4.0 beta, follow the instructions as normal from step 1. Provide general settings for:
+{{{
+   db.gender    =  male  (or female)
+   db.locale    =  new_language locale (according to your minimal NLP components, e.g. tr for Turkish, te for Telugu, etc.)
+   db.marybase  =  /path/to/mary/base/
+   db.voicename =  new_language_voice_name
+}}}
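The data layout described above (a wav directory and a text directory with one transcription per wav file) can be checked before running the import tools. A minimal sketch, assuming that each transcription shares the base name of its wav file and uses a .txt extension; that naming convention and the function name `list_unpaired_wavs` are assumptions made for illustration, not taken from this page.

```shell
#!/bin/sh
# Hypothetical helper: print the base name of every wav file that has no
# matching transcription. Assumes <dir>/wav/NAME.wav pairs with <dir>/text/NAME.txt.
list_unpaired_wavs() {
    dir="${1:-.}"
    for wav in "$dir"/wav/*.wav; do
        # Skip the unexpanded pattern when the wav directory has no .wav files
        [ -e "$wav" ] || continue
        base=$(basename "$wav" .wav)
        if [ ! -f "$dir/text/$base.txt" ]; then
            echo "$base"
        fi
    done
}
```

Run from the voice directory, `list_unpaired_wavs .` prints nothing when every wav file has a transcription.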
 [[BR]]