Changes between Version 7 and Version 8 of HMMVoiceCreationMary4.0
Timestamp: 09/23/09 16:53:24
HMMVoiceCreationMary4.0
--- v7
+++ v8
@@ -8,5 +8,5 @@
 III) Training of HMM models[[BR]]
 IV) Adding a new HMM voice in the Mary system.[[BR]]
-V) Creating other voice in German or English(''to train a HMM voice with another speech database'').[[BR]]
+V) Creating other voice in German (''to train a HMM voice with another speech database'').[[BR]]
 VI) (''NEW'') Creating other voice in a language different from German or English (US).
 
@@ -50,6 +50,5 @@
 - EHMM for automatic labeling, available with festvox-2.1 link: http://festvox.org/download.html [[BR]]
 
-The HTS demo for MARY 4.0 beta, includes a shell script "check_programs.sh" that will help you to check if all the previous
-programs are installed, otherwise it will suggest how they can be installed.
+The HTS demo for MARY 4.0 beta, includes a shell script "check_programs.sh" that will help you to check if all the previous programs are installed.
 
 
@@ -61,11 +60,17 @@
 Where to start? There are three options:
 
-'''a-''' If you would like to try the HTS-SLT-demo (slt voice) for MARY 4.0 from scratch:[[BR]]
-Download the HTS-demo_CMU-ARCTIC-SLT_for_MARY-4.0-beta.tar.gz (92MB) and un-zip, un-tar it in a directory where you will
-train the SLT voice.[[BR]]
-
-'''b-''' If you have already created a unit selection voice for MARY, with the SLT data, and want to build a HMM-based voice for that:[[BR]]
-Copy the openmary/lib/hts/HTS-demo_for_MARY-4.0-beta.tar.gz (112K) in your unit selection voice creation directory and un-zip, un-tar the file.[[BR]]
-If you have already created a unit selection voice for this data, most probably you have already created phonefeatures, phonelab and a mary/features.txt file for that, so you can run steps 1-3 and skip steps 4-11.
+'''a-''' If you would like to try the HTS-demo_CMU-ARCTIC-SLT for MARY 4.0 beta from scratch:[[BR]]
+Download the HTS-demo_CMU-ARCTIC-SLT_for_MARY-4.0-beta.tar.gz (92MB), unpack the file and go to that directory:
+{{{
+tar -zxvf HTS-demo_CMU-ARCTIC-SLT_for_MARY-4.0-beta.tar.gz
+cd HTS-demo_CMU-ARCTIC-SLT_for_MARY-4.0-beta
+}}}
+
+'''b-''' If you have already created a unit selection voice for MARY, with the CMU-ARCTIC-SLT data, and want to build a HMM-based voice for that,
+copy the openmary/lib/hts/HTS-demo_for_MARY-4.0-beta.tar.gz (112K) in your unit selection voice creation directory and unpack the file:
+{{{
+tar -zxvf HTS-demo_for_MARY-4.0-beta.tar.gz
+}}}
+If you have already created a unit selection voice for this data, most probably you have already created phonefeatures, phonelab and a mary/features.txt file for that, so you can run steps 1-3, skip steps 4-11 and continue with section III HMM models training.
 
 '''c-''' If you want to create a HMM voice in other language please see the section V or VI below.[[BR]]
@@ -77,10 +82,11 @@
 ./check_programs.sh
 }}}
-This is a simple shell script that will check first, if the programs are available in the PATH and report what is missing. You can provide the paths where you install the programs if they are not found in the PATH. The script will suggest how to install missing programs.
-
+This is a simple shell script that will check which programs are available in the PATH and report what is missing. You can provide the paths where you have installed the required programs if they are not found in the PATH. The script will check minimal requirements for programs, versions, suggest how to install missing programs, etc.
+If all the necessary programs are installed correctly you can continue with step 2.
 
 '''2-''' Run the VoiceImport program
 
-First of all you need to set your MARY_BASE directory and then run the program: [[BR]]
+If you have installed MARY 4.0 beta and the voicebuilding component, you can start the VoiceImport program from '''Applications -> OpenMary -> Voice import tools'''[[BR]]
+Otherwise you can run it on a terminal in your working directory (the directory where you have unpacked the HTS demo for MARY 4.0 beta), first you need to set your MARY_BASE directory and then run the voiceimport.jar program:
 {{{
 export MARY_BASE="/dir/to/openmary"
@@ -88,10 +94,10 @@
 }}}
 
-When starting the voiceimport tools please provide information for:
+When starting the voiceimport tools, go to your working directory (the directory where you have unpacked the HTS demo for MARY 4.0 beta) and provide information for:
 {{{
 db.gender = female
 db.locale = en_US
 db.marybase = /path-to-MARY_BASE/ or /path-to-openmary/
-db.voicename = hsmm-slt
+db.voicename = slt-hsmm
 }}}
 If you are not familiar or have problems with the VoiceImport program, please read the instructions in the Voice Import Tools
@@ -106,6 +112,6 @@
 }}}
 
-'''3-''' Run the HMMVoiceDataPreparation of the HMM Voice Trainer group to check if text, wav and data/raw files are available and in the correct paths.
-If just data/raw is provided, the program will do the conversion. If no text files are available but data/utts in festival format, the program will do the conversion as well.
+'''3-''' Run the HMMVoiceDataPreparation of the HMM Voice Trainer group to check if text, wav or raw files are available and in the correct paths.
+If just raw is provided, the program will do the conversion. If no text files are available but utts in festival format, the program will do the conversion as well.
 
 '''4-''' Run the AllophonesExtractor of the Automatic Labeling group to create the '''prompt_allophones''' directory required in the next step. This component requires the MARY server. [[BR]]
@@ -243,5 +249,5 @@
 
 '''
-=== V) Creating other voice in German or English.===
+=== V) Creating other voice in German ===
 '''
 
@@ -250,5 +256,8 @@
 * a wav or raw directory with the speech files you will use for training the German voice. [[BR]]
 * transcriptions of the files, one text file per speech file, or transcriptions in festival format if available. [[BR]]
-then copy the openmary/lib/hts/HTS-demo_for_MARY-4.0-beta.tar.gz file in the directory where you have your wav and transcription data and un-zip, un-tar the file.[[BR]]
+then copy the openmary/lib/hts/HTS-demo_for_MARY-4.0-beta.tar.gz file in the directory where you have your wav and transcription data and unpack the file:
+{{{
+tar -zxvf HTS-demo_for_MARY-4.0-beta.tar.gz
+}}}
 
 Once you have unpacked the HTS demo for MARY 4.0 beta, follow the instructions as normal from step 1. Provide general settings for:
@@ -261,8 +270,10 @@
 
 
-- If you have already created a German unit selection voice for MARY and want to build a HMM-based voice for that:[[BR]]
-copy the openmary/lib/hts/HTS-demo_for_MARY-4.0-beta.tar.gz (112K) in your unit selection voice creation directory and un-zip, un-tar the file.[[BR]]
-
-If you have already created a unit selection voice for German, most probably you have already created phonefeatures, phonelab and a mary/features.txt file for that, so you can run steps 1-3 and skip steps 4-11.
+- If you have already created a German unit selection voice for MARY and want to build a HMM-based voice for that, copy the openmary/lib/hts/HTS-demo_for_MARY-4.0-beta.tar.gz (112K) in your unit selection voice creation directory and unpack the file:
+{{{
+tar -zxvf HTS-demo_for_MARY-4.0-beta.tar.gz
+}}}
+
+If you have already created a unit selection voice for German, most probably you have already created phonefeatures, phonelab and a mary/features.txt file for that, so you can run steps 1-3, skip steps 4-11 and continue with section III HMM models training.
 
 
@@ -277,5 +288,20 @@
 - '''Phoneme set''': contained in MARY_BASE/lib/modules/xx/lexicon/allophones.xx.xml , where xx corresponds to the new language.
 
-- After creating the minimal components, follow the steps in section V.
+- After creating the minimal components, you will need wav files (in a wav directory) and the corresponding transcriptions (one file per wav file in a text directory). [[BR]]
+Then copy the openmary/lib/hts/HTS-demo_for_MARY-4.0-beta.tar.gz file in the directory where you have your wav and transcription data and unpack the file:
+{{{
+tar -zxvf HTS-demo_for_MARY-4.0-beta.tar.gz
+}}}
+
+Once you have unpacked the HTS demo for MARY 4.0 beta, follow the instructions as normal from step 1. Provide general settings for:
+{{{
+db.gender = male (or female)
+db.locale = new_language locale (according to your minimal NLP components, ex. tr for Turkish, te for Telugu, etc.)
+db.marybase = /path/to/mary/base/
+db.voicename = new_language_voice_name
+}}}
+
+
+
 
 [[BR]]
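
Section VI in v8 asks for wav files in a wav directory and one transcription file per wav file in a text directory. The following is a minimal sketch of how that pairing could be checked before starting the VoiceImport tools. It is not part of the HTS demo: the function name `missing_transcriptions` is hypothetical, and the `.txt` extension for transcription files is an assumption, since the page does not state one.

```shell
#!/bin/sh
# Hypothetical helper, not shipped with MARY or the HTS demo.
# Prints the basename of every wav file that has no matching transcription.
# Assumption: transcriptions are named <text_dir>/<basename>.txt.
missing_transcriptions() {
    wav_dir=$1
    text_dir=$2
    for wav in "$wav_dir"/*.wav; do
        # Skip the unexpanded pattern when the directory holds no wav files.
        [ -e "$wav" ] || continue
        base=$(basename "$wav" .wav)
        if [ ! -f "$text_dir/$base.txt" ]; then
            printf '%s\n' "$base"
        fi
    done
}
```

Called as `missing_transcriptions wav text` from the voice creation directory, it prints nothing when every wav file has a transcription, so an empty result suggests the data layout matches what section VI describes.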