Changes between Version 2 and Version 3 of HMMVoiceCreation-MARY-5.0


Ignore:
Timestamp:
11/07/11 16:29:04 (14 years ago)
Author:
marcela_charfuelan
Comment:

--

Legend:

Unmodified
Added
Removed
Modified
  • HMMVoiceCreation-MARY-5.0

    v2 v3  
    44For creating HMM-based voices we use a version of the speaker dependent training scripts provided by [http://hts.sp.nitech.ac.jp/ HTS] that was adapted to the MARY 4.1.0 platform. The steps for building a HMM voice for the MARY platform can be summarised in:[[BR]] 
    55 
    6 I) Download MARY TTS including Voice import tools[[BR]] 
    7 II) Check necessary programs and files[[BR]] 
    8 III) Check data: audio and text files[[BR]] 
    9 IV) Run the Voice import tools [[BR]] 
    10 V) Creating other voice in a language different from German or English (US). 
    11 VI) Adaptive scripts 
     6I) [#point1 Download MARY TTS including Voice import tools] [[BR]] 
     7II) [#point2 Check necessary programs and files] [[BR]] 
     8III) [#point3 Check data: audio and text files] [[BR]] 
     9IV) [#point4 Run the Voice import tools] [[BR]] 
     10V) [#point5 Creating other voice in a language different from German or English (US).] [[BR]] 
     11VI) [#point6 Adaptive scripts] [[BR]] 
     12 
    1213 
    1314The training scripts used here are the latest versions, that is, it is required HTS_2.2 and HTK-3.4.1. Some scripts have been added-modified to:[[BR]] 
     
    2122 
    2223''' 
    23 == I) Download MARY TTS including Voice import tools == 
     24== I) [=#point1] Download MARY TTS including Voice import tools == 
    2425''' 
    2526 
     
    3738 
    3839''' 
    39 == II) Check the necessary programs and files: == 
     40== II) [=#point2]Check the necessary programs and files: == 
    4041''' 
    4142 
     
    7879 
    7980''' 
    80 == III) Check data: audio and text files[[BR]] == 
     81== III) [=#point3] Check data: audio and text files[[BR]] == 
    8182''' 
    8283 
     
    8889 
    8990''' 
    90 == IV) Run the Voice Import tools == 
     91== IV) [=#point4] Run the Voice Import tools == 
    9192''' 
    9293 
     
    234235cp /voicebuildingdir/mary/voice-yourvoice-hsmm/target/voice-yourvoice-hsmm-5.0-SNAPSHOT.jar $MARY_BASE/target/marytts-5.0-SNAPSHOT/lib/ 
    235236}}} 
    236 ''' 
    237 === V) Creating other voice in a language different from German or English (US). === 
    238 ''' 
     237 
     238== '''V) [=#point5] Creating other voice in a language different from German or English (US)''' == 
     239  
    239240 
    240241If you are creating a voice in other language you will need to specify:  (NOTE: THIS NEED TO BE UPDATED) 
     
    254255 
    255256 
    256 === VI) Adaptive scripts === 
    257 ''' 
    258  
    259 '''1.''' For running the HTS Speaker adaptation/adaptive training demo we need the following directories in your voicebuilding directory: 
    260 text: 
    261  bdl clb slt jmk rms 
    262 wav: 
    263   bdl clb slt jmk rms 
    264  
    265 '''2.''' With the voicebuilding tools we need to create phonelabels and phonefeatures directories for each set of data. This can be done working each set with voicebuilding tools, that is, use the general settings to define where your wave, text, etc. directories are. Then for each data set run the steps 1-8 of the speaker dependent tutorial. As a result we should have the following directories: 
    266 phonelabels 
    267  bdl clb slt jmk rms 
    268 phonefeatures 
    269  bdl clb slt jmk rms 
    270  
    271 '''3.''' Create raw data from you wav files, this can be done using the script $MARY_BASE/lib/external/hts/data/scripts/wav2raw. As a results we should have the following directories: 
    272 hts/data/raw: 
    273  bdl clb slt jmk rms 
    274  
    275 '''4.''' Having the previous directories, the run the voiceimportools and excute the steps: 
    276 - HMMVoiceDataPreparation, setting the adaptScripts variable in true 
    277 - HMMVoiceConfigure, setting the adaptScripts variable in true 
     257 
     258 
     259== '''VI) [=#point6] Adaptive scripts''' == 
     260 
     261 
     262 
     263 
     264'''1.''' For running the HTS Speaker adaptation/adaptive training demo we need the following directories in your voicebuilding directory:[[BR]] 
     265text:[[BR]] 
     266 bdl clb slt jmk rms[[BR]] 
     267wav:[[BR]] 
     268  bdl clb slt jmk rms[[BR]] 
     269 
     270 
     271'''2.''' With the voicebuilding tools we need to create phonelabels and phonefeatures directories for each set of data. This can be done working each set with voicebuilding tools, that is, use the general settings to define where your wave, text, etc. directories are. Then for each data set run the steps 1-8 of the speaker dependent tutorial. As a result we should have the following directories:[[BR]] 
     272phonelabels:[[BR]] 
     273 bdl clb slt jmk rms[[BR]] 
     274phonefeatures:[[BR]] 
     275 bdl clb slt jmk rms[[BR]] 
     276 
     277'''3.''' Create raw data from you wav files, this can be done using the script $MARY_BASE/lib/external/hts/data/scripts/wav2raw. As a results we should have the following directories:[[BR]] 
     278hts/data/raw:[[BR]] 
     279 bdl clb slt jmk rms[[BR]] 
     280 
     281'''4.''' Having the previous directories, the run the voiceimportools and excute the steps:[[BR]] 
     282'''4.1''' HMMVoiceDataPreparation, setting the adaptScripts variable in true[[BR]] 
     283'''4.2''' HMMVoiceConfigure, setting the adaptScripts variable in true[[BR]] 
    278284 
    279285If adapting other sets, be aware of the file names format for the adaptive scripts. Since it is used a mask for the names it is better if the names of your files have a particular format.  
     
    286292}}} 
    287293 
    288 Having this distribution of files, our settings for configureAdapt looked like: 
     294Having this distribution of files, our settings for configureAdapt looked like:[[BR]] 
    289295{{{ 
    290296HMMVoiceConfigure.dataSet        = pavoque 
     
    296302}}} 
    297303 
    298 - HMMVoiceFeatureSelection 
    299 - HMMVoiceMakeData,  setting the adaptScripts variable in true 
    300 - HMMVoiceMakeVoice 
    301 - HMMVoiceCompiler 
     304'''4.3''' HMMVoiceFeatureSelection[[BR]] 
     305'''4.4''' HMMVoiceMakeData,  setting the adaptScripts variable in true[[BR]] 
     306'''4.5''' HMMVoiceMakeVoice[[BR]] 
     307'''4.6''' HMMVoiceCompiler[[BR]] 
    302308 
    303309