Context Navigation

Changes between Version 1 and Version 2 of HMMVoiceCreation-MARY-5.0

Timestamp:: 11/07/11 15:57:58 (14 years ago)
Author:: marcela_charfuelan
Comment:: --

Legend:

: Unmodified
: Added
: Removed
: Modified

HMMVoiceCreation-MARY-5.0

-                      v1
+                      v2
 IV) Run the Voice import tools [[BR]]
 V) Creating other voice in a language different from German or English (US).
+VI) Adaptive scripts
 The training scripts used here are the latest versions, that is, it is required HTS_2.2 and HTK-3.4.1. Some scripts have been added-modified to:[[BR]]
 - Use MARY instead of festival as text analyzer.[[BR]]
+- Use MARY instead of festival as text analyser.[[BR]]
 - Train bandpass voicing strengths for mixed excitation.[[BR]]
 …
 '''
+In your voice building directory run the voice import tools:
+{{{
+In your voice building directory run the voice import tools (trunk version):
+{{{
+export MARY_BASE="/your/directory/openmary/"
+java -cp $MARY_BASE/marytts-lang-en/target/marytts-lang-en-5.0-SNAPSHOT.jar:$MARY_BASE/marytts-builder/target/marytts-builder-5.0-SNAPSHOT-jar-with-dependencies.jar marytts.tools.voiceimport.DatabaseImportMain
+}}}
+{{{#!comment
 export MARY_BASE="/your/path/to/MARY TTS/"
 java -Xmx1024m -jar $MARY_BASE/java/voiceimport.jar
 …
 The result of this step is a '''lab''' directory.
 '''4-''' Run the TranscriptionAligner component of the Label-Transcript Alignment group.  This program will create the '''allophones''' directory.
 '''5-''' Run the PhoneUnitLabelComputer component of the Label-Transcript Alignment group. This procedure has as input the '''lab''' directory and will create as an output the  '''phonelab''' directory.
+'''4-''' Run the PhoneUnitLabelComputer component of the Label-Transcript Alignment group. This procedure has as input the '''lab''' directory and will create as an output the  '''phonelab''' directory.
+'''5-''' Run the TranscriptionAligner component of the Label-Transcript Alignment group.  This program will create the '''allophones''' directory.
 '''6-''' Run the FeatureSelelection component of the Feature Extraction group. This program will create a '''mary/features.txt''' file, it requires the MARY server running. Select here all the features and save the file.
 …
 Using the settings editor of this component you can also change other variables like using LSP instead og MGC, sampling frequency, etc., the same as you would do when running "make configure + parameters" with the original HTS scripts.
+'''11-''' Run the HMMVoiceFeatureSelection component of the HMM Voice trainer group. This program reads the '''mary/features.txt''' file (created in step 6), and generates the file '''mary/hmmFeatures.txt'''. The hmmFeatures.txt file contains extra features, apart from phone and phonological features, that will be used to train HMMs. Select on the window extra features for training or simply copy on the window the following:[[BR]]
+{{{
+accented
+next_tobi_endtone
+onsetcoda
+prev_accent
+next_is_pause
+tobi_accent
+syl_break
+pos_in_syl
+stressed
+prev_syl_break
+segs_from_word_start
+selection_prosody
+prev_is_pause
+next_tobi_accent
+syls_from_phrase_start
+words_from_phrase_start
+tobi_endtone
+}}}
+Delete other features and save.
+'''11-''' Run the HMMVoiceFeatureSelection component of the HMM Voice trainer group. This program reads the '''mary/features.txt''' file (created in step 6), and generates the file '''mary/hmmFeatures.txt'''. The hmmFeatures.txt file contains extra features, apart from phone and phonological features, that will be used to train HMMs. You can select or delete on the window extra context features (all can be used).
 '''12-''' Run the HMMVoiceMakeData component of the HMM Voice trainer group to run the HTS procedure "make data". This procedure require the following files:
 …
 The procedures can be repeated manually as well, going to the hts/data directory and running "make str-mary" and "make cmp-mary".
-NOTE: the Makefile in data/ includes a gv: section to calculate global variance files. In MARY, these files are generated little endian and contain a header of size one short to indicate the size of the vectors it contains.
 …
 '''
+'''14-''' Run the HMMVoicePackager component of the Install Voice group. The default setting values of this component are already fixed for the HTS-demo_CMU-ARCTIC-SLT voice. Some settings of the voice can be changed here, for example:
+{{{
+'''14-''' Run the HMMVoiceCompiler component of the Install Voice group. The default setting values of this component are already fixed.
+{{{#!comment
+Some settings of the voice can be changed here, for example:
   HMMVoicePackager.useMixExc   =  true
                                    set this variable to true if using mixed excitation
 …
 }}}
 The HMMVoicePackager will pack in a zip file located in MARY_BASE/download the following files:  [[BR]]
 - A mary config file: german-hsmm-voice.config [[BR]]
+The HMMVoiceCompiler will pack in a zip file located in /voicebuildingdir/mary/voice-yourvoice-hsmm/target/voice-yourvoice-hsmm-5.0-SNAPSHOT.zip the following files:  [[BR]]
+- A mary config file: voice.config [[BR]]
 - HMM files corresponding to this voice:
   - one example of phonefeatures for testing the synthesiser: data/phonefeatures/cmu_us_arctic_slt_xxxx.pfeats  [[BR]]
+  - one example of phonefeatures for testing the synthesiser: data/phonefeatures/features_example.pfeats  [[BR]]
   - the HTS trees: voices/qst001/ver1/*.inf  [[BR]]
   - the HTS PDF models: voices/qst001/ver1/*.pdf [[BR]]
   - global variance models (if useGV is set to true): data/gv/gv-*-littend.pdf [[BR]]
+  - global variance models (if useGV is set to true): voices/qst001/ver1/gv-*.pdf [[BR]]
   - filter taps for mixed excitation: data/filters/mix_excitation_filters.txt [[BR]]
   - trickyPhones.txt file, if one was created during training [[BR]]
 …
 After successfully packing a new voice, you must run the MARY Component Installer to install the voice!
+NOTE: workaround until the component installer is updated:
+{{{
+cp /voicebuildingdir/mary/voice-yourvoice-hsmm/target/voice-yourvoice-hsmm-5.0-SNAPSHOT.jar $MARY_BASE/target/marytts-5.0-SNAPSHOT/lib/
+}}}
 '''
 === V) Creating other voice in a language different from German or English (US). ===
 '''
 If you are creating a voice in other language you will need to specify:
+If you are creating a voice in other language you will need to specify:  (NOTE: THIS NEED TO BE UPDATED)
 - '''Minimal NLP components''': if you are creating a new voice from scratch, for example following the steps in [http://mary.opendfki.de/wiki/NewLanguageSupport NewLanguageSupport], you will need to create Minimal NLP components for the new language. These minimal components are necessary to run the MARY server in the new language and extract context features ('''phonefeatures''' directory).
 …
+=== VI) Adaptive scripts ===
+'''
+'''1.''' For running the HTS Speaker adaptation/adaptive training demo we need the following directories in your voicebuilding directory:
+text:
+ bdl clb slt jmk rms
+wav:
+  bdl clb slt jmk rms
+'''2.''' With the voicebuilding tools we need to create phonelabels and phonefeatures directories for each set of data. This can be done working each set with voicebuilding tools, that is, use the general settings to define where your wave, text, etc. directories are. Then for each data set run the steps 1-8 of the speaker dependent tutorial. As a result we should have the following directories:
+phonelabels
+ bdl clb slt jmk rms
+phonefeatures
+ bdl clb slt jmk rms
+'''3.''' Create raw data from you wav files, this can be done using the script $MARY_BASE/lib/external/hts/data/scripts/wav2raw. As a results we should have the following directories:
+hts/data/raw:
+ bdl clb slt jmk rms
+'''4.''' Having the previous directories, the run the voiceimportools and excute the steps:
+- HMMVoiceDataPreparation, setting the adaptScripts variable in true
+- HMMVoiceConfigure, setting the adaptScripts variable in true
+If adapting other sets, be aware of the file names format for the adaptive scripts. Since it is used a mask for the names it is better if the names of your files have a particular format.
+For example we have experimented adapting a neutral voice to different styles with the male German PAVOQUE database. For this database the file names have the format:
+{{{
+neutr --> pavoque_neutr_*.*    training data, big corpus, male voice with neutral style.
+obadi --> pavoque_obadi_*.*    data for adaptation, small corpus, the same male voice but with depressed style.
+poppy --> pavoque_poppy_*.*    data for adaptation, small corpus, the same male voice but with happy style.
+spike --> pavoque_spike_*.*    data for adaptation, small corpus, the same male voice but with angry style.
+}}}
+Having this distribution of files, our settings for configureAdapt looked like:
+{{{
+HMMVoiceConfigure.dataSet        = pavoque
+HMMVoiceConfigure.adaptTrainSpkr = neutr
+HMMVoiceConfigure.adaptSpkr      = 'obadi poppy spike'
+HMMVoiceConfigure.adaptSpkrMask  = */pavoque_%%%%%_*
+                                    (here the voice names are exactly 5 letters long, it can not be a voice name with more that 5 letters!)
+HMMVoiceConfigure.adaptF0Ranges  = 'neutr 40 280 obadi 40 280 poppy 40 280 spike 40 280'
+}}}
+- HMMVoiceFeatureSelection
+- HMMVoiceMakeData,  setting the adaptScripts variable in true
+- HMMVoiceMakeVoice
+- HMMVoiceCompiler
 [[BR]]