Context Navigation

Changes between Version 13 and Version 14 of HMMVoiceCreationMary4.0

Timestamp:: 12/08/09 12:40:03 (16 years ago)
Author:: marcela_charfuelan
Comment:: --

Legend:

: Unmodified
: Added
: Removed
: Modified

HMMVoiceCreationMary4.0

-                      v13
+                      v14
+= '''Voice Import Tools Tutorial : How to build a HMM-based voice for the MARY 4.0 (beta) platform''' =
+For creating HMM-based voices we use a version of the speaker dependent training scripts provided by [http://hts.sp.nitech.ac.jp/ HTS] that was adapted to the MARY 4.0 beta platform. The steps for building a HMM voice for the MARY platform can be summarised in:[[BR]]
+I)   Checking the necessary programs and files[[BR]]
+II)  Data preparation[[BR]]
+III) Training of HMM models[[BR]]
+IV)  Adding a new HMM voice in the Mary system.[[BR]]
+V)   Creating other voice in German (''to train a HMM voice with another speech database'').[[BR]]
+VI) (''NEW'') Creating other voice in a language different from German or English (US).
+The previous steps will be explained below creating a HMM voice using the HTS '''speaker dependent training demo''' adapted to the MARY 4.0 beta platform.[[BR]]
+= '''Voice Import Tools Tutorial : How to build a HMM-based voice for the MARY 4.0 platform''' =
+For creating HMM-based voices we use a version of the speaker dependent training scripts provided by [http://hts.sp.nitech.ac.jp/ HTS] that was adapted to the MARY 4.0 platform. The steps for building a HMM voice for the MARY platform can be summarised in:[[BR]]
+I) Download MARY TTS including Voice import tools[[BR]]
+II) Check necessary programs and files[[BR]]
+III) Check data: audio and text files[[BR]]
+IV) Run the Voice import tools [[BR]]
+V) Creating other voice in a language different from German or English (US).
 The training scripts used here are the latest versions, that is, it is required HTS_2.1 and SPTK-3.2. Some scripts have been added-modified to:[[BR]]
 …
 '''
+=== I) Checking the necessary programs and files: ===
+'''
+== I) Download MARY TTS including Voice import tools ==
+'''
+Click on the latest MARY release [http://mary.dfki.de/download/4.0%20beta/openmary-standalone-install-4.0beta.jar MARY download] or download the file and run it with:
+{{{
+java -jar openmary-standalone-install-4.0beta.jar
+}}}
+'''
+== II) Check the necessary programs and files: ==
+'''
+To facilitate the checking and installation of the necessary external programs, once installed MARY TTS open a command line shell in your voice building directory and run:
+{{{
+$MARY_BASE/lib/external/download_install_external_programs.sh
+}}}
+With the option '''-check''', this script will check if the necessary programs and versions are installed (that is, the programs can be found in the PATH or in the paths provided by the user); with the option '''-install''' this script will try to download and install the necessary programs in: $MARY_TTS/lib/external/bin (if problems, it will suggest how to install manually the programs).
+If you have already installed some of the required programs, you can provide the paths (if they are not in the PATH), for example:
+{{{
+$MARY_BASE/lib/external/download_install_external_programs.sh -check /your/path/to/htk/bin /your/path/to/Festival/festvox/src/ehmm/bin
+}}}
+The necessary programs that this script checks are:[[BR]]
 '''MARY requirements:'''[[BR]]
 - Operating System - Linux (tested on Ubuntu 9.04) [[BR]]
 - MARY TTS 4.0 (beta) including Voice import tools during installation - link: [http://mary.dfki.de/download/4.0%20beta/openmary-standalone-install-4.0beta.jar MARY TTS 4.0 beta] [[BR]]
+- HTS '''speaker dependent training demo''' adapted to the MARY 4.0 beta platform:
+     * without CMU-ARCTIC-SLT data (112K):  included in your MARY TTS 4.0 beta installation: $MARY_BASE/lib/hts/HTS-demo_for_MARY-4.0-beta.tar.gz [[BR]]
+     * with CMU-ARCTIC-SLT data (92MB) - link: [http://mary.dfki.de/download/4.0%20beta/HTS-demo_CMU-ARCTIC-SLT_for_MARY-4.0-beta.tar.gz HTS-demo_CMU-ARCTIC-SLT_for_MARY-4.0-beta] [[BR]]
+'''HTS requirements:''' please download and follow the instructions for installing:[[BR]]
+- HTS '''speaker dependent training demo''' adapted to the MARY 4.0 beta platform, included in your MARY TTS 4.0 beta installation.
+'''HTS requirements:'''[[BR]]
 - [http://hts.sp.nitech.ac.jp/archives/2.1/HTS-2.1_for_HTK-3.4.tar.bz2 HTS-2.1_for_HTK-3.4.patch] [[BR]]
 - HTK-3.4 and HDecode patched with HTS-2.1_for_HTK-3.4.patch links:
 …
 - [http://downloads.sourceforge.net/hts-engine/hts_engine_API-1.01.tar.gz hts_engine_API-1.01] [[BR]]
 '''Other requirements:''' the following programs are also required: [[BR]]
+'''Other requirements:'''[[BR]]
 - awk normally available in linux [[BR]]
 - perl normally available in linux [[BR]]
 …
 - EHMM for automatic labeling, available with [http://festvox.org/download.html festvox-2.1] [[BR]]
+The HTS demo for MARY 4.0 beta, includes a shell script "check_programs.sh" that will help you to check if all the previous programs are installed.
+'''
+=== II) Data preparation (): ===
+'''
+Where to start? There are three options a, b and c:
+'''a-''' If you would like to try the HTS-demo_CMU-ARCTIC-SLT for MARY 4.0 beta from scratch:[[BR]]
+Download the [http://mary.dfki.de/download/4.0%20beta/HTS-demo_CMU-ARCTIC-SLT_for_MARY-4.0-beta.tar.gz HTS-demo_CMU-ARCTIC-SLT_for_MARY-4.0-beta] (92MB), unpack the file and go to that directory:
+{{{
+   tar -zxvf HTS-demo_CMU-ARCTIC-SLT_for_MARY-4.0-beta.tar.gz
+   cd HTS-demo_CMU-ARCTIC-SLT_for_MARY-4.0-beta
+}}}
+'''b-''' If you have already created a unit selection voice for MARY, with the CMU-ARCTIC-SLT data, and want to build a HMM-based voice for that,
+copy the $MARY_BASE/lib/hts/HTS-demo_for_MARY-4.0-beta.tar.gz (112K) in your unit selection voice creation directory and unpack the file:
+{{{
+   tar -zxvf HTS-demo_for_MARY-4.0-beta.tar.gz
+}}}
+If you have already created a unit selection voice for this data, most probably you have already created phonefeatures, phonelab and a mary/features.txt file for that, so you can run steps 1-3, skip steps 4-11 and continue with section III HMM models training.
+'''c-''' If you want to create a HMM voice in other language please see the section V or VI below.[[BR]]
+Once you have unpacked the HTS demo for MARY 4.0 beta, follow the steps:
+'''1-''' Check if all the required programs (and versions) are available in your system, you can run the shell script:
+{{{
+./check_programs.sh
+}}}
+This is a simple shell script that will check which programs are available in the PATH and report what is missing. You can provide the paths where you have installed the  required programs if they are not found in the PATH. The script will check minimal requirements for programs, versions, suggest how to install missing programs, etc.
+If all the necessary programs are installed correctly you can continue with step 2.
+'''2-''' Run the Voice Import Tools program
+The Voice Import Tools programs can be started from: '''Applications -> OpenMary -> Voice import tools'''[[BR]]
+When starting the voice import tools, go to your working directory (the directory where you have unpacked the HTS demo for MARY 4.0 beta) and provide information for:
+{{{
+  db.gender    = female
+  db.locale    = en_US
+  db.marybase  = /path/to/$MARY_BASE/
+  db.voicename = slt-hsmm
+}}}
+If you are not familiar or have problems with the Voice Import Tools program, please read the instructions in the Tutorial: [http://mary.opendfki.de/wiki/VoiceImportToolsTutorial VoiceImportToolsTutorial]
+Please remember that whenever you are in doubt about the settings of a particular component you can check its corresponding help for a description of the meaning
+(and possible values) of each variable.
+After starting the Voice Import Tools check the global settings of the voice, make sure that the allophones file is provided and exist:
+'''
+== III) Check data: audio and text files[[BR]] ==
+'''
+In your voice building directory execute the step-by-step procedure in [http://mary.opendfki.de/wiki/VoiceImportToolsTutorial VoiceImportToolsTutorial] to make
+sure that the data, sound (wav) and text files are in the correct place and format.[[BR]]
+As a result of this step your voice building directory should contain a wav and text directories.
+'''
+== IV) Run the Voice Import tools ==
+'''
+In your voice building directory run the voice import tools:
+{{{
+export MARY_BASE="/your/path/to/MARY TTS/"
+java -Xmx1024m -jar $MARY_BASE/java/voiceimport.jar
+}}}
+After starting the Voice Import Tools check the global settings of the voice, make sure that the allophones file is provided and exists:
 {{{
 db.alophonesSet = $MARY_BASE/lib/modules/xx/lexicon/allophones.xx.xml  (where xx is the corresponding language)
 }}}
+'''3-''' Run the HMMVoiceDataPreparation of the HMM Voice Trainer group to check if text, wav or raw files are available and in the correct paths.
+If just raw is provided, the program will do the conversion.  If no text files are available but utts in festival format, the program will do the conversion as well.
+'''4-''' Run the AllophonesExtractor of the Automatic Labeling group to create the '''prompt_allophones''' directory required in the next step. This component requires the MARY server. [[BR]]
+'''5-''' Run the EHMMlabeler component of the Automatic Labeling group to label automatically the wav files using the corresponding transcriptions. This procedure might
+And run the following components:
+'''1-''' Run the HMMVoiceDataPreparation of the HMM Voice Trainer group to set up the environment to create a HMM voice and check if required external programs and text and wav files are available and in the correct paths.
+'''2-''' Run the AllophonesExtractor of the Automatic Labeling group to create the '''prompt_allophones''' directory required in the next step. This component requires the MARY server. [[BR]]
+'''3-''' Run the EHMMlabeler component of the Automatic Labeling group to label automatically the wav files using the corresponding transcriptions. This procedure might
 take several hours. For running EHMMLabeler, please use the settings editor of this component to set, according to your festvox installation, the variable:
 {{{
    EHMMLabeler.ehmm  = ../festvox/src/ehmm/bin/
 }}}
+The result of this step is a '''ehmm/lab''' directory.
+'''4-''' Run the LabelPauseDeleter component of the Automatic Labeling group. Please use the settings editor of this component to set the variable:
+{{{
+   LabelPauseDeleter.threshold  =  10
+}}}
 The result of this step is a '''lab''' directory.
+'''6-''' Run the LabelPauseDeleter component of the Automatic Labeling group. Please use the settings editor of this component to set the variable:
+{{{
+   LabelPauseDeleter.threshold  =  10
+}}}
+'''7-''' Run the TranscriptionAligner component of the Label-Transcript Alignment group.  This program will create the '''allophones''' directory.
+'''8-''' Run the PhoneUnitLabelComputer component of the Label-Transcript Alignment group. This procedure has as input the '''lab''' directory created with the EHMMLabeler and will create as an output the  '''phonelab''' directory.
+'''9-''' Run the FeatureSelelection component of the Feature Extraction group. This program will create a '''mary/features.txt''' file, it requires the MARY server running. Select here all the features and save the file.
+'''10-''' Run the PhoneUnitFeatureComputer component of the Feature Extraction group to extract context feature vectors from the text data. This procedure will create a '''phonefeatures''' directory. For running this component the MARY server should be running as well.
+'''11-''' Run the PhonelabelFeatureAligner component of the Verify Alignment group. This procedure will verify alignment between "phonefeatures" and "phonelabels".[[BR]]
+'''5-''' Run the TranscriptionAligner component of the Label-Transcript Alignment group.  This program will create the '''allophones''' directory.
+'''6-''' Run the PhoneUnitLabelComputer component of the Label-Transcript Alignment group. This procedure has as input the '''lab''' directory and will create as an output the  '''phonelab''' directory.
+'''7-''' Run the FeatureSelelection component of the Feature Extraction group. This program will create a '''mary/features.txt''' file, it requires the MARY server running. Select here all the features and save the file.
+'''8-''' Run the PhoneUnitFeatureComputer component of the Feature Extraction group to extract context feature vectors from the text data. This procedure will create a '''phonefeatures''' directory. For running this component the MARY server should be running as well.
+'''9-''' Run the PhonelabelFeatureAligner component of the Verify Alignment group. This procedure will verify alignment between "phonefeatures" and "phonelabels".[[BR]]
 As a result of steps 1-11 we should have:[[BR]]
 …
 - phonelab directory [[BR]]
 - mary/features.txt file [[BR]]
+'''
+=== III) HMM models training: ===
+'''
+'''12-''' Run the HMMVoiceConfigure component of the HMM Voice trainer group. The default setting values of this component are already fixed for the HTS-demo_CMU-ARCTIC-SLT voice, although some setting depends on your installation, please provide paths for:
+{{{
+  HMMVoiceConfigure.htsPath       = /yourpath/htk-hts2.1/bin
+  HMMVoiceConfigure.htsEnginePath = /yourpath/hts_engine_API-1.01/bin
+  HMMVoiceConfigure.sptkPath      = /yourpath/SPTK-3.2/bin
+  HMMVoiceConfigure.tclPath       = /yourpath/ActiveTcl-8.6/bin
+  HMMVoiceConfigure.soxPath       = /yourpath/usr/bin
+}}}
+- MARY_BASE/external/externalPaths.txt
+'''
+=== HMM models training: ===
+'''
+'''10-''' Run the HMMVoiceConfigure component of the HMM Voice trainer group. The default setting values ar already fixed for the arctic slt voice, some setting depends on your installation, and willbe taken from MARY_BASE/external/externalPaths.txt
 If running configure for other voice, for example a male German voice, please use the settings editor of this component to set the variables:
 …
 Using the settings editor of this component you can also change other variables like using LSP instead og MGC, sampling frequency, etc., the same as you would do when running "make configure + parameters" with the original HTS scripts.
 '''13-''' Run the HMMVoiceFeatureSelection component of the HMM Voice trainer group. This program reads the '''mary/features.txt''' file (created in step 11), and generates the file '''mary/hmmFeatures.txt'''. This file contains extra features, apart from phone and phonological features, that will be used to train HMMs. When running this program a small set of features will be presented on top, separated by an empty line:[[BR]]
+'''11-''' Run the HMMVoiceFeatureSelection component of the HMM Voice trainer group. This program reads the '''mary/features.txt''' file (created in step 11), and generates the file '''mary/hmmFeatures.txt'''. This file contains extra features, apart from phone and phonological features, that will be used to train HMMs. When running this program a small set of features will be presented on top, separated by an empty line:[[BR]]
 {{{
    pos_in_syl
 …
 If you are not sure about using other features, use the first four, delete the others and save the file.
 '''14-''' Run the HMMVoiceMakeData component of the HMM Voice trainer group to run the HTS procedure "make data". This procedure require the following files:
+'''12-''' Run the HMMVoiceMakeData component of the HMM Voice trainer group to run the HTS procedure "make data". This procedure require the following files:
 {{{
    HMMVoiceMakeData.allophonesFile   = allophones.en_US.xml  # allophones set (language dependent)
 …
 '''15-''' Run the HMMVoiceMakeVoice component of the HMM Voice trainer group, here again particular training steps can be repeated selecting them (setting in 1, all the others in 0) from the settings of this component. This is equivalent to run again:
+'''13-''' Run the HMMVoiceMakeVoice component of the HMM Voice trainer group, here again particular training steps can be repeated selecting them (setting in 1, all the others in 0) from the settings of this component. This is equivalent to run again:
 {{{
    perl scripts/Training.pl scripts/Config.pm > logfile &
 …
 '''
 === IV) Adding a new voice in the MARY platform: ===
 '''
 '''16-''' Run the HMMVoiceInstaller component of the Install Voice group. The default setting values of this component are already fixed for the HTS-demo_CMU-ARCTIC-SLT voice. Some settings of the voice can be changed here, for example:
+=== Adding a new voice in the MARY platform: ===
+'''
+'''14-''' Run the HMMVoiceInstaller component of the Install Voice group. The default setting values of this component are already fixed for the HTS-demo_CMU-ARCTIC-SLT voice. Some settings of the voice can be changed here, for example:
 {{{
   HMMVoiceInstaller.useMixExc   =  true
 …
 '''
+=== V) Creating other voice in German ===
+'''
+- If you are creating the HMM-based voice for German from scratch it will be necessary: [[BR]]
+  * NLP components for German, those should be available with MARY 4.0
+  * a wav or raw directory with the speech files you will use for training the German voice. [[BR]]
+  * transcriptions of the files, one text file per speech file, or transcriptions in festival format if available. [[BR]]
+then copy the $MARY_BASE/lib/hts/HTS-demo_for_MARY-4.0-beta.tar.gz file in the directory where you have your wav and transcription data and unpack the file:
+{{{
+   tar -zxvf HTS-demo_for_MARY-4.0-beta.tar.gz
+}}}
+Once you have unpacked the HTS demo for MARY 4.0 beta, follow the instructions as normal from step 1. Provide general settings for:
+{{{
+   db.gender    =  male  (or female)
+   db.locale    =  de
+   db.marybase  =  /path/to/mary/base/
+   db.voicename =  german_voice
+}}}
+- If you have already created a German unit selection voice for MARY and want to build a HMM-based voice for that, copy the $MARY_BASE/lib/hts/HTS-demo_for_MARY-4.0-beta.tar.gz (112K) in your unit selection voice creation directory and unpack the file:
+{{{
+   tar -zxvf HTS-demo_for_MARY-4.0-beta.tar.gz
+}}}
+If you have already created a unit selection voice for German, most probably you have already created phonefeatures, phonelab and a mary/features.txt file for that, so you can run steps 1-3, skip steps 4-11 and continue with section III HMM models training.
+'''
+=== VI) Creating other voice in a language different from German or English (US). ===
+=== V) Creating other voice in a language different from German or English (US). ===
 '''
 …
 - After creating the minimal components, you will need wav files (in a wav directory) and the corresponding transcriptions (one file per wav file in a text directory). [[BR]]
+Then copy the $MARY_BASE/lib/hts/HTS-demo_for_MARY-4.0-beta.tar.gz file in the directory where you have your wav and transcription data and unpack the file:
+{{{
+   tar -zxvf HTS-demo_for_MARY-4.0-beta.tar.gz
+}}}
+Once you have unpacked the HTS demo for MARY 4.0 beta, follow the instructions as normal from step 1. Provide general settings for:
+Afterwards follow the instructions as normal from step 1. Provide general settings for:
 {{{
    db.gender    =  male  (or female)