Context Navigation

Changes between Version 13 and Version 14 of VoiceImportToolsTutorial

Timestamp:: 09/18/07 20:32:16 (18 years ago)
Author:: sach01
Comment:: --

Legend:

: Unmodified
: Added
: Removed
: Modified

VoiceImportToolsTutorial

-                      v13
+                      v14
 database.config contains the values of the settings - you can change the settings also in this file, but be aware that
 this may cause problems.
 Simplest way of Using Voice Import Components:
 …
  == Explanation on Individual Voice Import Components ==
+'''Feature Extraction from Acoustic Data'''
+== 1. Feature Extraction from Acoustic Data ==
 '''!PraatPitchmarker'''[[BR]]
 …
+'''Support for Transcription Conversion'''
+== 2. Support for Transcription Conversion ==
 …
+'''Feature Vector Extraction from Text Data'''
+== 3. Feature Vector Extraction from Text Data ==
 '''!PhoneUnitFeatureComputer'''[[BR]]
 …
+'''Automatic Labeling'''
+== 4. Automatic Labeling ==
 …
+== 5. Label or Pause Correction and Label-Feature Alignment ==
 '''!LabelledFilesInspector'''[[BR]]
 …
  * labDir     - Half Phone Labels directory
+'''Basic Data Files'''
+== 6. Basic Data Files ==
 Following components will create basic binary files, which contain whole voice database. So that it is easier and faster to access Database. These files are needed for various voice building steps and for synthesis.
 …
  * mcepTimeline  - file containing all mcep files. Will be created by this module
+'''Building acoustic models'''
+== 7. Building acoustic models ==
 …
+'''!PhoneFeatureFileWriter'''[[BR]]
+It produces a file containing all the target cost features for the phone sized units. The module needs a file defining which features are to be used and what weights are given to them. They must be the same features as the ones that the !PhoneFeatureComputer used. If you do not have a feature definition, the module tries to create one.
+For more information, see the example file: Marybase/lib/modules/import/examples/PhoneUnitFeatureDefinition.txt
+Configuration Settings:
+ * featureDir  - directory containing the phone features
+ * featureFile - file containing all phone units and their target cost features.Will be created by this module
+ * unitFile    - file containing all phone units
+ * weightsFile - file containing the list of phone target cost features, their values and weights
+'''DurationCARTTrainer'''[[BR]]
+It builds an acoustic model of durations in the database using the program "wagon" from the Edinburgh Speech tools.
+Configuration Settings:
+ * durTree          - file containing the duration CART. Will be created by this module
+ * estDir           - directory containing the local installation of the Edinburgh Speech Tools
+ * featureDir       - directory containing the phonefeatures
+ * featureFile      - file containing all phone units and their target cost features
+ * labelDir         - directory containing the phone labels
+ * stepwiseTraining - "false" or "true"
+ * unitFile         - file containing all phone units
+ * waveTimeline     - file containing all wave files
+'''F0CARTTrainer'''[[BR]]
+It builds acoustic models of F0 like DurationCARTTrainer. It uses "wagon" and the files produced by !PhoneUnitfileWriter and !PhoneFeatureFileWriter.
+Configuration Settings:
+ * estDir           - directory containing the local installation of the Edinburgh Speech Tools
+ * f0LeftTreeFile   - file containing the left f0 CART. Will be created by this module
+ * f0MidTreeFile    - file containing the middle f0 CART. Will be created by this module
+ * f0RightTreeFile  - file containing the right f0 CART. Will be created by this module
+ * featureDir       - directory containing the phonefeatures
+ * featureFile      - file containing all phone units and their target cost features
+ * labelDir         - directory containing the phone label files
+ * stepwiseTraining - "false" or "true"
+ * unitFile         - file containing all phone units
+ * waveTimeline     - file containing all wave files