Changes between Version 13 and Version 14 of VoiceImportToolsTutorial


Ignore:
Timestamp:
09/18/07 20:32:16 (17 years ago)
Author:
sach01
Comment:

--

Legend:

Unmodified
Added
Removed
Modified
  • VoiceImportToolsTutorial

    v13 v14  
    157157database.config contains the values of the settings - you can change the settings also in this file, but be aware that  
    158158this may cause problems.  
     159 
    159160  
    160161Simplest way of Using Voice Import Components:  
     
    175176 
    176177 
    177  
    178178 == Explanation on Individual Voice Import Components == 
    179179 
    180 '''Feature Extraction from Acoustic Data''' 
     180 
     181 
     182== 1. Feature Extraction from Acoustic Data == 
     183 
    181184 
    182185'''!PraatPitchmarker'''[[BR]] 
     
    205208   
    206209 
    207 '''Support for Transcription Conversion''' 
     210== 2. Support for Transcription Conversion == 
     211 
    208212 
    209213 
     
    227231 
    228232 
    229 '''Feature Vector Extraction from Text Data''' 
     233 
     234== 3. Feature Vector Extraction from Text Data == 
     235 
    230236 
    231237'''!PhoneUnitFeatureComputer'''[[BR]] 
     
    255261 
    256262 
    257 '''Automatic Labeling''' 
     263 
     264== 4. Automatic Labeling == 
     265 
    258266 
    259267 
     
    327335 
    328336 
     337 
     338== 5. Label or Pause Correction and Label-Feature Alignment == 
     339  
     340 
     341 
    329342'''!LabelledFilesInspector'''[[BR]] 
    330343 
     
    370383 * labDir     - Half Phone Labels directory 
    371384 
    372 '''Basic Data Files''' 
     385 
     386== 6. Basic Data Files == 
     387 
    373388 
    374389Following components will create basic binary files, which contain whole voice database. So that it is easier and faster to access Database. These files are needed for various voice building steps and for synthesis.  
     
    401416 * mcepTimeline  - file containing all mcep files. Will be created by this module 
    402417 
    403 '''Building acoustic models''' 
     418 
     419 
     420== 7. Building acoustic models == 
     421 
    404422 
    405423 
     
    415433 
    416434 
     435'''!PhoneFeatureFileWriter'''[[BR]] 
     436 
     437It produces a file containing all the target cost features for the phone sized units. The module needs a file defining which features are to be used and what weights are given to them. They must be the same features as the ones that the !PhoneFeatureComputer used. If you do not have a feature definition, the module tries to create one. 
     438 
     439 
     440For more information, see the example file: Marybase/lib/modules/import/examples/PhoneUnitFeatureDefinition.txt 
     441 
     442Configuration Settings: 
     443 
     444 * featureDir  - directory containing the phone features 
     445 * featureFile - file containing all phone units and their target cost features.Will be created by this module 
     446 * unitFile    - file containing all phone units 
     447 * weightsFile - file containing the list of phone target cost features, their values and weights 
     448 
     449'''DurationCARTTrainer'''[[BR]] 
     450 
     451It builds an acoustic model of durations in the database using the program "wagon" from the Edinburgh Speech tools. 
     452 
     453Configuration Settings: 
     454 
     455 * durTree          - file containing the duration CART. Will be created by this module 
     456 * estDir           - directory containing the local installation of the Edinburgh Speech Tools 
     457 * featureDir       - directory containing the phonefeatures 
     458 * featureFile      - file containing all phone units and their target cost features 
     459 * labelDir         - directory containing the phone labels 
     460 * stepwiseTraining - "false" or "true" 
     461 * unitFile         - file containing all phone units 
     462 * waveTimeline     - file containing all wave files 
     463 
     464'''F0CARTTrainer'''[[BR]] 
     465 
     466It builds acoustic models of F0 like DurationCARTTrainer. It uses "wagon" and the files produced by !PhoneUnitfileWriter and !PhoneFeatureFileWriter. 
     467 
     468Configuration Settings: 
     469 
     470 * estDir           - directory containing the local installation of the Edinburgh Speech Tools 
     471 * f0LeftTreeFile   - file containing the left f0 CART. Will be created by this module 
     472 * f0MidTreeFile    - file containing the middle f0 CART. Will be created by this module 
     473 * f0RightTreeFile  - file containing the right f0 CART. Will be created by this module 
     474 * featureDir       - directory containing the phonefeatures 
     475 * featureFile      - file containing all phone units and their target cost features 
     476 * labelDir         - directory containing the phone label files 
     477 * stepwiseTraining - "false" or "true"  
     478 * unitFile         - file containing all phone units 
     479 * waveTimeline     - file containing all wave files 
    417480 
    418481