Version 2 (modified by sach01, 17 years ago)

Transcription Tool

The MARY Transcription Tool is a graphical user interface that supports a semi-automatic procedure for transcribing a text corpus in a new language and for automatically training letter-to-sound (LTS) rules for that language. It also stores the function words of the language, which are used to build a simple part-of-speech (POS) tagger.

How to run?

Run the following commands from a shell script:

export MARY_BASE="[PATH TO MARYBASE]"

java -Xmx1024m -classpath $MARY_BASE/java:$MARY_BASE/java/mary-common.jar:\
$MARY_BASE/java/log4j-1.2.8.jar:$MARY_BASE/java/weka.jar:\
$MARY_BASE/java/mysql-connector-java-5.1.7-bin.jar \
 -Djava.endorsed.dirs=$MARY_BASE/lib/endorsed \
 marytts.tools.transcription.TranscriptionGUI
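
The classpath in the command above is easy to mistype, so it may help to assemble it in a loop and to check that the jar files are present before launching the GUI. The following is only a sketch: the jar names are taken from the command above, while the helper names build_mary_classpath and missing_mary_jars are hypothetical and not part of MARY.

```shell
#!/bin/sh
# Jar names copied from the java command above.
# The helper function names below are hypothetical, not part of MARY.
MARY_JARS="mary-common.jar log4j-1.2.8.jar weka.jar mysql-connector-java-5.1.7-bin.jar"

# Print the classpath for the MARY base directory given as $1.
build_mary_classpath() {
    base="$1"
    classpath="$base/java"
    for jar in $MARY_JARS; do
        classpath="$classpath:$base/java/$jar"
    done
    printf '%s\n' "$classpath"
}

# Print every expected jar that is missing under $1/java.
# Returns 0 if all jars are present, 1 otherwise.
missing_mary_jars() {
    base="$1"
    rc=0
    for jar in $MARY_JARS; do
        if [ ! -f "$base/java/$jar" ]; then
            printf 'missing: %s\n' "$base/java/$jar"
            rc=1
        fi
    done
    return "$rc"
}
```

With MARY_BASE exported as shown earlier, the output of build_mary_classpath "$MARY_BASE" could be passed to the -classpath option, and missing_mary_jars "$MARY_BASE" could be run first to catch an incorrect base path.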

Instructions for Transcription Tool

Prepare a phoneset for your language. For example, the phoneset for English (locale: en_US) is available at http://mary.opendfki.de/wiki/TranscriptionTool/allophones_en-US.xml
Specify the phoneset file (allophones.xml) using the menu item 'specify phoneset'.
Load your data for transcription.
(Two options: 1. the 'Open' menu item; 2. load from a MySQL database, where the list of words is stored as a table.)
Save your data to a specified location. The 'TrainPredict' button is then enabled.
Once you have transcribed some of the data, click the 'TrainPredict' button to predict transcriptions for the remaining words. You still have to correct the predicted transcriptions manually.
Description of colors:
- Black - Manually written or manually corrected
- Red - Transcription syntax is not permitted (e.g. a transcription entered by the user is incorrect)
- LightGray - Predicted by 'TrainPredict' (prediction based on the manual transcriptions)
