= Transcription Tool =
MARY Transcription Tool, a graphical user interface, supports a semi-automatic procedure for transcribing new language text corpus and automatic training of Letter-to-sound(LTS) rules for that language. It stores all functional words in that language to build a simple POS tagger.
== How to run? ==
Run below commands through Shell script:
{{{
export MARY_BASE="[PATH TO MARYBASE]"
java -Xmx1024m -classpath $MARY_BASE/java:$MARY_BASE/java/mary-common.jar:\
$MARY_BASE/java/log4j-1.2.8.jar:$MARY_BASE/java/weka.jar:\
$MARY_BASE/java/mysql-connector-java-5.1.7-bin.jar\
-Djava.endorsed.dirs=$MARYBASE/lib/endorsed\
marytts.tools.transcription.TranscriptionGUI
}}}
{{{
#!html
Instructions for Transcription Tool
- Prepare phoneset for your language. For example, phoneset for English (locale: en_US): http://mary.opendfki.de/wiki/TranscriptionTool/allophones_en-US.xml
- Specify phoneset (allophones.xml) file using menu item 'specify phoneset'
- Load your data for transcription
(two options: 1. Open (MenuItem) 2. Load from MySql database (list of words as a table) )
- Save your data to a specified location. (then 'TrainPredict' button will be enable)
- If you transcribe some data, click 'trainpredict' button to predict transcription for remaining words. you have to manually correct predicted transcription.
- Description of colors:
- Black - Manually written or manually corrected
- Red - Transcription syntax is not permitted (eg: user written transcription is incorrect)
- LightGray - Predicted by 'TrainPredict' (predicted based on manual transcription)
}}}