Version 2 (modified by sach01, 16 years ago) (diff) |
---|
Transcription Tool
MARY Transcription Tool, a graphical user interface, supports a semi-automatic procedure for transcribing new language text corpus and automatic training of Letter-to-sound(LTS) rules for that language. It stores all functional words in that language to build a simple POS tagger.
How to run?
Run below commands through Shell script:
export MARY_BASE="[PATH TO MARYBASE]" java -Xmx1024m -classpath $MARY_BASE/java:$MARY_BASE/java/mary-common.jar:\ $MARY_BASE/java/log4j-1.2.8.jar:$MARY_BASE/java/weka.jar:\ $MARY_BASE/java/mysql-connector-java-5.1.7-bin.jar\ -Djava.endorsed.dirs=$MARYBASE/lib/endorsed\ marytts.tools.transcription.TranscriptionGUI
Instructions for Transcription Tool
- Prepare phoneset for your language. For example, phoneset for English (locale: en_US): http://mary.opendfki.de/wiki/TranscriptionTool/allophones_en-US.xml
- Specify phoneset (allophones.xml) file using menu item 'specify phoneset'
- Load your data for transcription
(two options: 1. Open (MenuItem) 2. Load from MySql database (list of words as a table) ) - Save your data to a specified location. (then 'TrainPredict' button will be enable)
- If you transcribe some data, click 'trainpredict' button to predict transcription for remaining words. you have to manually correct predicted transcription.
- Description of colors:
- Black - Manually written or manually corrected
- Red - Transcription syntax is not permitted (eg: user written transcription is incorrect)
- LightGray - Predicted by 'TrainPredict' (predicted based on manual transcription)
Attachments (2)
- MySqlDetails.png (12.8 KB) - added by sach01 15 years ago.
- Transcription-Tool.png (73.7 KB) - added by sach01 15 years ago.
Download all attachments as: .zip