Context Navigation

Changes between Version 17 and Version 18 of FrequentlyAskedQuestions

Timestamp:: 12/08/09 19:59:42 (16 years ago)
Author:: masc01
Comment:: updated questions

Legend:

: Unmodified
: Added
: Removed
: Modified

FrequentlyAskedQuestions

-                      v17
+                      v18
 = Frequently Asked Questions =
-'''Can I install additional voices/components after the main installation, or do I have to reinstall everything "from scratch"?'''
+You can run the installer again and select only the new packages that you wish to install. Make sure you specify the same installation directory as before. This should work OK, maybe with the exception of link creation and uninstaller creation.
+=== Can I install additional voices/components after the main installation, or do I have to reinstall everything "from scratch"? ===
+It would of course be nicer to have an integrated update manager -- see ticket:8. Help is welcome! :-)
+To install or uninstall additional languages and voices, please run the MARY TTS/bin/mary-component-installer.
+'''What exactly is the license for the software?'''
+=== Maryserver takes ages to start up! What can I do? ===
+The core OpenMary system, as released on this development page, is distributed under a very liberal BSD-style license which basically allows you to do anything you want with the code provided that you acknowledge where you have it from: http://mary.dfki.de/download/MARY%20software%20user%20agreement.html Scientific publications based on MARY are requested to cite the MARY reference paper Schröder & Trouvain (2003).
+The German language modules as well as the English part-of-speech tagger is released in binary form, under a research license: http://mary.dfki.de/download/DFKI%20MARY%20software%20user%20agreement.html You must not use this code in a commercial setup unless you obtain a separate license from DFKI, and there are other restrictions. Do read the license agreement carefully when you use the German component.
+The MBROLA binaries and voices, finally, are distributed with MARY because that is allowed by the MBROLA license: http://mary.dfki.de/download/Mbrola%20software%20user%20agreement.html These can only be used in a non-commercial, non-military setting.
+'''How difficult is it to add support for Hebrew/Italian/Spanish/Hindi/...? Is Mary modular in that sense?'''
+Mary is very modular, and a number of modules exist in a language-independent and configurable implementation, but there is still enough work left to do.
+For many languages, you could start with the existing MBROLA diphone voices:
+http://tcts.fpms.ac.be/synthesis/mbrola/mbrcopybin.html
+You would then need at least the following MARY TTS modules:
+ * needed: a Tokeniser, cutting the input into sentences and tokens (it may be possible to re-use source:trunk/java/de/dfki/lt/mary/modules/JTokeniser.java for a number of languages)
+ * optional: a text normalisation which expands numbers, abbreviations  etc. into a pronounceable form (but that can be left out at the beginning)
+ * optional: a part-of-speech tagger, distinguishing at least between content words and function words
+ * crucially needed: a phonemiser, converting the input text into sound symbols, e.g. in SAMPA. This can be based on rules for some languages (probably, Spanish), but a pronounciation lexicon is required for others when the link between spelling and pronounciation is less regular. Then, also, the lexicon must be complemented with "letter-to-sound" rules for unknown words.
+ * optional: a prosody assignment module, predicting e.g. ToBI labels based on part-of-speech and other information.
+source:trunk/java/de/dfki/lt/mary/modules/ProsodyGeneric.java, written by my student Stephanie Becker, may be a good place to start.
+ * needed: a duration assignment module, predicting phone durations. As a very first start, the Klatt rules as currently used in the Tibetan language component: source:trunk/java/de/dfki/lt/mary/modules/tib/KlattDurationModeller.java
+could be used, of course adapted to the language-specific phoneme set.
+ * optional: an intonation contour realisation module. For example, there is a generic source:trunk/java/de/dfki/lt/mary/modules/TobiContourGenerator.java that can be used for different languages by writing appropriate config files.
+ * needed: synthesis, e.g. using MBROLA voices.
+So, in summary, for adding a new language, you most crucially need a
+phonemiser, and you need to get at least a tokeniser and a duration
+assigner to work. Assuming that there is already an acceptable MBROLA
+voice for your language.
+On the bright side, as data representation is based on Unicode, there
+should be no problem with non-European scripts.
+What takes time is starting components and, in particular, unit selection voices. So the more languages and voices you install, the longer maryserver will take to start up. Use the mary-component-installer to uninstall what you don't need.
+'''Unfortunately, I'm just a C++ programmer and have no experience with Java. I have made some changes in the Mary source code -- how I can compile and test my changes?'''
+=== What exactly is the license for the software? ===
+The OpenMary core system is released under the Lesser GNU General Public License [LGPL|http://www.gnu.org/licenses/lgpl-3.0-standalone.html]. Language components for English, German, Telugu and Turkish are currently also released under the LGPL.
+Different speech synthesis voices are distributed under different licenses:
+ * the [Arctic license|http://mary.dfki.de/download/voices/arctic-license.html]
+ * the [Creative Commons Attribution-NoDerivatives license|http://mary.dfki.de/download/by-nd-3.0.html]
+ * The [MBROLA license|http://mary.dfki.de/download/Mbrola%20software%20user%20agreement.html]
+ * maybe other licenses in the future.
+The installer should show you the respective license for a component you select. You must agree to a license before you can install and use a component.
+make sure you have a recent (>= 1.4) Java SDK installed (try "javac
+=== How difficult is it to add support for Hebrew/Italian/Spanish/Hindi/...? Is Mary modular in that sense? ===
+MARY 4.0 comes with substantial support for adding new languages, see NewLanguageSupport. This should help you get started; to do intelligent language-specific things, of course additional work will be needed.
+=== I have no experience with Java. I have made some changes in the Mary source code -- how I can compile and test my changes? ===
+make sure you have a recent (>= 1.5) Java SDK installed (try "javac
 -version" on the command line). If you don't have it, get it from
 http://java.sun.com. You also need the tool "ant", which you can get
 …
 with the OpenMary source package.
-'''First Start - Mbrola Jni Caller - EXCEPTION_ACCESS_VIOLATION'''
+(Using Windows and eclipse)
+If your Mary-Server doesn't start and your logfile shows:
+=== Will there be support for the open 'ogg vorbis' format for audio output? ===
+{{{
+...
+-07-06 20:31:12,635 [main] INFO  MbrolaSynthesizer Starting my own MbrolaCaller
+(de.dfki.lt.mary.modules.MbrolaJniCaller)
+#
+# An unexpected error has been detected by HotSpot Virtual Machine:
+#
+#  EXCEPTION_ACCESS_VIOLATION (0xc0000005) at pc=0x00000000, pid=3320, tid=3324
+#
+# Java VM: Java HotSpot(TM) Client VM (1.5.0_04-b05 mixed mode, sharing)
+# Problematic frame:
+# C  0x00000000
+#
+# An error report file with more information is saved as hs_err_pid3320.log
+#
+# If you would like to submit a bug report, please visit:
+#   http://java.sun.com/webapps/bugreport/crash.jsp
+#
+}}}
+(or similar), copy the files mbrola.dll and MbrolaJNI.dll to your system-directory (e.g. C:\Windows\System32).
+'''Will there be support for the open 'ogg vorbis' format for audio output?'''
+'''What are the requirements for MARY with ubuntustudio w/o online connection?'''
+ * ubuntustudio 7.04 comes without a java compiler - which java (gij/gcj, the jre6 from sun.com, ...) is needed?
+ * Is the online connection a requirement for the mary installer? what if it is not available -- is the sources package an option?
+If someone writes a reliable ogg vorbis encoder in Java, we will be happy to add support for it. We do not intend to use native libraries though, the deployment issues are simply too complex and time-consuming.