| | 1 | |
| | 2 | = Voice building for a new language = |
| | 3 | |
| | 4 | |
| | 5 | == 1. Download xml dump of wikipedia in your language == |
| | 6 | |
| | 7 | == 2. Extract clean text and most frequent words == |
| | 8 | |
| | 9 | == 3. Transcribe most frequent words == |
| | 10 | |
| | 11 | a. Create pronunciation dictionary and train letter-to-sound rules |
| | 12 | b. Minimal NLP components for the new language |
| | 13 | |
| | 14 | == 4. Run feature maker with the minimal nlp components == |
| | 15 | |
| | 16 | == 5. Database selection == |
| | 17 | |
| | 18 | select a phonetically/prosodically balanced recording script |
| | 19 | |
| | 20 | == 6. Manually check/correct transcription of all words in the recording script [Optional] == |
| | 21 | |
| | 22 | == 7. Record script with a native speaker using our recording tool "Redstart" == |
| | 23 | |
| | 24 | == 8. Build an unit selection and/or hmm-based voice with Voice import tool == |