blob: db9c41429d7cfe0f9e9df06452524137066e3827 (
plain)
1
2
3
4
5
6
7
8
9
10
11
12
13
|
16khz American English male voice for the festival speech synthesis system.
This voice provides an American English male voice using a residual
excited LPC diphone synthesis method. It uses the CMU Lexicon
pronunciations. Prosodic phrasing is provided by a statistically
trained model using part of speech and local distribution of breaks.
Intonation is provided by a CART tree predicting ToBI accents and
an F0 contour generated from a model trained from natural speech.
The duration model is also trained from data using a CART tree.
This voice can be activated via (voice_kal_diphone)
-Julian Assange <proff@iq.org>
|