Speech Synthesis Databases
In order to make building voices easier we offer speech synthesis databases which serve as examples to the techniques described in the festvox document.

General Databases

  • CMU ARCTIC, 7 single speaker speech databases with around 1200 phonetically balanced uttrances.
  • CMU FAF, 107 paragraphs (15,000 words) of single speaker monologues with interesting prosody. Basic of Aesop's fables and country descriptions in the CIA world fact book.
  • CMU SIN, speech in noise: speech recorded while noise is playing in the speakers ears (and when not).
  • CSTR US KED timit University of Edinburgh's male US TIMIT, 452 phonetically balanced utterances.

Limited Domain Databases

Diphone Databases

MBROLA voices and binaries (US mirror)

  • A US mirror of The MBROLA projects wide range of pre-built diphone databases for many languages and binaries for the mbrola program itself for many platforms.