Dictation is an assistive technology (AT) tool that can help kids who struggle with writing. You may hear it referred to as “speech-to-text,” “voice-to-text,” “voice recognition” or “speech recognition” technology. Kids can use dictation to write with their voices, instead of writing by hand or with a keyboard. Welcome to the HP Technology Center. In today’s world, knowledge is power. The HP technology center provides information on leading-edge technologies empowering you to use your HP product more effectively. Understanding the technologies that enable your PC to perform is important.
- Speech Technology Magazine
- Download Speech Technology Center Driver License Test
- Download Speech Technology Center Driver Windows 7
- Corpora, data sets and synthetic voices
- fda database - for evaluating pitch determination algorithms
- SVitchboard 1 - small vocabulary tasks from Switchboard 1
- MOCHA-TIMIT - acoustic + articulatory recordings
- Eustace - corpus for investigating durational effects in speech
- mngu0 - corpus of multimodal articulatory data for one British English male speaker
- REHASP 0.5 - single-speaker corpus of repeated Harvard sentence prompts
- NAM corpus - recordings of non-audible murmured speech
- VCTK corpus - from the Voice Clone Toolkit, comprising 109 native speakers of English
- Southern British Female Voice Package - commercial voice package for use with Festival/HTS
- Trained models for ASR or TTS
- Articulatory feature classifier trained on 2000 hours of telephone speech
- GlobalPhone MLPs - phoneme and articulatory feature classifiers trained on several languages
- Software, tools and pronunciation dictionaries
- The Combilex Lexicon, for General American or Received Pronunciation
- Outputs from collaborative projects
- Blizzard Challenge data and tools
- Hurricane Challenge data
- EMIME - cross-lingual speaker-adaptive speech synthesis
- SSPNet resources for social signal processing
- Simple4All - building speech synthesis systems with very limited resources
- How-to videos for the Lexus driver, for Warranty Service Guides, Lexus Enform and more.
- Explore your possibilities with Milestone’s industry-leading video management software. XProtect VMS helps protect your business.
Festival offers a general framework for building speech synthesis systems as well as including examples of various modules. As a whole it offers full text to speech through a number APIs: from shell level, though a Scheme command interpreter, as a C++ library and an Emacs interface. Festival is multi-lingual (currently English (British and American), and Spanish) though English is the most advanced. Other groups release new languages for the system. And full tools and documentation for build new voices are available through Carnegie Mellon's FestVox project (http://festvox.org)
The system is written in C++ and uses the Edinburgh Speech Tools Library for low level architecture and has a Scheme (SIOD) based command interpreter for control. Documentation is given in the FSF texinfo format which can generate, a printed manual, info files and HTML.
Festival is free software. Festival and the speech tools are distributed under an X11-type licence allowing unrestricted commercial and non-commercial use alike.
This distribution includes:
- Full English (British and American English) text to speech
- Full C++ source for modules, SIOD interpreter, and Scheme library
- Lexicon based on CMULEX and OALD (OALD is restricted to non-commercial use only)
- Edinburgh Speech Tools, low level C++ library
- Full documentation (html, postscript and GNU info format)
Note there are some licence restrictions on the voices themselves. The US English voices have the same restrictions as Festival. The UK lexicon (OALD) is restriction to non-commercial use.
Addition voices are also available.
Festival version 2.4 sources, voices
In Europe: http://www.cstr.ed.ac.uk/downloads/festival/2.4
In North America: http://festvox.org/festival
Speech Technology Magazine
Requirements
To run Festival you need:
- A Unix-like environment, e.g Linux, FreeBSD, OSX, cygwin under Windows.
- A C++ compiler: we have used GCC versions up to 4.
- GNU Make any recent version.
Download Speech Technology Center Driver License Test
New from 2.1
Download Speech Technology Center Driver Windows 7
- Support for the new versions of C++ that have been released
- Integrated and updated support for HTS, Clustergen, Multisyn and Clunits voices
- 'Building Voices in Festival' document describing process of building new voices in the system http://festvox.org/
Alan W Black (CMU)
Rob Clark (Edinburgh)
Junichi Yamagishi (Edinburgh)
Keiichiro Oura (Nagoya)