SYNTHETIC SPEECH - Inteligable, Repeatble & Reliable

 

Synthetic Speech

TTS, or Text-To-Speech, is the creation of audible speech from computer readable text.TTS is separate from speech recognition. You can think of TTS as “talking” and speech recognition as “listening”. There is some shared technology, but neither is just the reverse of the other. And the talking/listening analogy is limited too. Neither technology really involves much language understanding. People new to the idea of TTS often underestimate the difficulty of the task.

After all, humans can typically learn this stuff in early childhood. They talk, listen, understand, and even translate without much apparent effort. Humans do all this work without even being aware of it in most cases, but that doesn’t make it easy. If programmers could create software that really understands human language we could avoid most of the guesswork in TTS, but that hasn’t happened yet. Until then, TTS is more like learning to read a foreign language aloud without ever understanding the words.

With a good dictionary, grammar rules, etc. you can get better and better but will still make mistakes occasionally that are obvious to native speakers. TTS is often described as two conceptual stages. In the first stage, it decides how the text should be spoken, that is, how each word should be pronounced, what length and pitch each phoneme should have, etc.

We'll Make Sure Template Works For You

 

text to speech

In the second stage, the system does its best to create audio that matches the specifications produced by stage one. The type of TTS we do is called a ‘concatenated’ system, meaning that we record a human speaker to make a voice database. We re-use small chunks of the recordings to create new sentences containing words that were never recorded. Further, we do “unit selection” synthesis. This means that we use large voice databases and do clever searches on-the-fly to find chunks in the voice database that best match the requested sentences.

CONCATENATED PHRASE SPEECH

Here we take much larger sections of speech; words and phrases. Phrases are triggered by the computer to say whole words or phrases & to concatenate them together. this type of system can be heard on trains, busses etc. We have implemented this at Birmingham Int. Airport.

THE CHOICE

TTS can handle pretty well any word sent to it, however you can sometimes tell it is a computer, because it doesn’t always get the inflection right. Concatenated on the other hand is very lifelike. We normally recommend TTS because it requires little maintenance & is very ‘intelligible’.

Product Gallery

Services Overview

OpalPort is a subsidiary of MicroWatt. More information can be found at www.microwatt.co.uk

Contact Us

Address:The Plough Cottage, Station Road, South Cerney, Cirencester, Gloucestershire, GL7 5UE
Sales:+44 (0)1285 862220
E-mail: info@microwatt.co.uk