API Docs

Introduction - What we do:

iSpeech, Inc. provides two types of services called Text-to-Speech (TTS) and Automated Speech Recognition (ASR). Text-to-Speech lets users send us text and we will provide thoses users with computer-generated speech. ASR allows users to submit audio data from a microphone or a file and we return a transcription of what was said.

Introduction - Types of services:

iSpeech provides TTS and ASR services on various mobile, computer, and web platforms. For example, we support:

Mobile	Desktop/Server	Web
iOs	Java	Javascript (TTS)
Android	.NET	Flash/Flex/Air
BlackBerry

Developers can build programs that use iSpeech services by using the correct iSpeech SDKs. iSpeech is capable of developing applications for paying customers who would rather pay us to create their app.

Getting access with API Keys:

iSpeech's services are reachable with the correct identification code called an Application Programming Interface (API) key. The key is a token to access our TTS and ASR services. Currently, iSpeech has 3 types of API keys with over 100 different options.

The three types of API Keys are:

Mobile Development
Mobile Production
Web/General API Production

The iSpeech mobile development server is used as a "sandbox" environment that you can use to test your application before releasing it to the public.

Mobile Development	Mobile Production	Web/Other Production
For development only	For public use	Required paid access
		For websites, desktop apps, servers

Building an Application - Mobile Platforms:

In order to build a mobile application that uses iSpeech, you should download the appropriate package by using the "Getting Started" menu. The download will contain an iSpeech SDK library, documentation, and source code for a sample program.

The sample applications must be opened in the appropriate integrated development environment (IDE). For BlackBerry and Android, you should use Eclipse. For iPhone, you should use XCode. The sample applications can be executed as-is without changing the API key and server type.

Once you get the sample application working, try changing your API key and the production flags. In the BlackBerry application, open the Main package and the iSpeechSample class and set the following two variables.

							public static String _APIKey = ""; // your api key, http://www.ispeech.org/developers
							
							public static boolean _Production = true; // false is "Mobile Development" and true is "Mobile Production"

Building an Application - Desktop/Server Platforms

The desktop/server SDKs for Java and .Net are only available on request. These SDKs are very useful for integrated voice recognition (IVR) phone systems. These SDKs require a general API key and are not compatible with mobile API keys.

You can create a general/web key under My Account -> My API keys. The web/desktop type API key requires credits which you must purchase.

Building an Application - Web Platforms

Many developers are interested in integrating TTS and ASR into websites. There are two main requirements for using iSpeech web services. The first requirement is that you submit the text for TTS or audio data for ASR to the iSpeech servers using the HTTP protocol. The second requirement is that you must have a way to receive and present the data to the user.

For TTS by default, you will receive an MP3 that you can play with flash and/or javascript. For ASR, you must use a browser plugin that has access to the microphone such as Flash or a Java applet.

Optimization

For the web platform, you can use HTTP GET requests rather than POST because it requires fewer packets. However, GET is limited by maximum URL length.

You can improve download speeds and save credits by downloading one sentence at a time in a playlist. The time that it takes to convert one sentences causes natural sounding short pause.

Troubleshooting

The frequently asked questions (FAQ) page is available to answer common questions. For other questions, you can ask your question by posting on the iSpeech Forums or with the support contact form .

Your application will require an internet connection. If there is a problem with your internet connection, iSpeech services will become unreachable.

Download SDKs

Blackberry

iOS

Android

Java

.NET

PHP

Flash

Python

Ruby

Perl

Speech-Enable Your Blackberry®:

iSpeech Text to Speech (TTS) and Speech Recognition (ASR) SDK for BlackBerry lets you Speech-enable any BlackBerry App quickly and easily with iSpeech Cloud. The SDK has a 50KB footprint and supports 27 TTS and ASR languages and 15 for free-form dictation voice recognition.

iSpeech Developer API key

The iSpeech Android SDK can be tested with this API key: "developerdemokeydeveloperdemokey".
The "developerdemokeydeveloperdemokey" API key has a 100 word limit. It's meant to be used in debug mode.
To get your own API key, you need to register.

Get Blackberry SDK from GitHub

Speech-Enable Your iPhone® or iPad:

iSpeech iOS SDK lets you add human quality Text to Speech (TTS) and Speech Recognition (ASR) to Speech-enable any app. iSpeech's Mobile SDKs support 27 TTS and ASR (defined grammar) languages and 15 languages for free-form dictation voice recognition. .

iSpeech Developer API key

Get iOS SDK from GitHub

Speech-Enable Your Android®:

iSpeech Text to Speech (TTS) and Speech Recognition (ASR) SDK for Android lets you Speech-enable any Android App quickly and easily with iSpeech Cloud. The SDK has a small footprint and supports 27 TTS and ASR languages and 15 for free-form dictation voice recognition.

iSpeech Developer API key

Get Android SDK from GitHub

Speech-Enable Your Java® Application:

iSpeech Text to Speech (TTS) and Speech Recognition (ASR) SDK for Java lets you Speech-enable any Java App quickly and easily with iSpeech Cloud. The SDK has a small footprint and supports 27 TTS and ASR languages and 15 for free-form dictation voice recognition.

iSpeech Developer API key

Get Java SDK from GitHub

Speech-Enable Your Java® Application:

Speech Text to Speech (TTS) and Speech Recognition (ASR) SDK for JavaScript lets you Speech-enable any JavaScript application quickly and easily with the iSpeech Cloud. The SDK is open source and supports 27 TTS and ASR languages and 15 for free-form dictation voice recognition.

iSpeech Developer API key

Get JavaScript Speech Synthesis SDK from GitHub Get JavaScript Speech Recognition SDK SDK from GitHub

Speech-Enable Your .NET® Application:

iSpeech Text to Speech (TTS) and Speech Recognition (ASR) SDK for .NET lets you Speech-enable any .NET App quickly and easily with iSpeech Cloud. The SDK has a small footprint and supports 27 TTS and ASR languages and 15 for free-form dictation voice recognition.

iSpeech Developer API key

Get .NET SDK from GitHub

Speech-Enable Your PHP® Application:

iSpeech Text to Speech (TTS) and Speech Recognition (ASR) SDK for PHP lets you Speech-enable any PHP App quickly and easily with iSpeech Cloud. The SDK has a small footprint and supports 27 TTS and ASR languages and 15 for freeform dictation voice recognition.

iSpeech Developer API key

Get PHP SDK from GitHub

Speech-Enable Your Flash® Application:

iSpeech Text to Speech (TTS) and Speech Recognition (ASR) SDK for the Flash lets you Speech-enable any Flash, Flex, or Air App quickly and easily with iSpeech Cloud. The SDK has a small footprint and supports 27 TTS and ASR languages and 15 for freeform dictation voice recognition.

iSpeech Developer API key

Get Flash SDK from GitHub

Speech-Enable Your Python® Application:

iSpeech Text to Speech (TTS) and Speech Recognition (ASR) SDK for Python lets you Speech-enable any Python App quickly and easily with iSpeech Cloud. The SDK has a small footprint and supports 27 TTS and ASR languages and 15 for freeform dictation voice recognition.

iSpeech Developer API key

Get Python SDK from GitHub

Speech-Enable Your Ruby® Application:

iSpeech Text to Speech (TTS) and Speech Recognition (ASR) SDK for Ruby lets you Speech-enable any Ruby App quickly and easily with iSpeech Cloud. The SDK has a small footprint and supports 27 TTS and ASR languages and 15 for freeform dictation voice recognition.

iSpeech Developer API key

Get Ruby SDK from GitHub

Speech-Enable Your Perl® Application:

iSpeech Text to Speech (TTS) and Speech Recognition (ASR) SDK for Perl lets you Speech-enable any Perl App quickly and easily with iSpeech Cloud. The SDK has a small footprint and supports 27 TTS and ASR languages and 15 for freeform dictation voice recognition.

iSpeech Developer API key

Get Perl SDK from GitHub

Technical Support

Inquiry Form

Faq

Why Text-to-Speech as a service?

iSpeech's Text-to-Speech WebService puts the power of our TTS clusters anywhere, anytime. Our "zero-footprint Text-toSpeech" leaves traditional embedded solutions and TTS servers at a loss for words.

How fast is iSpeech?

iSpeech is as fast as you need it. iSpeech has a unique, patent-pending multi-threaded, multi-core method for running a single Text-to-Speech conversion. This allows us to attack one single Text-to-Speech conversion with an unlimited number of processors simultaneously. So a Text-to-Speech conversion that would normally take 30 minutes can be done in less than 30 seconds.

Don't want to deal with the overhead of audio files?

We can take care of that for you. Ask us about using our SAN to store and access your files. Our TTS Web Service automatically recognizes files that have already been converted by a unique hash and pulls them through the SAN, so nothing needs to be converted twice. iSpeech is completely efficient.

Want to try iSpeech, but don't want to commit?

All of our Text to Speech options are pay-as-you-go. We want to earn your business, and we'll never take you for granted. We even have a free, ad-supported TTS voice that you can try without paying. The free Text-to-Speech voice will help you get a feel for our service at no cost.

What if I need help implmeneting Text-to-Speech in your application?

The iSpeech Team will get you running TTS conversions fast. If you feel like our Text-to-Speech API is too much for you to handle, we also have a simple, easy to use, TTS conversion user interface. The UI allows for you to manually upload word documents, Adobe Acrobat PDFs, PowerPoint, HTML, RSS feeds, and more.

What if I need some functionality that iSpeech doesn't seem to have?

Ask us. If it would help you, chances are we will implmenet it in a hurry. If you need a custom Text-to-Speech solution, we will build it for you. iSpeech is here to help you speech-enable your website or application.

What is the largest Text-to-Speech conversion you guys can handle?

To optimize our servers, we have limited conversions to 100,000 words. If you need to convert more words, we suggest you break up a TTS conversion into separate parts. For instance, if you have a book convert the individual chapters separately. It also makes it easier to navigate and play the MP3 files. Chapters will act much like chapters on a DVD.