Cairo provides an open source speech resource server written entirely in the Java programming language. To achieve maximum compatibility with existing and future speech clients the Cairo server has been designed from the ground up to comply with the MRCPv2 standard.
The purpose of Cairo is not to duplicate functionality provided by existing open source speech projects such as FreeTTS and Sphinx, but rather to build upon them and provide additional functionality such as clustering, load balancing and failover support in order to meet the requirements necessary for use in enterprise scale deployments of speech/telephony applications.
There are four projects that make up Cairo.
The cairo-rtp library provides a simple Java API for supporting RTP audio streaming between a media source and a media sink.
This is the first release for the cairo-rtp project. This release provides the necessary RTP streaming capabilities required for implementing MRCPv2 clients and servers.
The cairo-sip library provides a simple Java API for supporting SIP/SDP message communication between MRCPv2 clients and servers.
The Session Initiation Protocol (SIP) and the Session Description Protocol (SDP) are critical elements of the Media Resource Control Protocol Version 2 (MRCPv2) standard as described in the MRCPv2 specification:
"MRCPv2 is not a "stand-alone" protocol - it relies on a session management protocol such as the Session Initiation Protocol (SIP) to establish the MRCPv2 control session between the client and the server, and for rendezvous and capability discovery. It also depends on SIP and SDP to establish the media sessions and associated parameters between the media source or sink and the media server."
This is the first release for the cairo-sip project. This release provides the necessary SIP and SDP capabilities required for implementing MRCPv2 clients and servers.
"Cairo sets out to provide an enterprise grade, Media Resource Control Protocol Version 2 (MRCPv2) compliant speech server solution utilizing existing open source speech resources.
Cairo v0.2 adds support for establishing sessions using Session Initiation Protocol (SIP) signalling and improves speech recognition performance by matching the voice models used by Sphinx to the sampling rate of the received audio data..." (more)
The latest version of Cairo can be downloaded here.
After installation of Cairo please see the Getting Started Guide for instructions on starting up the Cairo server processes and running the MRCPv2 client demos.