This page describes how to download and install a recognition server based on
our prototype system that integrates with the DARPA Communicator client/server
architecture. This server decodes audio data and displays the resulting
hypothesis in a text window. A demo program is included that makes it easy to
get started.
There are five easy steps to Communicator bliss:
In order to run this server, one must have previously installed the DARPA Communicator system and be somewhat familiar with its operation. Step 2: Install the prototype system Download the latest version of the ISIP prototype system in which we provide an integrated API for the DARPA Communicator system. The system can be easily installed following the steps below:
Step 3: Install the recognition server demo Once the DARPA Communicator and ISIP prototype systems have been successfully installed, we can download the recognition server demo (Solaris x-86 and linux) or recognition server demo (Solaris sparc) and proceed with building the server program:
Before compiling the recognition server, make sure you have sourced the ISIP environment (source ISIP_ENV.sh) to establish the proper run-time environment for the prototype system. Step 4: Run the application demo Steps 1 to 3 above must have been successfully completed for the next step to work. The recognition demo program can now be invoked as follows:
Now you are finally ready to run an audio server that will send data to the recognizer in batch mode:
Step 5: Modify the hub program file The demo_pgm.text can be easily modified to operate in your local environment: ;; ------------------------------------------------- ;; SERVERS ;; ------------------------------------------------- SERVER: recognize_serv ;; HOST: 130.18.6.111 HOST: localhost PORT: 12346 OPERATIONS: reinitialize recognize_speech ;; ------------------------------------------------- ;; CONDITIONS (default order) ;; ------------------------------------------------- RULE: :binary_port & :binary_host & :call_id --> recognizer_speech IN: :binary_port :binary_host :call_id OUT: none! The tag recognize_speech handles incoming audio broker requests, decodes the received audio data and outputs the hypothesis. Currently, the recognizer expects the broker to send GAL_INT_16 or GAL_BINARY data and expects the audio server to send the following control messages of type GAL_STRING:
Success! Kick back, smoke a victory cigar, and recite George Peppard s famous line from the A-Team: "I love it when a plan comes together." |