Package Summary
This package is a simple wrapper around the pocketsphinx speech recognizer, using gstreamer and a Python-based interface.
- Maintainer: Michael Ferguson <mike@vanadiumlabs.com>
- Author: Michael Ferguson
- License: BSD
- Source: git https://github.com/mikeferguson/pocketsphinx.git
Overview
This package provides access to the CMU Pocket Sphinx speech recognizer. It uses gstreamer to automatically split the incoming audio into utterances to be recognized, and offers services to start and stop recognition.
Currently, the recognizer requires a language model and dictionary file. These can be automatically built from a corpus of sentances using the Online Sphinx Knowledge Base Tool. Example launch files, language models, and dictionary files can be found in the demo directory of the package. The voice_cmd example controls a mobile base using commands such as "move forward" or "stop". The robocup example uses some of the standard items and names from the RoboCup@Home contest, for instance, it should recognize "hello, my name is Michael" or "Bring me the Original Pringles".
Installing
There is nothing to build, however, to install pocketsphinx itself we will need to run rosdep. In a directory under your ROS package path, do the following:
svn checkout http://albany-ros-pkg.googlecode.com/svn/trunk/rharmony rharmony sudo apt-get install gstreamer0.10-pocketsphinx
Nodes
recognizer.py
Interface to pocket sphinx.Published Topics
~output (std_msgs/String)- Complete utterance as heard by pocket sphinx.
Services
~start (std_srvs/Emtpy)- Connect the audio stream to the speech recognizer and start listening.
- Disconnect audio stream from speech recognizer.
Parameters
~lm (str, default: required)- Location of language model file.
- Location of dictionary file.






