E-Mail:
Get our new Windows 7 eBook (PDF) for $7 with 70+ Tips. Download Now!

Speak to me, Linux

  • No Related Post

The idea of having voice recognition on your Linux desktop sounds like great idea to me personally. Now, in this article on NewsForge covering this subject, there are two different applications designed to meet the needs of their perspective users. The apps are called the Sphinx-2 speech recognition engine and the Festival text to speech application.

They both look to have a lot of promise, but what about people that are still struggling to get their USB headsets to work in Linux? Since hardly anyone uses those sound card mics any longer, this is the mic that most of us will be using. In addition to getting tutes for these speech programs, I would love to see one for USB headsets that was simple to follow and equally easy to implement.

Voice control is the next step in human interaction with computers. Voice recognition, and its flip side, speech synthesis, can help you streamline your day-to-day work and organize your Linux desktop in a better way.

To begin conversing with your Linux desktop, download the Sphinx-2 speech recognition engine and the Festival text to speech application. Although the CMU Sphinx Group provides several versions of Sphinx (Sphinx-2, -3, and -4), I use only Sphinx-2, as it is the fastest. Even though it is not as accurate as Sphinx-3 or Sphinx-4, it runs in real time, and therefore works well with live applications.

The installation of Sphinx-2 and Festival should be trivial; most distributions already have binaries, and even compiling from source should not be difficult. Debian users might find Festival a little tricky to install if they own an onboard sound card with AC97 codecs. (The symptom is speech that sounds twice as fast as it should, no matter what speed you set up; unfortunately I couldn’t find any solution except for changing the sound card.)

Happily, the normal desktop user will not have to learn Festival’s command-line interface, as great applications such as KDE Text-to-Speech System (KTTS) and Perlbox Voice fill this gap. KDE 3.4 will talk to you via Festival, Festival Lite (flite) or FreeTTS (another free speech synthesis written in Java), in a multitude of languages and accents. If you want to use KTTS with your present KDE desktop, sources as well as binaries for Debian, SUSE, and Mandrake are available at KTTS’s home page.

KTTS works by sending the text to be spoken via DCOP to the KTTS daemon. KTTS can read you pop-ups from Knotify, Web sites, or any other text. You can open Konqueror, navigate to a Web site, select Tools, and choose Speak Text. If you didn’t highlight anything, KTTS will read the whole page.

What Do You Think?

 
33 queries / 0.338 seconds.