GUADEC Presentation: Speech Recognition + Accessibility

8 07 2009

Almost There!!! Today is my presentation at GUADEC.

I will talk about speech recognition and accessibility.

The talk is: Using the Speech as a Tool of Accessibility [1]

[1] http://www.grancanariadesktopsummit.org/node/214





GUADEC 2008

2 05 2008

I’m glad to announce that I was accepted to give a presentation at GUADEC 2008. I will present together with Nickolay V. Shmyrev, who was my mentor in Google Summer of Code last year.

More information about our presentation can be found at [1].

[1] http://live.gnome.org/GnomeVoiceControl/Guadec2008

 





Benchmark of Sphinx2, Sphinx3, PocketSphinx

8 08 2007

I benchmarked Sphinx2, Sphinx3, and PocketSphinx to compare memory usage, decoding time, and recognition errors (sentence errors and word errors).
The tests were run on an AMD Athlon(TM) XP 2000+ at 1670.608 MHz with 512 MB of memory.

Sphinx2

Memory used: 4.7% of 512 MB = 24.064 MB

root@controle03# time perl scripts_pl/decode/slave.pl
MODULE: DECODE Decoding using models previously trained
Decoding 130 segments starting at 0 (part 1 of 1)
Using files: 0% 10% 20% 30% 40% 50% 60% 70% 80% 90% 100% Finished
SENTENCE ERROR: 78.462% (102/130) WORD ERROR RATE: 37.904% (293/773)

real 0m49.394s
user 0m46.599s
sys 0m0.296s

Sphinx3

Memory used: 5.8% of 512 MB = 29.696 MB

root@controle03# time perl scripts_pl/decode/slave.pl
MODULE: DECODE Decoding using models previously trained
Decoding 130 segments starting at 0 (part 1 of 1)
0%
SENTENCE ERROR: 80.8% (105/130) WORD ERROR RATE: 34.7% (268/773)

real 5m47.224s
user 5m36.393s
sys 0m1.932s

PocketSphinx

Memory used: 0.6% of 512 MB = 3.072 MB

root@controle03# time perl scripts_pl/decode/slave.pl
MODULE: DECODE Decoding using models previously trained
Decoding 130 segments starting at 0 (part 1 of 1)
0%
SENTENCE ERROR: 78.5% (102/130) WORD ERROR RATE: 37.9% (292/773)

real 0m0.211s
user 0m0.156s
sys 0m0.052s

This benchmark shows that Sphinx3 makes fewer word recognition errors (WORD ERROR RATE) than Sphinx2 and PocketSphinx, but its decoding time is much longer than that of Sphinx2 and PocketSphinx. Sphinx3 also uses more memory than Sphinx2 and PocketSphinx.

PocketSphinx uses the least memory and is the fastest decoder.

Gnome-Voice-Control is being modified to use PocketSphinx. Soon we’ll have release 0.3 using PocketSphinx.
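The memory figures and error percentages above can be re-derived from the raw numbers in the logs. The following is only a small sketch, assuming a POSIX shell with awk available; the helper names percent_of and error_rate are made up for this illustration and are not part of the Sphinx scripts.

```shell
# percent_of PERCENT TOTAL: prints PERCENT% of TOTAL with three decimals.
percent_of() {
    awk -v p="$1" -v t="$2" 'BEGIN { printf "%.3f\n", p * t / 100 }'
}

# error_rate ERRORS TOTAL: prints ERRORS/TOTAL as a percentage with one decimal.
error_rate() {
    awk -v e="$1" -v n="$2" 'BEGIN { printf "%.1f\n", 100 * e / n }'
}

# Memory used on the 512 MB test machine, in MB.
percent_of 4.7 512    # Sphinx2      -> 24.064
percent_of 5.8 512    # Sphinx3      -> 29.696
percent_of 0.6 512    # PocketSphinx -> 3.072

# Sentence error rates from the (errors/total) counts in the logs.
error_rate 102 130    # Sphinx2 and PocketSphinx -> 78.5
error_rate 105 130    # Sphinx3                  -> 80.8
```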

Thanks,
Raphael Nunes





Gnome-Voice-Control 0.2 Demonstration

29 06 2007

Hi, we made a new release of gnome-voice-control, along with a demonstration video showing the newly added features.

We had some recognition problems with some actions on the gedit menu; I think my English isn’t so good :-(

I hope you enjoy this release :-)

I would like to thank Hugo Poloni for his help in making this screencast.

The release can be found at:

http://live.gnome.org/GnomeVoiceControl/Downloads

Installation information can be found here

In addition to the installation information above, to make gnome-voice-control 0.2 work you need to enable Assistive Technologies on your Linux system, and the LANG environment variable needs to be set to LANG=C.

To enable Assistive Technologies, go to Preferences/Accessibility/Assistive Technologies and check the box “Enable Assistive Technologies”. You need to log off for the changes to take effect.
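As a rough sketch of the LANG requirement, the snippet below checks the current value and prints the command to fix it. It assumes a POSIX shell; the function name lang_is_c is hypothetical and only used for this illustration, and where you make the setting permanent depends on your distribution.

```shell
# lang_is_c VALUE: succeeds only when VALUE is exactly "C",
# the locale that gnome-voice-control 0.2 needs.
lang_is_c() {
    [ "$1" = "C" ]
}

if lang_is_c "${LANG:-}"; then
    echo "LANG is already C"
else
    echo "LANG is '${LANG:-unset}'; set it with: export LANG=C"
fi
```

The export only affects the shell it is run in and the programs started from it, so it has to be in effect for the session that runs the panel.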





Gnome Voice Control – Demonstration

16 06 2007

Hi, I made a brief demonstration video to show the functionality of the first release of Gnome Voice Control, which is being developed in GSoC.

I hope you enjoy it.

I would like to thank luizirber, better known as Gaucho, for his help in making the demonstration video.

The software can be found at:

http://www.comp.ufscar.br/~raphael18/gnome-voice-control-0.1.tar.gz

Installation instructions can be found at:

http://raphaelnunes.wordpress.com/2007/06/07/gnome-voice-control-01-release-gsoc/ 

Thanks,
Raphael Nunes





GNOME-VOICE-CONTROL 0.1 RELEASE GSoC

7 06 2007

First, a brief introduction about me:

My name is Raphael Nunes, and I’m a Computer Engineering student at the Federal University of São Carlos (UFSCar). Currently I work in system administration at SimuCAD (Department of Production Engineering – UFSCar), and I’m participating in Google Summer of Code 2007, working on a GNOME project.

In this first post I would like to announce the first release of the software that I’m developing in Google Summer of Code 2007 (GSoC). If you don’t know GSoC, visit http://code.google.com/soc. The software is gnome-voice-control, and it is being developed for the GNOME Foundation. The project application that I submitted to GSoC can be found at http://code.google.com/soc/gnome/appinfo.html?csaid=4F64D394968BB092.

gnome-voice-control is an applet for controlling the GNOME desktop by voice.

The gnome-voice-control can be downloaded from http://www.comp.ufscar.br/~raphael18/gnome-voice-control-0.1.tar.gz

To install gnome-voice-control:

First download sphinx2, a speech decoder developed by Carnegie Mellon University.

wget http://cmusphinx.org/download/nightly/sphinx2.nightly.tar.gz

Extract

tar -xvvzf sphinx2.nightly.tar.gz

Enter the sphinx2 folder

cd sphinx2

Build and install sphinx2 by running the following commands as root

./configure --prefix=/usr

make

make install

Exit the sphinx2 folder

cd ..

Once sphinx2 is installed, download gnome-voice-control

wget http://www.comp.ufscar.br/~raphael18/gnome-voice-control-0.1.tar.gz

Extract

tar -xvvzf gnome-voice-control-0.1.tar.gz

Enter the gnome-voice-control folder

cd gnome-voice-control-0.1

To install gnome-voice-control, run the following commands as root

./configure --prefix=/usr --with-sphinx2=/usr

make

make install

Now gnome-voice-control is installed.

Add gnome-voice-control to the gnome-panel

Right-click the panel and choose add to panel; a window for choosing applets will appear. In the Miscellaneous section there is an applet named VoiceControl; add it.

Afterwards you will see an applet showing the text ‘idle’. To start gnome-voice-control, right-click the applet and choose start control. When the applet shows the text ‘ready’, you can start speaking.

To stop gnome-voice-control, right-click the applet and choose stop control.

I hope that you like the software.

The actions implemented in this first release are:

RUN TERMINAL : Open gnome-terminal

RUN MAIL : Open Evolution

RUN BROWSER : Open Epiphany

MINIMIZE WINDOW

MAXIMIZE WINDOW

CLOSE WINDOW

NEXT WINDOW
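To illustrate how the recognized phrases above map to actions, here is a small sketch in shell. It is not how the applet is implemented; the function name action_for and the "window-action" label are hypothetical, and the program names are assumed to be the usual binaries for the applications listed.

```shell
# action_for PHRASE: prints what a recognized phrase would trigger.
# Fails (non-zero status) for phrases that are not in this release's list.
action_for() {
    case "$1" in
        "RUN TERMINAL") echo "gnome-terminal" ;;
        "RUN MAIL")     echo "evolution" ;;
        "RUN BROWSER")  echo "epiphany" ;;
        "MINIMIZE WINDOW"|"MAXIMIZE WINDOW"|"CLOSE WINDOW"|"NEXT WINDOW")
                        echo "window-action" ;;
        *)              return 1 ;;
    esac
}

# Only print the mapping here; nothing is actually launched.
for phrase in "RUN TERMINAL" "CLOSE WINDOW" "OPEN DOOR"; do
    if target=$(action_for "$phrase"); then
        echo "$phrase -> $target"
    else
        echo "$phrase -> not recognized"
    fi
done
```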

Comments about the software are appreciated.

Thanks,

Raphael Nunes







