<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:media="http://search.yahoo.com/mrss/"
		>
<channel>
	<title>Comments on: Benchmark of Sphinx2, Sphinx3, PocketSphinx</title>
	<atom:link href="http://raphaelnunes.wordpress.com/2007/08/08/benchmark-of-sphinx2-sphinx3-pocketsphinx/feed/" rel="self" type="application/rss+xml" />
	<link>http://raphaelnunes.wordpress.com/2007/08/08/benchmark-of-sphinx2-sphinx3-pocketsphinx/</link>
	<description>Português, English, Deutsch, Français</description>
	<lastBuildDate>Fri, 12 Jun 2009 08:12:47 +0000</lastBuildDate>
	<generator>http://wordpress.com/</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: ngun</title>
		<link>http://raphaelnunes.wordpress.com/2007/08/08/benchmark-of-sphinx2-sphinx3-pocketsphinx/#comment-191</link>
		<dc:creator>ngun</dc:creator>
		<pubDate>Sun, 15 Jun 2008 00:50:17 +0000</pubDate>
		<guid isPermaLink="false">http://raphaelnunes.wordpress.com/2007/08/08/benchmark-of-sphinx2-sphinx3-pocketsphinx/#comment-191</guid>
		<description>Is it possible to make it work with the GNOME on-screen keyboard? I mean, all you need to say is the letters, numbers, capital letters, Shift key + letter, for starters. Is this more difficult?

Advantages:

For the physically impaired using the above keyboard.
For those using other languages with the English QWERTY keyboard, for whom voice support in their own language may take a long time to arrive, or may never arrive if it is a minor language.
Other things I have not thought of yet!</description>
		<content:encoded><![CDATA[<p>Is it possible to make it work with the GNOME on-screen keyboard? I mean, all you need to say is the letters, numbers, capital letters, Shift key + letter, for starters. Is this more difficult?</p>
<p>Advantages:</p>
<p>For the physically impaired using the above keyboard.<br />
For those using other languages with the English QWERTY keyboard, for whom voice support in their own language may take a long time to arrive, or may never arrive if it is a minor language.<br />
Other things I have not thought of yet!</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: kasra</title>
		<link>http://raphaelnunes.wordpress.com/2007/08/08/benchmark-of-sphinx2-sphinx3-pocketsphinx/#comment-183</link>
		<dc:creator>kasra</dc:creator>
		<pubDate>Mon, 10 Mar 2008 13:00:11 +0000</pubDate>
		<guid isPermaLink="false">http://raphaelnunes.wordpress.com/2007/08/08/benchmark-of-sphinx2-sphinx3-pocketsphinx/#comment-183</guid>
		<description>I have an acoustic model (triphone) that I use in s3.6 (.cont.), and it works well.
What should I do to use it in PocketSphinx 0.4?
It fails with the error: #codebooks (650) != 1 in libpocketsphinx\s2_semi_mgau.c&quot;, line 1150.
Thanks</description>
		<content:encoded><![CDATA[<p>I have an acoustic model (triphone) that I use in s3.6 (.cont.), and it works well.<br />
What should I do to use it in PocketSphinx 0.4?<br />
It fails with the error: #codebooks (650) != 1 in libpocketsphinx\s2_semi_mgau.c&#8221;, line 1150.<br />
Thanks</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: chief</title>
		<link>http://raphaelnunes.wordpress.com/2007/08/08/benchmark-of-sphinx2-sphinx3-pocketsphinx/#comment-172</link>
		<dc:creator>chief</dc:creator>
		<pubDate>Mon, 24 Dec 2007 01:06:19 +0000</pubDate>
		<guid isPermaLink="false">http://raphaelnunes.wordpress.com/2007/08/08/benchmark-of-sphinx2-sphinx3-pocketsphinx/#comment-172</guid>
		<description>Hi

Great job on gnome-voice-control!  I&#039;ve used versions 0.2 and 0.3, and I thought I&#039;d share my experiences in the hope that it might further development a little and help anyone who&#039;s encountered the same (minor) problems that I had getting it up and running.

Regarding installation:

The 0.2 .deb package in the Ubuntu repos installed without a problem.  It took me a minute or two to realise it was a panel applet, but apart from that it was all good (changing the package description to specify &quot;panel&quot; might be a useful addition, though?).

When compiling 0.3 from source, I had a little trouble finding some of the listed dependencies and found some of the instructions vague, but I think that was due to my level of experience rather than your instructions! I got it installed without too much trouble anyway, so I&#039;m obviously learning.

Regarding initial usage:

0.2: Again, because I installed the software from the repos, I had no instructions or any idea how to use it.  A quick Google search turned up the YouTube screencast, which let me know what commands were supposed to work.  I wasted some time repeating &quot;run browser&quot; in different tones/speeds of voice before doing some more searching and discovering that the program opens Epiphany by default, which I don&#039;t have installed (perhaps you could implement a check and generate an error notification if expected software is not available?).  Anyway, after I got Firefox opening I set about trying to get the other commands recognised...

I had most success using the following approach:

1. Speaking clearly, and *slightly* exaggerating my pronunciation, e.g. with the word &quot;next&quot; emphasise like this - nnex-T.  Or with &quot;file&quot; emphasise the &quot;f&quot; and &quot;l&quot; sounds like this - ffy-le.
2. Not pausing between words if both words are part of the same command.
3. Speaking at my normal speed/slightly faster, avoiding slowing down or &quot;dumbing down&quot; my voice.
4. Trying not to sound frustrated when repeating commands - finding a neutral tone that works and repeating it consistently gives the best results (easier said than done sometimes).

Comparing 0.2 and 0.3:

While I appreciated the additional commands added to 0.3, I found it to be much more &quot;paranoid&quot; than 0.2.  What I mean by this is, 0.2 generally just sat dormant and only reacted when it recognised a command.  It mostly ignored background noise and normal speech, and if it didn&#039;t understand me it ignored me.  However, 0.3 reacts much more to background sounds and best-guesses a command, any command.  Just typing or coughing or moving your chair usually issues a command of some sort.  Making random, silly noises will issue valid commands with 0.3 as well.

Regarding command list:

You might consider the following additional commands if they haven&#039;t already been implemented:

shutdown computer
browse files
run messenger
run music player

That&#039;s about all I can think of right now, except to say I&#039;ll be keeping a close eye on this project and promoting it where I can.  Keep up the good work!</description>
		<content:encoded><![CDATA[<p>Hi</p>
<p>Great job on gnome-voice-control!  I&#8217;ve used versions 0.2 and 0.3, and I thought I&#8217;d share my experiences in the hope that it might further development a little and help anyone who&#8217;s encountered the same (minor) problems that I had getting it up and running.</p>
<p>Regarding installation:</p>
<p>The 0.2 .deb package in the Ubuntu repos installed without a problem.  It took me a minute or two to realise it was a panel applet, but apart from that it was all good (changing the package description to specify &#8220;panel&#8221; might be a useful addition, though?).</p>
<p>When compiling 0.3 from source, I had a little trouble finding some of the listed dependencies and found some of the instructions vague, but I think that was due to my level of experience rather than your instructions! I got it installed without too much trouble anyway, so I&#8217;m obviously learning.</p>
<p>Regarding initial usage:</p>
<p>0.2: Again, because I installed the software from the repos, I had no instructions or any idea how to use it.  A quick Google search turned up the YouTube screencast, which let me know what commands were supposed to work.  I wasted some time repeating &#8220;run browser&#8221; in different tones/speeds of voice before doing some more searching and discovering that the program opens Epiphany by default, which I don&#8217;t have installed (perhaps you could implement a check and generate an error notification if expected software is not available?).  Anyway, after I got Firefox opening I set about trying to get the other commands recognised&#8230;</p>
<p>I had most success using the following approach:</p>
<p>1. Speaking clearly, and *slightly* exaggerating my pronunciation, e.g. with the word &#8220;next&#8221; emphasise like this &#8211; nnex-T.  Or with &#8220;file&#8221; emphasise the &#8220;f&#8221; and &#8220;l&#8221; sounds like this &#8211; ffy-le.<br />
2. Not pausing between words if both words are part of the same command.<br />
3. Speaking at my normal speed/slightly faster, avoiding slowing down or &#8220;dumbing down&#8221; my voice.<br />
4. Trying not to sound frustrated when repeating commands &#8211; finding a neutral tone that works and repeating it consistently gives the best results (easier said than done sometimes).</p>
<p>Comparing 0.2 and 0.3:</p>
<p>While I appreciated the additional commands added to 0.3, I found it to be much more &#8220;paranoid&#8221; than 0.2.  What I mean by this is, 0.2 generally just sat dormant and only reacted when it recognised a command.  It mostly ignored background noise and normal speech, and if it didn&#8217;t understand me it ignored me.  However, 0.3 reacts much more to background sounds and best-guesses a command, any command.  Just typing or coughing or moving your chair usually issues a command of some sort.  Making random, silly noises will issue valid commands with 0.3 as well.</p>
<p>Regarding command list:</p>
<p>You might consider the following additional commands if they haven&#8217;t already been implemented:</p>
<p>shutdown computer<br />
browse files<br />
run messenger<br />
run music player</p>
<p>That&#8217;s about all I can think of right now, except to say I&#8217;ll be keeping a close eye on this project and promoting it where I can.  Keep up the good work!</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Lexen</title>
		<link>http://raphaelnunes.wordpress.com/2007/08/08/benchmark-of-sphinx2-sphinx3-pocketsphinx/#comment-133</link>
		<dc:creator>Lexen</dc:creator>
		<pubDate>Tue, 06 Nov 2007 00:13:17 +0000</pubDate>
		<guid isPermaLink="false">http://raphaelnunes.wordpress.com/2007/08/08/benchmark-of-sphinx2-sphinx3-pocketsphinx/#comment-133</guid>
		<description>I am using gnome-voice-control 0.2 as we speak and some thoughts come to mind.  Does this program limit its options?  What I mean is, if I say &quot;run&quot;, does it have a list of installed programs and judge what I say based on that list, or on the complete dictionary?  I think that if it had the list of programs it would be a lot more reliable.  It might also help if it knew the name of the program more than the command.  Example:  &quot;open office writer&quot; should run &quot;ooffice -writer %U.&quot;

     Have you considered sphinx4?  I know that it is said to be slow and speed is key, but inaccuracy is what makes people give up, so it would make sense to use the latest software.

     Is there a list of possible commands that the user can see to help them use the program to the fullest?  I have the &quot;close window&quot; command working almost without flaw, but I don&#039;t know what other options I have.  I also have no idea how to type.  I have been trying &quot;type hello&quot; and nothing happens.  Maybe I am doing the wrong thing.

     Despite the criticisms, this is a great program and is the only program of its kind that I have been able to install.  It is really easy to use, and that is very important to me and to new computer users.

Thanks,
Lexen</description>
		<content:encoded><![CDATA[<p>I am using gnome-voice-control 0.2 as we speak and some thoughts come to mind.  Does this program limit its options?  What I mean is, if I say &#8220;run&#8221;, does it have a list of installed programs and judge what I say based on that list, or on the complete dictionary?  I think that if it had the list of programs it would be a lot more reliable.  It might also help if it knew the name of the program more than the command.  Example:  &#8220;open office writer&#8221; should run &#8220;ooffice -writer %U.&#8221;</p>
<p>     Have you considered sphinx4?  I know that it is said to be slow and speed is key, but inaccuracy is what makes people give up, so it would make sense to use the latest software.</p>
<p>     Is there a list of possible commands that the user can see to help them use the program to the fullest?  I have the &#8220;close window&#8221; command working almost without flaw, but I don&#8217;t know what other options I have.  I also have no idea how to type.  I have been trying &#8220;type hello&#8221; and nothing happens.  Maybe I am doing the wrong thing.</p>
<p>     Despite the criticisms, this is a great program and is the only program of its kind that I have been able to install.  It is really easy to use, and that is very important to me and to new computer users.</p>
<p>Thanks,<br />
Lexen</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: David Huggins-Daines</title>
		<link>http://raphaelnunes.wordpress.com/2007/08/08/benchmark-of-sphinx2-sphinx3-pocketsphinx/#comment-79</link>
		<dc:creator>David Huggins-Daines</dc:creator>
		<pubDate>Wed, 22 Aug 2007 19:49:01 +0000</pubDate>
		<guid isPermaLink="false">http://raphaelnunes.wordpress.com/2007/08/08/benchmark-of-sphinx2-sphinx3-pocketsphinx/#comment-79</guid>
		<description>Hey,

It looks like PocketSphinx may actually have failed to run completely, which isn&#039;t surprising since the decoding scripts are sometimes broken.

You should definitely see about 20% less runtime and memory consumption, but not something as dramatic as above :-)</description>
		<content:encoded><![CDATA[<p>Hey,</p>
<p>It looks like PocketSphinx may actually have failed to run completely, which isn&#8217;t surprising since the decoding scripts are sometimes broken.</p>
<p>You should definitely see about 20% less runtime and memory consumption, but not something as dramatic as above :-)</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: bastianazzo</title>
		<link>http://raphaelnunes.wordpress.com/2007/08/08/benchmark-of-sphinx2-sphinx3-pocketsphinx/#comment-77</link>
		<dc:creator>bastianazzo</dc:creator>
		<pubDate>Thu, 09 Aug 2007 05:13:40 +0000</pubDate>
		<guid isPermaLink="false">http://raphaelnunes.wordpress.com/2007/08/08/benchmark-of-sphinx2-sphinx3-pocketsphinx/#comment-77</guid>
		<description>I think you have a very good point here, and I&#039;m glad to see this comparison.
PocketSphinx actually seems to be much faster, as well as being lighter.

Honestly, do you think these results are acceptable? I mean, did you expect such close results (2% difference is nothing) with such different time and memory consumption?

Great job, by the way!!</description>
		<content:encoded><![CDATA[<p>I think you have a very good point here, and I&#8217;m glad to see this comparison.<br />
PocketSphinx actually seems to be much faster, as well as being lighter.</p>
<p>Honestly, do you think these results are acceptable? I mean, did you expect such close results (2% difference is nothing) with such different time and memory consumption?</p>
<p>Great job, by the way!!</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Maxo</title>
		<link>http://raphaelnunes.wordpress.com/2007/08/08/benchmark-of-sphinx2-sphinx3-pocketsphinx/#comment-76</link>
		<dc:creator>Maxo</dc:creator>
		<pubDate>Wed, 08 Aug 2007 20:44:54 +0000</pubDate>
		<guid isPermaLink="false">http://raphaelnunes.wordpress.com/2007/08/08/benchmark-of-sphinx2-sphinx3-pocketsphinx/#comment-76</guid>
		<description>Sweet.  I can&#039;t wait for the next release.  I had a lot of fun playing with 0.2.
I would say the biggest thing I would like to see is the ability to make certain voice commands run whatever you want, such as setting the browser command to Firefox, or whatever you want it to do.
But I know these things will probably have to wait until the more important kinks have been worked out.</description>
		<content:encoded><![CDATA[<p>Sweet.  I can&#8217;t wait for the next release.  I had a lot of fun playing with 0.2.<br />
I would say the biggest thing I would like to see is the ability to make certain voice commands run whatever you want, such as setting the browser command to Firefox, or whatever you want it to do.<br />
But I know these things will probably have to wait until the more important kinks have been worked out.</p>
]]></content:encoded>
	</item>
</channel>
</rss>
