<?xml version="1.0" encoding="utf-8" ?>
<rss version="2.0" xmlns:osnews="http://osnews.com/rss2#">
	<channel>
		<title>OSNews: </title>
		<link>http://www.osnews.com/story/15258/Latest_Advancements_in_Speech_Recognition</link>
		<description>Exploring the Future of Computing</description>
		<language>en-us</language>
		<copyright>Copyright 2001-2009, David Adams</copyright>
		<webMaster>adam+nospam@osnews.com</webMaster>
		<lastBuildDate>Tue, 10 Nov 2009 06:56:35 GMT</lastBuildDate>
		<image>
			<url>http://www.osnews.com/images/osnews.gif</url>
			<title>OSNews.com</title>
			<link>http://www.osnews.com</link>
		</image>
		<item>
			<title>Not the first w/out training</title>
			<link>http://osnews.com/thread?145292</link>
			<guid isPermaLink="true">http://osnews.com/thread?145292</guid>
			<description>I don't think it is the &quot;first speech recognition software capable of handling continuous speech without the user having to train it in advance.&quot;  Sphinx (<a href="http://cmusphinx.sourceforge.net/html/cmusphinx.php" rel="nofollow">http://cmusphinx.sourceforge.net/html/cmusphinx.php</a>)  doesn't really use training (although you can) and has been around for a long time (decade?).  Perhaps they meant the first commercial or consumer software package, but it is definitely not the first software.<br />
<br />
As my friend who uses Sphinx on a robot noted, &quot;you can get near 100% accuracy with Sphinx4 if you use a JSGF finite state grammar and are in quiet conditions and/or have a good mic close to your mouth.&quot;  That said, Sphinx is usually limited to a small vocabulary (&lt; 100 words), so this new software might be better at larger sets of words.</description>
			<pubDate>Sat, 22 Jul 2006 01:08:00 GMT</pubDate>
			<author>donotreply@osnews.com (neocephas)</author>
			<category>Comments</category>
		</item>

		<item>
			<title>Hope it's better than DNS 8</title>
			<link>http://osnews.com/thread?145299</link>
			<guid isPermaLink="true">http://osnews.com/thread?145299</guid>
			<description>Tried DNS 8 a few months ago, and as soon as I would speak, CPU usage would go up to 99% and my computer would grind to a halt, this on a P4 2.8ghz w/512MB of RAM. It was hard to tell how accurate it was, as having to speak one sentence at a time and then having to wait for 30 seconds or so for the results was more trouble than it was worth. <br />
<br />
I'm desperate looking for a way to get text from a printed book onto a computer. I tried one of those OCR pen scanners (C-pen 800), and I guess I can't scan in a straight line or something, because that thing didn't work for sh*t.<br />
<br />
I'd gladly pay $1,000 or more for a workable solution.Edited 2006-07-22 02:28</description>
			<pubDate>Sat, 22 Jul 2006 02:27:00 GMT</pubDate>
			<author>donotreply@osnews.com (WorknMan)</author>
			<category>Comments</category>
		</item>

		<item>
			<title>RE: Not the first w/out training</title>
			<link>http://osnews.com/thread?145324</link>
			<guid isPermaLink="true">http://osnews.com/thread?145324</guid>
			<description>Sphinx relies on limited domain of discourse to reduce the difficulty of the task.  It's not just that it's a small vocabulary, but that it has task specific grammar to use.<br />
<br />
If you want to have fun with Sphinx or similar systems, call a voicemail system that uses it and ask it about something relevant but unrelated -- like ask the bus kiosk how the weather is.<br />
<br />
viavoice and dragondictate both do continuous speech recognition now, but require training to get efficiency up to usable.<br />
<br />
i would have to try dragon 9 before i would believe that it has a high recognition rate without training. that's a very difficult problem.<br />
<br />
however, i've had reasonably good luck with both dragon and viavoice, after careful training.<br />
<br />
neither works well for programming, though.</description>
			<pubDate>Sat, 22 Jul 2006 06:38:00 GMT</pubDate>
			<author>donotreply@osnews.com (Cloudy)</author>
			<category>Comments</category>
		</item>

		<item>
			<title>RE: Hope it's better than DNS 8</title>
			<link>http://osnews.com/thread?145326</link>
			<guid isPermaLink="true">http://osnews.com/thread?145326</guid>
			<description>Buy a copy of the book you can affort to destroy. remove it from its binding.  Get a decent OCR program and a flat bed scanner. Train the scanner for the fonts used in the book.<br />
<br />
If you have to do this with multiple books, find a library that has a copy machine designed for copying from bound material, and use it to make copies of the pages you need to scan.<br />
<br />
The trick is a decent flatbed scanner and decent OCR software, and you should be able to get both together for  less than a grand.</description>
			<pubDate>Sat, 22 Jul 2006 06:40:00 GMT</pubDate>
			<author>donotreply@osnews.com (Cloudy)</author>
			<category>Comments</category>
		</item>

		<item>
			<title>DNS and OCR </title>
			<link>http://osnews.com/thread?145329</link>
			<guid isPermaLink="true">http://osnews.com/thread?145329</guid>
			<description>Dragon, the world's first continuous-speech dictation software was sidelined <br />
after it's creators (Jim and Janet Baker) sold it to L &amp; H for stock.<br />
<br />
L &amp; H went &quot;belly up&quot; after it was found they made up $277 million in revenue.<br />
<br />
<br />
<br />
from the wired article<br />
<br />
<a href="http://www.wired.com/wired/archive/11.02/code_pr.html" rel="nofollow">http://www.wired.com/wired/archive/11.02/code_pr.html</a> <br />
<br />
&quot;Left with nothing, Jim and Janet Baker turned to the courts. In a failed <br />
attempt to retrieve Dragon from among the L&amp;H assets that were now locked up <br />
by bankruptcy laws, they hired the powerhouse law firm run by David Boies. ... <br />
The shelves of their home are crowded with figurines, all colors and sizes - <br />
made of glass, wood, plastic, brass - and all shaped like dragons, emblems of <br />
the company they no longer own. Sitting at her dining room table, Janet Baker <br />
is stoic. Her still hands rest on a place mat. It's as if she's at a vast <br />
distance from Dragon Systems.<br />
<br />
But she's not. The Dragon application, with the 300,000-line recognizer at <br />
its heart, lives just a couple of dozen exits north on Route 128. The code's <br />
new owner, ScanSoft, bought it at auction in the luxurious law offices of a <br />
bankruptcy firm...<br />
<br />
Janet Baker has reservations about how her software will fare. 'ScanSoft will <br />
make incremental improvements,' she says politely, 'but they won't apply the <br />
resources we did. The progress in the field has slowed immensely with Dragon <br />
out of the picture.'<br />
<br />
<br />
<br />
As for the OCR problem, try using a flatbed scanner. You won't have the <br />
problem of a shaky hand with no hand involved.</description>
			<pubDate>Sat, 22 Jul 2006 07:06:00 GMT</pubDate>
			<author>donotreply@osnews.com (mikesum32)</author>
			<category>Comments</category>
		</item>

		<item>
			<title>dragon is better, just takes time</title>
			<link>http://osnews.com/thread?145348</link>
			<guid isPermaLink="true">http://osnews.com/thread?145348</guid>
			<description>I suffer from RSI occasionally, so I use Dragon to type things that don't require immediate responses (e.g. long e-mails on the backburner or notes to myself).  If you use Dragon with their own program (DragonPad) and then cut-and-paste the results into whatever editor you'd really like to use (MS Word, or an edit box is Firefox), everything is great.  The point of Dragon isn't to replace your keyboard entirely, but just to make it so we don't use it quite as much, since keyboards cause injuries and are awkward to use, despite most techies managing to have mastered them.<br />
<br />
I threw in for the upgrade to Dragon 9, I hope it gives what's promised -- namely improved accuracy.</description>
			<pubDate>Sat, 22 Jul 2006 14:24:00 GMT</pubDate>
			<author>donotreply@osnews.com (morganth)</author>
			<category>Comments</category>
		</item>

		<item>
			<title>RE[2]: Hope it's better than DNS 8</title>
			<link>http://osnews.com/thread?145349</link>
			<guid isPermaLink="true">http://osnews.com/thread?145349</guid>
			<description>Most places won't let you make copies of copyrighted material.</description>
			<pubDate>Sat, 22 Jul 2006 14:30:00 GMT</pubDate>
			<author>donotreply@osnews.com (CPUGuy)</author>
			<category>Comments</category>
		</item>

		<item>
			<title>I find it weird to talk to your computer</title>
			<link>http://osnews.com/thread?145361</link>
			<guid isPermaLink="true">http://osnews.com/thread?145361</guid>
			<description>Using voice command, I always feel weird sitting in a room by myself talking to my computer.  Dictation is another story, because you are just converting speech to text, but I find the whole concept of voice command unsettling.</description>
			<pubDate>Sat, 22 Jul 2006 16:33:00 GMT</pubDate>
			<author>donotreply@osnews.com (gregk)</author>
			<category>Comments</category>
		</item>

		<item>
			<title>DNS 9</title>
			<link>http://osnews.com/thread?145369</link>
			<guid isPermaLink="true">http://osnews.com/thread?145369</guid>
			<description>David Pogue gave Dragon Naturally Speaking a very favorable review in the NY Times tech section here: <br />
<a href="http://www.nytimes.com/2006/07/20/technology/20pogue.html?ei=5087%0A&amp;en=bbfdc94772adaeb6&amp;ex=1153713600&amp;adxnnl=1&amp;adxnnlx=1153590644-Or/2eqliWiHk/SZOnDWgIg" rel="nofollow">http://www.nytimes.com/2006/07/20/technology/20pogue.html?ei=5087~*...</a> <br />
<br />
I used DNS Preferred extensively several years ago. Pogue says that the initial training session is now optional and recognition is quite good at 99.6%. It improves with training.</description>
			<pubDate>Sat, 22 Jul 2006 17:52:00 GMT</pubDate>
			<author>donotreply@osnews.com (AndrewZ)</author>
			<category>Comments</category>
		</item>

		<item>
			<title>RE[3]: Hope it's better than DNS 8</title>
			<link>http://osnews.com/thread?145379</link>
			<guid isPermaLink="true">http://osnews.com/thread?145379</guid>
			<description>Libraries will let you make copies of limited amounts of copyrighted material, since that's allowed under fair use.</description>
			<pubDate>Sat, 22 Jul 2006 20:03:00 GMT</pubDate>
			<author>donotreply@osnews.com (Cloudy)</author>
			<category>Comments</category>
		</item>

		<item>
			<title>RE: I find it weird to talk to your computer</title>
			<link>http://osnews.com/thread?145450</link>
			<guid isPermaLink="true">http://osnews.com/thread?145450</guid>
			<description>Using voice command, I always feel weird sitting in a room by myself talking to my computer. Dictation is another story, because you are just converting speech to text, but I find the whole concept of voice command unsettling.<br />
<br />
Why? Are you scared your PC will talk back to you? <img src="/images/emo/confuse.gif" alt=";)" /></description>
			<pubDate>Sun, 23 Jul 2006 03:39:00 GMT</pubDate>
			<author>donotreply@osnews.com (1c3d0g)</author>
			<category>Comments</category>
		</item>

		<item>
			<title>RE[2]: I find it weird to talk to your computer</title>
			<link>http://osnews.com/thread?145649</link>
			<guid isPermaLink="true">http://osnews.com/thread?145649</guid>
			<description>Why? Are you scared your PC will talk back to you? <img src="/images/emo/confuse.gif" alt=";)" /> <br />
<br />
<br />
You think clippy is bad? try listening to peedy read some time. <img src="/images/emo/wink.gif" alt=";)" /></description>
			<pubDate>Mon, 24 Jul 2006 05:10:00 GMT</pubDate>
			<author>donotreply@osnews.com (mipeligro)</author>
			<category>Comments</category>
		</item>
	</channel>
</rss>
