Linked by Eugenia Loli on Sun 18th Mar 2007 22:56 UTC
Windows It's Sunday afternoon here, there is no better time to spend by having some fun with some of the [in]abilities of operating systems. There is definitely quite some rocky road still for speech recognition and in extend natural language & artificial intelligence. Hopefully we will still be alive when that happens.
Order by: Score:
Thank you
by mcduck on Sun 18th Mar 2007 23:48 UTC
mcduck
Member since:
2005-11-23

Why say thank you back when it finally works? ;)

I was actually impressed he had the guts to sit there for 10 minutes arguing with the computer without leaving or beating the crap out of it.

Edited 2007-03-18 23:49

Reply Score: 2

That's really impressive.
by Anon on Sun 18th Mar 2007 23:49 UTC
Anon
Member since:
2006-01-02

While the guy was probably trying to take the micky out of the Voice Recognition. I think it handled really well.

He purposely made stupid noises, so of course the Voice Recognition is going to pick them up. Towards the end his only THEN decided to 'turn microphone off' etc...

I'm quite impressed to be honest, nothing as good and as integrated as that exists in the Linux world. That's one thing in Vista that is certainly impressive.

Reply Score: 5

RE: That's really impressive.
by mcduck on Sun 18th Mar 2007 23:53 UTC in reply to "That's really impressive."
mcduck Member since:
2005-11-23

I think it was really impressive, compared to speech recognition in the past. With a little practice, it should do well for letters and memos.

As for writing code, good luck ;>

Reply Score: 4

RE[2]: That's really impressive.
by Almafeta on Mon 19th Mar 2007 00:03 UTC in reply to "RE: That's really impressive."
Almafeta Member since:
2007-02-22

As for writing code, good luck ;>

I am reminded how Hypercard scripting looked an awful lot like English...

Reply Score: 2

Funny
by Square on Sun 18th Mar 2007 23:52 UTC
Square
Member since:
2005-10-01

It's a funny video. I'm not sure what's more funny the video or the comments that think that any voice to text program should be able to write code, or that its somehow proof that vista sucks couse it don't understand someone trying to sound out code

Reply Score: 2

That's funny!
by Almafeta on Sun 18th Mar 2007 23:59 UTC
Almafeta
Member since:
2007-02-22

That'll teach some of us to mutter under our breaths at our computer... it's listening now! :>

But then again, this wasn't a total failure -- even when he made a mistake, he was able to use the VUI to fix his errors.

Who knows? Maybe future OSs will listen to how often their users curse and ramble and dumb down -- er, that is, selectively limit the interface appropriately. (Clippit pops up: "I heard you talking about 'finding' something. Are you trying to find a file?")

Edited 2007-03-19 00:07

Reply Score: 1

Babel
by Punktyras on Mon 19th Mar 2007 00:10 UTC
Punktyras
Member since:
2006-01-07

Is this beginning of the end for thousands of languages that happens to be not English? Are we going to build another Babel tower? This time with windows i.e. Windows?

Reply Score: 1

RE: Babel
by Almafeta on Mon 19th Mar 2007 00:16 UTC in reply to "Babel"
Almafeta Member since:
2007-02-22

Is this beginning of the end for thousands of languages that happen to be not English?

As most languages out there are phonetic, Vista's speech recognition will be fairly easy to localize. In fact, this could be a boon for non-standard languages, because even if documentation can't be produced quickly or cheaply in various minor languages, it's possible that an oral interface for those languages would work much more naturally.

Reply Score: 1

RE: Babel
by n4cer on Mon 19th Mar 2007 17:28 UTC in reply to "Babel"
n4cer Member since:
2005-07-06

Is this beginning of the end for thousands of languages that happens to be not English? Are we going to build another Babel tower? This time with windows i.e. Windows?

Vista's speech recognition supports English (U.S.), English (U.K.), German (Germany), French (France), Spanish (Spain), Japanese, Chinese (Traditional), and Chinese (Simplified).

Reply Score: 2

will
Member since:
2005-06-29

http://www.youtube.com/watch?v=S9T3RXFDtfo&NR

good stuff. Just over 1:30 to do it. Not the exact same code, but still good.

Edited 2007-03-19 00:51

Reply Score: 2

PlatformAgnostic Member since:
2006-01-02

Dragon's been doing the speech recognition game for longer, so I'm not surprised that they have pretty refined voice commands. Also the guy seemed to have a lot more experience with Dragon than the first guy did.

Anyway, the lesson is that neither speech recognizer is really meant for writing code. I tried writing code with my TabletPC stylus once. I had to quit after taking a minute to write a single line.

Reply Score: 3

obligatory
by dillee1 on Mon 19th Mar 2007 07:12 UTC
dillee1
Member since:
2005-08-10

Dear aunt, let’s set so double the killer delete select all

Reply Score: 2

Get confortable with it first
by racs on Mon 19th Mar 2007 07:59 UTC
racs
Member since:
2006-05-14

Just like many things in the world of computers: you must learn to use it first. Have anybody born with the capabilities of typing?

Reply Score: 1

Is it true?
by fretinator on Mon 19th Mar 2007 15:50 UTC
fretinator
Member since:
2005-07-06

That you just say "Hasta La Vista" to shutdown Windows?

Reply Score: 2

Original post is mistitled
by helio9000 on Mon 19th Mar 2007 16:13 UTC
helio9000
Member since:
2006-05-24

Really? This one stupid video is "A Look at Vista's Speech Recognition"? Man what an egregiously lame post. (For starters this was dugg to death a *month* ago - "osnews, you read it here...er, 100th!"). I like osnews because often you will have an anchor link but then will point to a couple takes on a subject and that's what a real "look at Vista's speech recoginition" would do. Show a couple different takes rather than just this one guy who seems vaguely disappointed when it does work. We have no indication that this is anything but the first time he's ever put on a headset. The "response video" doesn't really make much sense because you have to train both and it is clear the dragon dude is very familiar with it and the moron in the first video is just messing around.

Also, it is hardly suprising that a consumer os's speech engine wouldn't be too well suited to the specialized syntax of script and so people much smarter than either of these two are building an engine optimized for code:

http://voicecode.iit.nrc.ca/VoiceCode/public/ywiki.cgi


IMHO Vista's speech recognition is pretty good. As good as Dragon? Probably not but certainly the best that has ever shipped as part of an OS thus far. The grid and numbering system is a tiny boon to handicapped people and great to have it built in. No less than Apple fanboi deluxe David Pogue agrees.

Pogue
http://tinyurl.com/2gpwja

btw- Leopard is also going to also tranlate to Braille devices which is awesome.

Spending all of 2 hours with it Eugenia has no credibilty on Vista in the first place (the same way someone spending a couple hours with linux and declaring it useless isn't to be taken seriously) but why not at least post a video of someone getting different results and have the post be an actual look at the app's charms and pitfalls instead of a silly gag video?

Another goofball trying to program with it

http://www.nxtgenug.net/article.aspx?ArticleID=150

Navigation (tiny bit of dictation) demo in French

http://tinyurl.com/25obht

Reply Score: 1

RE: Original post is mistitled
by n4cer on Mon 19th Mar 2007 17:43 UTC in reply to "Original post is mistitled"
n4cer Member since:
2005-07-06

One of the better videos online demoing Vista's speech recognition (though the video's audio isn't the best) is a demo given at SpeechTek.

http://video.google.com/videoplay?docid=5549626370637807497&q=vista...

Reply Score: 2

RE: Original post is mistitled
by Doc Pain on Mon 19th Mar 2007 18:21 UTC in reply to "Original post is mistitled"
Doc Pain Member since:
2006-10-08

"btw- Leopard is also going to also tranlate to Braille devices which is awesome. "

That's fine, but as long as it is usual to put barriers into the WWW, blind people mostly keep outside of it. More and more "Flash" stuff, no proper image references (img alt, longdesc), frames without proper encalsulation and alternative...

I did some studies in rehabilitational pedagogics, we found the poor quality of content presentation increasing. "Web designers" just don't care.

I'm looking forward ad MacOS X how it will serve the needs of blind people.

(This is off topic, so don't really mind. Sorry, cound not resist.)

Reply Score: 2

RE: Original post is mistitled
by Headrush on Tue 20th Mar 2007 15:13 UTC in reply to "Original post is mistitled"
Headrush Member since:
2006-01-03

I agree that YouTube video was "made" to look as poorly useable as possible, but you say its the best of any OS, have you even tried Speech Recognition in OS X?

Without any training I have found it to be quite robust and with applescript it is a snap to make custom speech commands for programs that don't even support speech.

You are right in that OS level speech commands is a totally different beast than dictation. People need to distinguish between products like Natural Speaking and iListen and the OS feature.

BTW, did I mention this post wasn't typed. Speech worked fine for me. IMing without typing is a god send also.

Reply Score: 1

Should be learning from his mistakes
by aarnott on Mon 19th Mar 2007 17:00 UTC
aarnott
Member since:
2005-08-09

The earlier commments says most of it. But this guy seems to be intentionally making it look worse than necessary by using the same voice non-commands repeatedly: "press caps lock" should be merely "caps lock". Worked the first time for me.
"Doctor, doctor, it hurts when I do this."

Reply Score: 3

Vista == Dragon
by AndrewZ on Mon 19th Mar 2007 21:10 UTC
AndrewZ
Member since:
2005-11-15

Microsoft licensed the original speech recognition code from Lernout & Hauspie a few years ago. Vista is using the same basic code as found in Dragon Naturally Speaking.

Reply Score: 1

RE: Vista == Dragon
by n4cer on Mon 19th Mar 2007 23:57 UTC in reply to "Vista == Dragon"
n4cer Member since:
2005-07-06

Actually, the TTS and Recognition engines in Vista (and in other products like MS Speech Server) are based off of the work of MSR and MSR Asia.

http://research.microsoft.com/srg/
http://research.microsoft.com/speech/

Reply Score: 2

Serious question...
by rcsteiner on Wed 21st Mar 2007 15:47 UTC
rcsteiner
Member since:
2005-07-12

IBM bundled both "voice dictation" and "voice navigation" with its OS/2 Warp 4 product back in 1996. Some Warp 4 packages even included a headset microphone.

It's been ten years. How much better is the voice stuff package with Vista? Does it require less training time than the Warp stuff from IBM did?

It would be interesting (at least to me) to see the level of progress that has been made, if any.

Reply Score: 2