Linked by Thom Holwerda on Fri 7th Oct 2011 20:48 UTC
PDAs, Cellphones, Wireless I don't think I've ever seen this before, but please correct me if I'm wrong. Samsung anf Google were supposed to unveil the Samsung Nexus Prime with Android Ice Cream Sandwich next week, but in a surprise announcement, the companies said that the press event is cancelled - out of respect for Steve Jobs. In the meantime, leaked specifications reveal that the Nexus Prime could be a real doozy.
Permalink for comment 492436
To read all comments associated with this story, please click here.
RE[7]: press release interpreted
by Neolander on Mon 10th Oct 2011 17:40 UTC in reply to "RE[6]: press release interpreted"
Neolander
Member since:
2010-03-08

I could go on and on about detecting phonemes in a noisy environments, people who "eat" phonemes when they speak too quickly, neologisms, context sensitivity and the languages that are heavily based on that such as Japanese, and so on, but I hope that at this stage you see my point.

Many people, in which I believe you are included, think that voice recognition is simple. This feeling comes from the fact that we do it everyday, in a relatively painless fashion, only asking people to repeat what they just said infrequently. The truth is, it is not, and there is a reason why children take so much time to get a rich vocabulary.

Voice recognition is a fantastically complex problem, whose complexity probably borders that of translating one language to another. It is not only a problem of processing power, but also of gathering the required knowledge in a way that is accessible to a computer program. AI gathers knowledge from where it is most useful, the user, and makes use of it to improve the recognition quality, so it obviously a vital part and has been there for ages. In academia, I am ready to bet that voice recognition is mostly studied in AI labs, in the same kind of team that works on automated translation.

Saying that "Siri is different from voice recognition because it is an AI" is thus deeply, totally wrong. Voice recognition IS AI. Slapping stuff after it which processes the extracted text, like a WolframAlpha backend that can find answers to an oral question, is certainly a nice touch, could qualify as an interesting integration effort, but is by no means the revolution you want to make it be.

Edited 2011-10-10 17:42 UTC

Reply Parent Score: 1