Does it come with built in malware and does it have DRM so it crashes?
Google widens search net and takes on Siri with iOS app
Google is moving closer to a planned search singularity with the extension of the Knowledge Graph system, a trial to allow personal Gmail search results to be included in generic web searches, and an iOS app that takes voice requests and tries to answer. Knowledge Graph, launched in May for English-speaking users of Google, …
-
-
Wednesday 8th August 2012 23:28 GMT Anonymous Coward
Re: Isn't competing with an Apple product
Of course not, plenty of products that compete with Apple are approved, like Sparrow, Kindle, Spotify... There's other voice search apps on the store already, like Evi or Voice Actions.
The only rule is that apps that look confusingly similar to the bundled apps can be rejected, but from the video I think Google probably scraped that through (knowing Google they probably iterated through less and less similar designs until it finally got accepted)
-
-
Wednesday 8th August 2012 23:04 GMT Anonymous Coward
How things change
I don’t believe that your phone should be an assistant. Your phone is a tool for communicating. You shouldn’t be communicating with the phone; you should be communicating with somebody on the other side of the phone.
– Andy Rubin, SVP of Mobile at Google and founder of Android,October 19, 2011
Rubin->reality translation:
"Shit, we don't have that feature yet"
-
Wednesday 8th August 2012 23:46 GMT Andrew Jones 2
Re: How things change
Google still don't believe your phone should be an assistant - when was the last time you asked an assistant how tall you have to be to ride space mountain, or what time the sun rises in Japan or how long the runtime is for the movie Ted?
The Google Voice search is also not new - Android has voice search forever - the only different is that if there is a specific answer to your question that Google knows about it - it can speak the answer instead of just providing a list of results (which it will still do even if it does know the answer).
Now - on to what I wanted to ask.
Could the register please find out more about the technology behind the spoken response? On all the Google Now videos on YouTube - the response actually sounds like Majel Barrett - who of course is the voice of the Star Trek computer. Now it's obvious that Majel Barrett probably hasn't sat and recorded every known word in the English Language - so could you find out how they managed this feat?
There are rumours that Google acquired some tech last year that enables them to record about 200 words of someone's voice and then some sophisticated tech can uses some special algorithm to generate any word you want in the persons voice - but there isn't much information out there about it.
Even if this tech is sci-fi fantasy - the response does not sound like the artificial voices TTS has always been plagued with - so it would still be interesting to find out more about the tech Google is using to enable a more human sounding voice.
Many thanks.
-
Thursday 9th August 2012 07:17 GMT Sean Timarco Baggaley
Re: How things change
Modern speech synthesis uses a matrix of phonemes at its core. The "Siri" voice in the UK is the same as the "Daniel" voice available for OS X Lion / Mountain Lion users and is based on the voice of Jon Briggs.
Jon went into a studio some years ago and recorded 5000 phrases in a monotone voice. From those phrases, a complete set of phonemes was extracted and these are what the synthesiser plays, adding pitch and volume changes to simulate stress and intonation.
This is why, although Siri sounds a lot better than the older fully synthesised systems of old (think Prof. Stephen Hawking), we can still tell that it's a synthetic voice. When humans modulate their voices, they do so by changing a lot more than mere pitch and volume: they constrict their throat, move their tongue, lips and change mouth-shape, change how they breathe, and so on. Our ears are trained to notice these changes, so we're still in the voice synthesis equivalent of the Uncanny Valley. But we're getting pretty damned close.
Crossing the valley completely is technically feasible, but will require paying the voice artist to record those 5000 phrases multiple times, in multiple intonations and projection levels. That alone will add days to the recording process, and it'll be very, very boring. Once processed, the resulting voice phoneme set will also be much bigger—the UK "Siri" voice is a 500MB-ish download already—and that's the biggest problem. It'd take much, much longer to process the raw sample data and produce the final phoneme matrices, and it'll take up a whopping great chunk of storage space too. Realistically, computers will need to come with either much faster storage, or a lot more RAM, as standard.
-
Thursday 9th August 2012 08:31 GMT Woodgar
Re: How things change
Sounds similar to a vocaloid http://www.vocaloid.com/ (page in Japanese), only more sophisticated.
Vocaloids use the same technique, but for singing rather than speaking, but the principle is the same, in that they use a voice bank of phonemes to construct a full singing voice.
-
Friday 10th August 2012 00:37 GMT CmdrX3
Re: How things change
All very interesting, but it still doesn't stop Siri from not having a fucking clue what I want most of the time. I have a five minute journey to my brothers house. If I try to use Siri to call him and say I'm on my way I'm usually there before El Thicko even manages to figure out who I want to call and on what number. I've tried using it on numerous occasions but it rarely gets things entirely accurate and I give up and do it manually which usually works out quicker. Hopefully IOS 6 will improve it. but I'm not holding my breath.
-
-
-
-
Thursday 9th August 2012 08:43 GMT g e
Ages?
It's been in Android (or installable as Google Voice) for ages, think I had it on my old HTC Magic, so up to 3 years ago there. Apple have had it since the 4s phone iteration, so about a year if memory serves. Apple bought it from Nuance (IIRC), dunno where Google got theirs, I guess it's maybe derived from Festival/Sphinx for ASR/TTS, perhaps? Wouldn't have thought Google would bother writing one from scratch as there's so much FOSS groundwork done out there already, maybe they bought someone too.
-
-
-
Thursday 9th August 2012 13:29 GMT nexsphil
Re: Be careful what you wish for
This is bloody true actually. Imagine a computer that simply answered your kids' questions about Santa with a flat "No", or labelled your favourite religion a "simplistic control scam". Or reminded you that political corruption and conspiracy is very, very far from "haha impossible" and that your generation will be regarded as morons for believing this for the next 1000 years.
Yeah, truth has limited use to humans. We tend to favour comfort.
-