Posts Tagged ‘Speech Recognition’

Apple’s Siri to Incorporate Neural Network?

Siri LogoWired’s Robert McMillan recently wrote, “…neural network algorithms are hitting the mainstream, making computers smarter in new and exciting ways. Google has used them to beef up Android’s voice recognition. IBM uses them. And, most remarkably, Microsoft uses neural networks as part of the Star-Trek-like Skype Translate, which translates what you say into another language almost instantly. People “were very skeptical at first,” Hinton says, “but our approach has now taken over.” One big-name company, however, hasn’t made the jump: Apple, whose Siri software is due for an upgrade. Though Apple is famously secretive about its internal operations–and did not provide comment for this article–it seems that the company previously licensed voice recognition technology from Nuance—perhaps the best known speech recognition vendor. But those in the tight-knit community of artificial intelligence researchers believe this is about to change. It’s clear, they say, that Apple has formed its own speech recognition team and that a neural-net-boosted Siri is on the way.”

Read more

MindMeld Makes Context Count In Search

mmapiMindMeld – you may know the term best from StarTrek and those fun-loving Vulcan practices. But it lives too at Expect Labs, as an app that listens to and understands conversations and finds relevant information within them, and as an API that lets developers create apps that leverage contextually-driven search and discovery – and may even find the information users need before they explicitly look for it.

Anticipatory computing is the term Expect Labs uses for that. “This is truly a shift in the way that search occurs,” says director of research Marsal Gavaldà. “Anticipatory computing is the most general term in the sense that we have so much information about what users are doing online that we can create accurate models to predict what a user might need based on long-ranging history of that user profile, but also about the context.”

The more specific set of functionality that contributes to the overarching theme of anticipatory computing, he explains, “means that you can create intelligent assistants that have contextual search capabilities, because our API makes it very easy to provide a very continuous stream of updates about what a user is doing or where a user is.”

Read more

Plumbing The Depths Of Deep Learning

Image Courtesy: Flickr/ Hey Paul Studios

Image Courtesy: Flickr/ Hey Paul Studios

Will deep learning take us where we want to go? It’s one of the questions that Oxford University professor of Computational Linguistics Stephen Pulman will be delving into at this week’s Sentiment Analysis Symposium. There, he’ll be participating in a workshop session today on compositional sentiment analysis and giving a presentation tomorrow on bleeding-edge natural language processing.

“There is a lot of hype about deep learning, but it’s not a magic solution,” says Pulman. “I worry whenever there is hype about some technologies like this that it raises expectations to the point where people are bound to be disappointed.”

That’s not to imply, however, that important progress isn’t taking place when it comes to deep learning, which leverages machine learning methods based on learning representations with applications to everything from NLP to computer vision to speech recognition.

Read more

Nuance’s Voice Is Heard: Its Tech Featured In Samsung Galaxy Gear SmartWatch And Surfi AI Answer Engine

Nuance Communications is high-profile this week. The company has announced that Samsung’s new Galaxy Gear wearable smart watch and Galaxy Note 3 tablet will integrate its voice and language capabilities. Additionally, word comes from SpeechTrans that its new natural language processing application for Windows 8 and Windows RT, which – like Apple’s Siri – leverages Nuance’s speech recognition smarts, has been released.

Read more

Facebook Acquires Speech Recognition Vendor Mobile Technologies

Facebook is acquiring Mobile Technologies, according to Tom Stocky, Facebook director, product management.

In a post here, Stocky announced the deal that would bring to Facebook “a company with an amazing team that’s behind some of the world’s leading speech recognition and machine translation technology….

It has always been our mission to make the world more open and connected. Although more than a billion people around the world already use Facebook every month, we are always looking for ways to help connect the rest of the world as well. Voice technology has become an increasingly important way for people to navigate mobile devices and the web, and this technology will help us evolve our products to match that evolution. We believe this acquisition is an investment in our long-term product roadmap as we continue towards our company’s mission.”

Mobile Technologies is the developer of Jibbigo, what it credits as “the world’s first speech-to-speech translator on a phone that runs online and even off-line, independent from the Internet.” The program, which comes in both iOS and Android versions, works by having a user hold down the record button while saying a phrase, which then appears as text in both languages, and is spoken aloud in the target language, according to Wikipedia.

The company also says it developed and deployed the first automatic, simultaneous interpretation service for lectures for use in educational settings.

A post on the company’s website notes that the Facebook acquisition provides the opportunity to apply its “technology at a truly global scale,” and that it looks forward to “finding new and interesting ways to apply it to Facebook’s long-term product roadmap.”

Read more

New On The Speech Recognition Scene: Droids With NLP Processors And More

There are new Motorola Droid devices in town: The three Verizon Android 4.2 smartphones unveiled at a press event yesterday include the Motorola Droid Mini, Ultra and Maxx. The line includes what the company touts as the longest-lasting 4G LTE smartphone in the Maxx, with the vendor claiming 48 hours on a single charge, and what it says is the thinnest 4G LTE smartphone around in the Ultra. The smartphones reportedly all come with a unique Kevlar fiber 3D unibody design and a few months’ free Google Music All Access subscription, too. But what will catch the eyes of readers of this blog is the proprietary Motorola X8 Mobile Computing System that’s behind the sleek-looking handsets.

In addition to the graphics and application processor cores found within the eight-core System are two new low-power cores, one to power contextual computing and one aimed at natural language processing. Read more

Vestec Secures Equity Investment from Leading Middle Eastern Investor

Waterloo, ON, Canada (PRWEB) June 27, 2013 – Vestec, Inc. a leader in powerful Artificial Intelligence technologies, announced today that they have secured a significant equity investment from Prince Mishaal bin Abdullah bin Turki bin Abdul Aziz Al Saud. Prince Mishaal has also joined Vestec’s Advisory Board and will help guide Vestec’s growth strategy in the Middle East.

“Advances in Artificial Intelligence research are creating a powerful paradigm shift in speech technologies across different platforms and languages,” said Dr. Fakhri Karray, Vestec’s primary Founder and CEO. “We are looking forward to leveraging Prince Mishaal’s deep expertise and strong relationships to facilitate Vestec’s growth in the Middle East.” Read more

iPhone 4s or iPhone 5. Whatever it’s called, does it mark the return of Siri?

A few short years ago, a group of semantic technology companies rode a wave of venture capital and inflated expectation. They were going to change the world. They were going to bring semantic technologies to the mainstream. They were going to make people very rich. They were the must-have keynotes of the conference circuit. And then, one by one, they disappeared. Powerset vanished inside Microsoft, to do something for Bing. Twine vanished inside Evri, amid rumours of a fire sale and investors covering their backs. Freebase vanished inside Google, and bits of Freebase DNA routinely pop up across Google’s sprawling empire. And Siri vanished inside Apple, as we scrambled to understand whether the Cupertino money machine was after semantic smarts or ‘just’ speech recognition technology. Now, though, the rumours suggest that Siri may be back, and that it’s going to be the thing that makes the next iPhone a compelling buy. Read more

RAMP Gives Online Video Publishers a Semantic Closed Caption Platform

Last year President Obama signed the 21st Video Accessibility Act into law, “requiring broadcasters to include closed captions with Web video content.” RAMP, a leading content optimization platform has responded with MediaCloud, a platform that “provides online video publishers with highly accurate, timecoded transcripts that can be easily integrated into popular video content management systems.” Read more

Apple Snags Siri – All Things Digital (blog)


Geek.com
Apple Snags Siri
All Things Digital (blog)
It will do this via a combination of technologies, including speech recognition, natural language processing and semantic Web search. Get that?
What Apple’s Acquisition of Siri Means for the Future of Mobile Search Mashable (blog)
Apple Buys Virtual Personal Assistant Startup Siri TechCrunch (blog)
Apple Buys Siri, A Mobile Assistant App, As War With Google Heats Up Forbes (blog)

all 81 news articles »

NEXT PAGE >>