Microsoft demonstrates “breakthrough” speech translation technology

November 9, 2012

Microsoft has posted a remarkable video demonstrating speech-to-speech language translation where spoken English is translated directly into Chinese and read aloud by machine in near real-time.

Microsoft’s Chief Research Officer Rick Rashid demonstrated the “breakthrough” speech translation technology on stage last month at Microsoft Research Asia’s 21t Century Computing event in Tianjin, China. Rashid also shares some background information on the history of automated speech translation in this guest post.

Rashid describes the advances made in speech technology to date, from simple waveform pattern matching first developed almost 60 years ago, to a technique known as hidden Markov developed in the late 1970’s.

The technology demonstrated by Rashid uses a technique called Deep Neural Networks, developed by Microsoft Research and the University of Toronto two years ago. Deep Neural Networks is a technique “patterned after human brain behaviour” and provides the ability to “train more discriminative and better speech recognisers” than older methods.

“We have been able to reduce the word error rate for speech by over 30% compared to previous methods. This means that rather than having one word in 4 or 5 incorrect, now the error rate is one word in 7 or 8. While still far from perfect, this is the most dramatic change in accuracy since the introduction of hidden Markov modeling in 1979, and as we add more data to the training we believe that we will get even better results.”

What’s even more interesting is that the Chinese language translation demonstrated mimics Rashid’s own voice, made possible by recording an hour’s worth of English speech data from Rashid prior to the presentation.

As Rashid quips, Star Trek’s universal translator is a reality now more so than ever.

Albizu Garcia

Albizu Garcia is the Co-Founder and CEO of Gain -- a marketing technology company that automates the social media and content publishing workflow for agencies and social media managers, their clients and anyone working in teams.
VIEW ALL POSTS

< Next Post

Carve Cases secures further investment for growth as organic as its product

Previous Post >

So, Google’s Play Store confused its own Nexus 10 with Apple’s iPad

Technology

This AI Copilot Doesn’t Wait for Prompts — It Thinks Like a Hacker

Hey HackerNoon, it’s Kuwguap again. A while back, I wrote about building RAWPA, my AI copilot...

June 27, 2025 HackerNoon

Government and Policy Technology

People who grow up around robots may develop feelings for them, lose ability to socialize: WEF ‘Summer Davos’

Will we merge ourselves so intimately with technology that it becomes so much a part of us that we...

June 27, 2025 Tim Hinchliffe

Business Technology

Check out the cool new pet-tech at Leap Venture Studio’s 9th Cohort Demo Day

Pet lovers are increasingly turning into tech lovers as well as the pet care world gets...

May 16, 2025 Sociable Team

Sociable's Podcast

Brains Byte Back

Brains Byte Back interviews startups, entrepreneurs, and industry leaders that tap into how our brains work. We explore how knowledge & technology intersect to build a better, more sustainable future for humanity. If you’re interested in ideas that push the needle, and future-proofing yourself for the new information age, join us every Friday. Brains Byte Back guests include founders, CEOs, and other influential individuals making a big difference in society, with past guest speakers such as New York Times journalists, MIT Professors, and C-suite executives of Fortune 500 companies.

It’s predicted that AI could replace half of all entry-level white-collar jobs in the next five years, especially in the U.S., where fewer regulations and bigger investments are speeding things up. Routine tasks like document review and data entry could is already be being picked up by AI, so what does that mean for the future of entry-level work? Redefine it or eliminate it?

Leslie Thomas, Chief Psychometric Officer at Kryterion, breaks down what this means for your career and how certification is evolving to keep up, including avoiding cheating. She explains how her team works with companies to define what people actually need to know in an AI-powered workplace. She offers a valuable method in terms of defining where your job will fall in line in the world of AI.

You'll learn how Kryterion is using AI to build smarter assessments, why soft skills like creativity and adaptability matter more than ever, and how to figure out which parts of your job are at risk.

If you're asking what to learn next or how to stay relevant, this episode gives you a great place to start.Find out more about Leslie Thomas here.

Access her ebook here.

Learn more about Kryterion here.

Reach out to today's host, Erick Espinosa – [email protected]

Get the latest on tech news – https://sociable.co/

Leave an iTunes review – https://rb.gy/ampk26