Home / Resources / Blog / Tacotron 2 to Create Ripples in AI-based Text-to-Speech Translation

Tacotron 2 to Create Ripples in AI-based Text-to-Speech Translation

Google has launched an AI-powered speech synthesis system named Tacotron 2, poised to set a major breakthrough with its human-like articulation ability. Reports from tech analysts state that the new text-to-speech system delivers an AI-generated computer speech, which cannot be easily distinguished from human voice. Google’s AI researchers quote that their model has achieved a MOS (Mean Opinion Score) of 4.53, in comparison to a MOS of 4.58 for professionally recorded speech. The tech giant’s vision shift from “mobile-first” to “AI-first”, announced during the Google I/O 2017 developers conference by Sundar Pichai, is bearing more fruits. Several AI products were launched last year, including Google Lens, Smart Reply for Gmail and Google Assistant for iPhone. Tacotron 2 is the latest addition to this list.

How it Works? The system first creates a spectrogram of the text, which contains a visual representation of how the speech should sound. This image is then fed into Google’s WaveNet algorithm, which brings AI skills closer, in order to mimic human speech. The algorithm has the ability to easily learn different voices and can even generate artificial breaths.

Looking at the capabilities, Tacotron 2 can detect the context and differentiate between two identically-spelled words. For example, it can distinguish between the noun “desert” and the verb “desert” and alter the pronunciation accordingly. Context-driven pronunciation is the highlight of Tacotron 2. The system can understand the sentence type (such as a statement or a question) and adjust the pitch and modulation of the sentence while speaking.

With Tacotron 2, Google is taking one more step towards realizing its “AI-first” dream. In the coming days, we can expect more brilliant AI products from the tech master.

Zerone develops bespoke software solutions carefully customized for the needs of our clients. Contact an expert today

Want to discuss your project?
We can help!

Name*
Business Email*
Phone*

Related blogs

Revitalizing Legacy Software: Embracing The Ai Revolution Beyond New Applications

#Artificialintelligence

Re-imagining Outsourcing In The Post-pandemic Era

#Artificialintelligence

Covid-19 Effect: An Ai Tool To Ensure Social Distancing In Office Premises

#Artificialintelligence

Want to discuss your project? We can help!

Enter your details to download the blog

Name*
Business Email*
Phone*

Never Miss a Beat

Join our LinkedIn community for the latest industry trends, expert insights, job opportunities, and more!

Tacotron 2 to Create Ripples in AI-based Text-to-Speech Translation

Revitalizing Legacy Software: Embracing The Ai Revolution Beyond New Applications

Re-imagining Outsourcing In The Post-pandemic Era

Covid-19 Effect: An Ai Tool To Ensure Social Distancing In Office Premises

We’re glad you’re here. Tell us a little about your requirement.

Thank you!