Google has developed AI voice that sounds indistinguishable from the voice of a real human

9comments

Google has pioneered a brand new text-to-speech system that it calls Tacotron 2 and it works with stunning accuracy, delivering voice narrations that are indistinguishable from the voice of a real human. This is not an exaggeration: Tacotron 2 is the second generation of the technology and it consists of two deep neural networks, one that converts the text into a special spectogram (like the one you see in the picture above), and the second one, the WaveNet, that reads this chart and interprets it into a real voice.

The system is currently only trained to work in English with the one female voice that you can hear below. It can not only read, but it will also be able to tell nuance, and if a certain word is highlighted in all caps, it will add an accent to that word. It is also able to deal with a small amount of typing errors.

Here are a few examples, showing the Tacotron 2 in action:

“That girl did a video about Star Wars lipstick.”



Recommended For You

We find it hard to tell which one is narrated by a real person, and which one is the computer generated voice (it's the second one).

Here are a couple more examples showing the capabilities of the system. Note that all of the below phrases are unseen by Tacotron 2 during training.

Tacotron 2 text-to-speech system in action


Complex words in sentences: "Generative adversarial network or variational auto-encoder."


"Basilar membrane and otolaryngology are not auto-correlations."


Tacotron 2 knows the right pronunciation depending on semantics: "He has read the whole thing."


"He reads books."


"Don't desert me here in the desert!"


"He thought it was time to present the present."


It can deal with typing errors: "Thisss isrealy awhsome."


It changes prosody with punctuation, notice the comma: "This is your personal assistant, Google Home."


"This is your personal assistant Google Home."


It can adapt to stress with intonation: "The buses aren't the problem, they actually provide a solution."


"The buses aren't the PROBLEM, they actually provide a SOLUTION."

And it even handles tongue-twisters"Peter Piper picked a peck of pickled peppers. How many pickled peppers did Peter Piper pick?"


"She sells sea-shells on the sea-shore. The shells she sells are sea-shells I'm sure."


What is most impressive about the Tacotron 2 system is that it is not just some sort of technology that will stay in the lab. Google is already using the WaveNet network to generate the more realistic voice in Google Assistant. Once the Tacotron 2 is polished, it will roll out to systems like the Assistant.

Grab the Pixel 10 at Mint Mobile for $450 off

$349
$799
$450 off (56%)
Mint Mobile now sells the Google Pixel 10 with a massive $450 discount. The promo is available on select color variants with 128GB of storage. You also get a 12-month unlimited data plan for $180 instead of $360.
Buy at Mint Mobile

Pixel 10 Pro: now $475 off at Mint

$524
$999
$475 off (48%)
Grab the pro-grade, compact Pixel 10 Pro at Mint Mobile with a 12-month unlimited plan, and you can save a huge $475. The data plan comes with a discount, too: 50% off, to be exact.
Buy at Mint Mobile

The Pixel 10 Pro XL is $700 off at Mint right now

$499
$1199
$700 off (58%)
The high-end Gemini AI-enhanced Pixel 10 Pro XL is now available with a mind-blowing discount. You can now save $700 on the phone, plus 50% off unlimited 12-month plans.
Buy at Mint Mobile

The Pixel 10 Pro Fold is now $400 off

$1399
$1799
$400 off (22%)
The foldable Pixel 10 Pro Fold is another standout holiday offer. Right now, you can get the device for $400 off at Mint Mobile. On top of that, you save $180 on 12-month unlimited data plans.
Buy at Mint Mobile
Google News Follow
Follow us on Google News
COMMENTS (9)

Latest Discussions

by 30zpark • 3

Recommended For You

FCC OKs Cingular\'s purchase of AT&T Wireless