I'm sorry, Dave. I'm afraid I can do that: Microsoft unveils Custom Neural Voice – synthetic, but human-sounding speech • The Register Forums

This post has been deleted by its author

Friday 5th February 2021 16:38 GMT Anonymous Coward

@AC

Its been done already.

0 0 Reply

This post has been deleted by its author

Thursday 4th February 2021 18:54 GMT Dave 126

The magician (and technology first adopter, friend of Silicon Valley types like Gates and Jobs) Penn Juliette was presented with a magic trick where the contestant had trained an artificial voice on hundreds of hours of Penn Juliette speaking (from TV shows and podcasts). After performing the trick, the contestant told Penn that for erhical reasons he would delete the artificial voice - unless Penn would like a copy for himself.

Penn pointed out that he was the only person on Earth who had zero conceivable use for an artificial voice that sounded like Penn Juliette.

4 0 Reply

Thursday 4th February 2021 19:15 GMT chivo243

computer voices

Love them in movies, but dislike them in real life.

0 0 Reply

Thursday 4th February 2021 21:15 GMT Stuart Halliday

We know.

Of course we all know what it'll be used for....

That age old subject - pornography

Well, what other use has technology got?

1 0 Reply

Thursday 4th February 2021 22:11 GMT Anonymous Coward

synthetic, but human-sounding speech

Well, you wouldn't be able to pass it off as Stephen Hawking, but "human-sounding" is still a bit of a stretch if you ask me.

2 0 Reply

Thursday 4th February 2021 22:21 GMT John Brown (no body)

I see a use case

There is an enormous number of books out there in electronic text format and often quite expensive, if available in audio.

Personally, I use them on long drives, but there are a lot of visually impaired people out there who could benefit from improved "auto voicing" of text material. Some of the existing stuff is pretty good, but can not only get tedious when listening to longer texts such as novels, but can be quite disconcerting when they come across strange words, names or places that don't exist in dictionaries, eg SF and Fantasy.

I did once convert a series of 8 novels for a blind friend, manually finding all the highlighted "I don't know this word" markers and changing them by spelling and phonetics to pronounce them more realistically, adding them to the custom dictionary so at least I only had to do each new word once. It was ok, but a long chore and not ideal for long listening sessions. Things have moved on since then, but this sounds like a at least a small, possibly a large leap forward.

4 0 Reply

Thursday 4th February 2021 23:40 GMT Anonymous Coward

Re: I see a use case

Blind person here. It isn't that great a step forward for quality, at least it doesn't sound like it from the examples. I don't have any reason to believe this is better for pronunciation. Also, you can get used to anything if you use it all the time. My typical screenreading voice is very robotic. I chose it purely for pronunciation accuracy and fast reading speed.

The main reason this doesn't help is that the neural speech synthesis is only available as a cloud service. I can't install it for local speech, which means it's out for most of my reading. It only works if I want to send some text in, get an audio file back, and listen to that later. Given that I already have relatively high-quality speech software which does run locally, and I'm used to poor-quality speech which modern software far exceeds, I'm unlikely to run up a cloud account for this. For the many others with a visual disability, that won't even be an option if they want to because it requires use of the API, which is going to confuse most nontechnical people.

10 0 Reply
1. Friday 5th February 2021 17:05 GMT Anonymous Coward
  
  @Blind AC Re: I see a use case
  
  Uhm errr yes.
  
  Actually you could use it off the cloud.
  
  The issue is training the model and then using it.
  
  If you built out a device with a decent enough GPU or similar (Nvidia has some options)
  
  You could get it to work locally.
  
  The issue would be that you would routinely have to connect to the cloud to update as their models improve over time.
  
  The problem is that its the 'cloud' companies that make money from this. And also use your use of their model to help improve it.
  
  0 0 Reply
  1. Friday 5th February 2021 18:41 GMT doublelayer
    
    Re: @Blind AC I see a use case
    
    I'll be the first to admit I haven't read very much about this, but it doesn't look like that's an option. The pricing pages include the price for creating a model, storage of a model and running said model. I don't see anything about downloading the model, let alone downloading the engine that uses the model. All prices there are about sending text to the cloud, where they are converted to audio using the previously-trained model. If people want to run it locally, they'll need the synthesis software along with their created model. If that's actually available, I haven't found anything about it. I think the original contention about cloud-only may be correct.
    
    0 0 Reply

Friday 5th February 2021 11:00 GMT FelixReg

Language learning

They mention Duolingo as a user of the tech. Interesting.

Around 1990, I got close to nowhere trying to get a computer to speak natively in a foreign language with my own voice as I hear it in my own head.

So, yeah, this is cool!

0 0 Reply

Friday 5th February 2021 12:00 GMT Warm Braw

A natural-sounding voice that conveys friendliness, empathy, and professionalism...

... which will shortly be fronting up all manner of businesses that are hostile, careless and incompetent.

The technology is quite impressive (though did they really give human Zoe such downbeat material to read?), but I fear it would be more honest, and indeed more cost effective, for most "Customer Service" operations simply to have their CEO record the message "You're not getting your money back" directly to their premium rate answerphone.

1 0 Reply

Friday 5th February 2021 12:05 GMT fitzpat

These Microsoft voices can't be any worse than Heathrow Airport's slightly pissed off yet disinterested synthetic female announcer.

"Flight whatever is now boarding at gate 21, you might want to get on your plane at some point, meatbags"

2 0 Reply

Friday 5th February 2021 17:05 GMT Anonymous Coward

Deep Fakes

So basically you can take a montage of various actors and create a face and body that is unique but based on several people.

And you can create a natural sounding human voice which is also 100% artificial.

Now you can replace actors / actresses with completely computer generated scenes or overlay onto some stock footage.

Imagine the next X-men movie where its not a cartoon, but none of it is real.

The next stop after that would be to replace the infinite room of monkeys with an AI to recycle old movie classics.

Being the smart person I am, I'm going to start working on the AI talent agency. Real Humans need not apply.

1 0 Reply

Sunday 7th February 2021 15:10 GMT david1024

ME WANTS NOW!

I want the HAL-9000 voice on my echo.

0 0 Reply

Monday 8th February 2021 11:02 GMT TRT

I'm convinced...

That at least one of the Just Eat adverts has a VoiceOver that's computer generated. It sounds... creepy; wrong somehow.

Of course, if it happens to be a real human, someone who has managed to impersonate a creepy AI trying to imitate a realistic human voice, then hats off to that voice actor!

Topics

Special Features

Vendor Voice

Resources

COMMENTS

@AC

computer voices

We know.

synthetic, but human-sounding speech

I see a use case

Re: I see a use case

@Blind AC Re: I see a use case

Re: @Blind AC I see a use case

Language learning

A natural-sounding voice that conveys friendliness, empathy, and professionalism...

Deep Fakes

ME WANTS NOW!

I'm convinced...

I sometimes wonder...

POST COMMENT House rules

Enter your comment

Add an icon

Other stories you might like

Microsoft foresees a new type of AI PC: A Surface designed with help from machines

AI spam is winning the battle against search engine quality

Google Cloud chief is really psyched about this AI thing

What's up with AI lately? Let's start with soaring costs, public anger, regulations...

Microsoft rolls out safety tools for Azure AI. Hint: More models

Don't rent out that container ship yet: CIOs and biz buyers view AI PCs with some caution

Cloud Software Group and Microsoft pledge another eight years of co-opetition

Law prof predicts generative AI will die at the hands of watchdogs

AI PCs are here but a killer application for biz users? Nope

Psst, hey. It's the NSA. You want some AI security advice?

UK unions publish AI bill to protect workers from 'risks and harms' of tech

Devaluing content created by AI is lazy and ignores history

About Us

Our Websites

Your Privacy