Blog article
See all stories »

Voice Verification versus Voice Recognition

At the start of a new decade there still seems to be confusion between these two terms with many people using them interchangeably - so here’s the distinction set out plainly:

Voice recognition, also known as speech recognition, converts spoken words or phrases to text which is then processed by a computer to recognise what is being said. Unfortunately voice recognition has been around for some time and has a reputation for being unreliable; we have all been caught shouting a name at your phone or screen in frustration! Because the computer is focusing on a certain word or phrase its abilities are limited – nemeses include background noise and any variation in the user’s voice.

In comparison, voice verification is trained to a particular user, where it identifies the individual based on their unique voice biometrics, in a similar way to fingerprint matching and blood DNA. Impressionist Rory Bremner tried and failed to fool it! This technology is particularly valuable in verification of payment transactions to guard against fraud.  Using a combination of voice biometrics with transactional history, trends and patterns it creates a highly secure, unique authorisation environment.  This allows consumers to use their own unique voice to sign and authorise transactions, guaranteeing transactional security and protecting users against personal data compromise reducing the risk of fraud and identity theft.

To sum up: voice verification identifies who is speaking rather than what is being said

With consumers continuing to move to the medium of the internet for e-commerce and banking, and with world becoming increasingly mobile, voice recognition systems are an attractive alternative to entering passwords and details through a keyboard. Since voice verification systems only require a mobile phone and your voice it is the natural solution to the challenges of dealing with mobility, security, identity and of course payment over the next decade.

6479

Comments: (5)

Nick Collin
Nick Collin - Collin Consulting Ltd - London 05 January, 2010, 12:16Be the first to give this comment the thumbs up 0 likes

Yes, it's an important distinction.  I wrote a couple of white papers on this subject about 10 years ago, based on experience with technology developed by Stanford Research Institute (SRI) and commercialised by their spin-off, Nuance - see www.ncollin.demon.co.uk/Future/fin_futures.htm .

At the time, voice recognition was, if anything, more advanced than voice verification, so I'm surprised when you imply it's unreliable.  As far as I'm aware, when used with carefully designed dialogues, voice recognition is now increasingly deployed in a number of appropriate mainstream applications.

I've always thought that the holy grail would be a combination of the two technologies so that you could call up your bank or whatever, give a simple instruction ("this is Nick Collin - what's the balance on my current account?"), and be safely authenticated, all in one streamlined transaction.  Maybe we've reached that stage - any comments?

Nick Ogden
Nick Ogden - RTGS.global - London 15 January, 2010, 15:02Be the first to give this comment the thumbs up 0 likes

Thanks for your feedback Nick. Indeed, what you thought was going to happen 10 years ago with the evolution of the two technologies is happening now, and with additional capabilities.

Voice signatures are not only used for payments in our view, but also as identification and verification tokens. Our Voice Transact network is currently in the process of adding national Trust Centres to enable consumers to manage and control their own identity, security and payment verification with a number of businesses they chose to do business with. As technology evolves and we see the growing shift towards online and mobile, security and identity fraud is a significant issue. As a result, consumers not only require but are now demanding the right to control how, when and for what reason their personal financial data is used.

 

Stephen Wilson
Stephen Wilson - Lockstep Consulting - Sydney 05 February, 2010, 03:42Be the first to give this comment the thumbs up 0 likes

Sorry for the late comment on an oldish thread ...

Of all biometrics in banking, I do like voice the best, especially when the system allows for varying the challenge phrases, to thwart replay attack.

Yet there's a human dimension that strikes me as problematic. Personal electronic banking in the workplace is commonplace, and so it should be: it's better than having employees standing in queues on company time. 

But in open plan offices, how comfortable are people going to be using voice controlled phone banking?  Can eavesdroppers pick up enough cues, maybe from listening to multiple sessions, to mimic a customer?

Even if the systems' specificity can stop mimicry, I would expect that customers will feel exposed using voice biometric within ear-shot. 

I wonder what sort of advice are banks giving their customers about this? Trust us, the system is impregnable?  Or, cup your hand over the handset? ;-)

 

Nick Ogden
Nick Ogden - RTGS.global - London 08 February, 2010, 14:52Be the first to give this comment the thumbs up 0 likes

Hi Stephen, thanks for your comment. To answer your questions, it’s necessary to stress that we are talking about voice verification using biometrics and not voice recognition, so the system is distinguishing who is speaking rather than what is being said.  In terms of mimicry, impressionist Rory Bremner tried and failed to fool it in a three-hour test session that was conducted in 2007. In a similar way to how fingerprint matching and blood DNA works, voice biometrics are trained to a particular user based on their unique voice biometrics so it’s level of security speaks for itself.

In regards to whether people are going to feel comfortable using voice controlled phone banking in the workplace shouldn’t present an issue since they will only be required to say authentication phrases such as ‘voice signature’ or ‘Stephen Wilson’ to verify themselves. They won’t ever be asked ‘expose’ personal information out loud as this will all ready be stored in the trust centre.

 

Stephen Wilson
Stephen Wilson - Lockstep Consulting - Sydney 09 February, 2010, 06:01Be the first to give this comment the thumbs up 0 likes

Thanks Nick.

I was being a bit facetious, and as I said, I do reckon that voice is the most promising biometric in banking and finance. 

Nevertheless, I expect there to be some resistance to using voice biometrics in public places. If speaking a challenge phrase like your name takes over from entering a PIN (and that's the whole idea!) then people may worry that being overheard (or recorded) will compromise their security. If today they protect their PINs, then tomorrow they may expect to have to protect their voice 'signature' somehow.

So what kinds of standard advice do you find it necessary to give to users about voice authentication?  Do you coach people on the need to speak clearly, how to choose a challenge phrase, the effects of a cold, effects of telephone line quality or speaker phones etc.?

Cheers,

Stephen Wilson.

 

Nick Ogden

Nick Ogden

Founder and Director

RTGS.global

Member since

17 Sep 2008

Location

London

Blog posts

47

Comments

59

This post is from a series of posts in the group:

Innovation in Financial Services

A discussion of trends in innovation management within financial institutions, and the key processes, technology and cultural shifts driving innovation.


See all

Now hiring