U.S. patent application number 09/853017 was filed with the patent office on 2004-11-04 for sonic/ultrasonic authentication device.
This patent application is currently assigned to Comsense Technologies Ltd.. Invention is credited to Altman, Nathan, Atsmon, Alon, Dvash, Nir, Sege, Alan, Tamir, Asaf.
Application Number | 20040220807 09/853017 |
Document ID | / |
Family ID | 33419034 |
Filed Date | 2004-11-04 |
United States Patent
Application |
20040220807 |
Kind Code |
A9 |
Tamir, Asaf ; et
al. |
November 4, 2004 |
Sonic/ultrasonic authentication device
Abstract
A method for verifying and identifying users, and for verifying
users' identity, by means of an authentication device capable of
transmitting, receiving and recording audio or ultrasonic signals,
and capable of converting the signals into digital data, and
performing digital signal processing. Voice pattern(s) and user(s)
information of one or more authorized user(s) are recorded and
stored on the authentication device. User(s) identification is
verified by inputting to the authentication device a vocal
identification signal from a user, and comparing the voice pattern
of the vocal identification signal with the recorded voice
pattern(s) of the authorized user(s), and if a match is detected
issuing an indication that the user is identified as an authorized
user.
Inventors: |
Tamir, Asaf; (Ramal Gan,
IL) ; Sege, Alan; (Santa Monica, CA) ; Dvash,
Nir; (Petach Tikva, IL) ; Altman, Nathan; (Tel
Aviv, IL) ; Atsmon, Alon; (Yehud, IL) |
Correspondence
Address: |
RICHARD F. JAWORSKI
Cooper & Dunham LLP
1185 Avenue of the Americas
New York
NY
10036
US
|
Assignee: |
Comsense Technologies Ltd.
|
Prior
Publication: |
|
Document Identifier |
Publication Date |
|
US 0169608 A1 |
November 14, 2002 |
|
|
Family ID: |
33419034 |
Appl. No.: |
09/853017 |
Filed: |
May 10, 2001 |
Related U.S. Patent Documents
|
|
|
|
|
|
Application
Number |
Filing Date |
Patent Number |
|
|
09853017 |
May 10, 2001 |
|
|
|
09806789 |
Jan 22, 2002 |
|
|
|
09806789 |
Jan 22, 2002 |
|
|
|
PCT/IL99/00525 |
Oct 4, 1999 |
|
|
|
Current U.S.
Class: |
704/246 |
Current CPC
Class: |
B32B 2305/342 20130101;
B32B 2425/00 20130101; G06Q 20/12 20130101; G06Q 20/40145 20130101;
G06K 19/07745 20130101; G07F 7/1008 20130101; G06Q 20/4014
20130101; G06K 19/077 20130101; H01L 2924/0002 20130101; B32B
37/182 20130101; B32B 2309/68 20130101; G06Q 20/04 20130101; G06Q
20/3823 20130101; G06Q 20/341 20130101; H01L 2924/00 20130101; H01L
2924/0002 20130101 |
Class at
Publication: |
704/246 |
International
Class: |
G10L 017/00; G10L
015/00 |
Claims
1. A method for verifying and identifying users, and for verifying
users' identity, by means of an authentication device capable of
transmitting, receiving and recording audio and ultrasonic signals,
and capable of converting said signals into digital data, and
performing digital signal processing, the method comprising: a)
recording on said authentication device voice pattern(s) of one or
more authorized user(s); b) storing on said authentication device
user information providing identifying details of said authorized
user(s); c) inputting to said authentication device a vocal
identification signal from a user; and d) comparing the voice
pattern of said vocal identification signal with the recorded voice
pattern(s) of said authorized user(s), and if a match is detected
issuing an indication that said user is identified as an authorized
user.
2. A method according to claim 1, wherein a predefined pattern of
audio and/or ultrasonic signals is transmitted by said
authentication device, whenever a match of voice patterns is
detected.
3. a method according to claim 1, wherein the authentication device
emits a predefined pattern of light signals from a light-emitting
device, to indicate a match of voice pattern.
4. A method according to claim 1, wherein the authentication device
is a credit card comprising a magnetic strip and/or a smart chip,
the method further comprising the following steps: a) inputting to
said authentication device a vocal identification signal from a
user; and b) comparing the voice pattern of said vocal
identification signal with the recorded voice pattern(s) of said
authorized user(s), and if a match is detected activating said
magnetic strip and/or a smart chip and allowing a magnetic card
reader to read the card information therefrom.
5. A method according to claim 1, wherein the authentication device
is utilized to permit the access of user(s) to a computerized
system, the method consisting of: a) providing a computerized
system comprising: a.1) an audio signal input device capable of
receiving ultrasonic signals; a.2) a sound processing device
suitable to receive inputs from said audio signal input device, and
to receive audio and ultrasonic input signals, and capable of
converting said signals into digital data; a.3) a data base
containing of voice patterns of authorized users; a.4) an
application capable of receiving digital data inputs from said
sound processing device, of activating other applications, and of
transmitting digital data over network links; b) transmitting from
said authentication device an ultrasonic signal comprising the
recorded voice pattern; c) receiving said ultra sonic signal by
said audio signal input device; d) processing said ultrasonic
signal and extracting said recorded voice pattern therefrom; and e)
comparing said recorded voice pattern with the voice patterns
stored in said database of authorized users, and if a match is
detected enabling access to said computerized system.
6. A method according to claim 1, wherein the authentication device
is utilized to permit the access of user(s) to a computerized
system, the method consisting of: a) inputting a vocal
identification signal of a user to an authentication device; b)
transmitting from said authentication device an ultra sonic signal
comprising said vocal identification signal and user information
stored on said authentication device; c) receiving said ultrasonic
signal by said audio signal input device, and processing said
ultrasonic signal to extract therefrom said vocal identification
signal and said user information; d) fetching from a database the
voice pattern of the authorized user associated with said user
information, and e) comparing the fetched voice pattern to the
transmitted voice pattern and; e.1) if a match is detected enabling
access to said computerized system; e.2) if a match is not
detected, disabling the access to said computerized system.
7. a method according to claim 6, wherein the user verification is
performed at a remote server connected to a computer network and/or
the Internet, comprising: a) inputting a vocal identification
signal of a user to an authentication device; b) transmitting from
said authentication device an ultra sonic signal comprising said
vocal identification signal and user information stored on said
authentication device; c) receiving said ultrasonic signal by an
audio signal input device, and processing said ultrasonic signal to
extract said vocal identification signal and said user information;
d) transmitting said vocal identification signal and said user
information to said remote server, via said computer network and/or
the Internet over a secure link; e) receiving said vocal
identification signal and said user information by said remote
server; f) fetching from a database the voice pattern of the
authorized user associated with said user information; and g)
comparing the fetched voice pattern with the transmitted voice
pattern, and; g.1) if a match is detected, enabling access to said
remote server; g.2) if a match is not detected, disabling the
access to said remote server.
8. A method according to claim 1, wherein the authentication device
is utilized to permit the access of user(s) to a computerized
system, the method consisting of: a) inputting a vocal
identification signal of a user to an authentication device; b)
verifying the user identity, on said authentication device, by
performing the following steps: b.1) processing said vocal
identification signal to obtain the user's voice pattern; b.2)
comparing said voice pattern to the voice pattern stored on said
authentication device, and transmitting an ultrasonic signal
comprising a match or mismatch indication, and the user
information; b.3) receiving said ultrasonic signal by an audio
signal input device, and processing said ultrasonic signal to
extract said match or mismatch indication and said user
information; and b.4) enabling access to said computerized system
whenever a match indication is extracted from said ultrasonic
signal.
9. A method according to any one of claims 5 to 8, wherein the
vocal identification signal, and the user information, are
converted into digital data and modulated into an ultrasonic signal
utilizing Frequency Shift Keying techniques.
10. A method according to any one of claims 5 to 9, wherein the
audio signal input is received through telephony infrastructures,
thereby allowing the identification of users through said telephony
infrastructures.
11. A method according to claim 10, further comprising an
Interactive Voice Response device/application for allowing access
of authorized users to personal information, and/or manipulating
said information.
12. A method according to claim 1, wherein voice recognition is
utilized for the verification of authorized users, comprising a
verification procedure in which the pronunciation of a predefined
word or phrase is checked.
13. A method according to claim 1, wherein voice recognition is
utilized to input into the authentication device vocal instructions
received from the user, comprising: a) playing a vocal menu, from
the authentication device, where said vocal menu comprises an
ordered list of possible options; b) inputting a vocal signal
comprising the options selected by the user to said authentication
device; and c) performing the task(s) associated with the selected
option(s).
14. A method according to claim 13, further comprising carrying out
arithmetic calculation in combination with the vocal menu, by
performing the following steps: a) playing a vocal menu consisting
one or more arithmetic operations; b) vocally selecting a desired
arithmetic operation; c) vocally inputting the numeric value(s) on
which said arithmetic operation should be performed; d) calculating
the result of said arithmetic operation; and e) vocally outputting
said result.
15. A method according to claim 14, further comprising calculating
the extra payments to be paid in addition to a basic payment for a
service, by activating an extra payment calculation function,
vocally inputting the basic payment sum, calculating the extra
payment to be paid according to said sum, and vocally outputting
the extra payment calculation result.
16. A method according to claim 1, wherein voice recognition is
utilized to input vocal instructions received from the user, to the
authentication device, comprising instructions for launching a
selected application, or for performing selected tasks on a
computerized system, comprising: a) inputting to said
authentication device an audio signal, received from the user,
comprising instruction to carry out a desired task; b) performing
voice recognition procedures to recognize said desired task, spoken
by the user; c) transmitting an ultrasonic signal comprising
instructions for carrying out said desired task to the computerized
system, d) receiving said ultrasonic signal by said audio signal
input device, and processing said ultrasonic signal to extract said
instructions; and e) performing said instructions.
17. An authentication device capable of transmitting, receiving and
recording audio and ultrasonic signals, and capable of converting
said signal into digital data, and performing digital signal
processing, said authentication device comprising: a) an input
device capable of receiving audio and ultrasonic input signals and
of outputting an analog electric signal; b) an analog-to-digital
converter suitable to receive analog electric signals from said
input device, and to output equivalent digital signals; c) a memory
device for storing data; d) a press button for activating the
device operation; e) a processing unit suitable to receive inputs
from said press button, analog-to-digital converter, and to input
and output digital data from/to said memory device; f)
digital-to-analog converter suitable to receive digital signals
from said processing unit, and to output equivalent analog signals;
and g) an output device capable of receiving analog electric
signals and of transmitting audio and ultrasonic input signals,
that receives analog signals from said digital to analog
converter.
18. An authentication device according to claim 17, comprising a
light-emitting device in which a pattern of light pulses is issued
by the processing unit to indicate a match.
19. An authentication device according to claim 17, comprising a
magnetic strip enabling said authentication device to carry out
financial transactions, in which said magnetic strip is activated
by said processing unit whenever a match of the voice pattern is
achieved.
20. An apparatus comprising a device according to claim 17, for
permitting or denying access to a computerized system, comprising:
a) a computerized system comprising: a.1) a sound processing device
for receiving audio and ultrasonic signals, and for converting said
signals into digital signals, and for receiving digital signals and
outputting audio and ultrasonic signals; a.2) an input device for
inputting audio and ultrasonic signals and for outputting their
equivalent analog electric signals; a.3) means for connecting the
output of said input device to said sound processing device; a.4)
software means for processing digital signals; and a.5) a database
of voice patterns of authorized users.
21. An apparatus according to claim 20, in which the input device
is connected to telephony infrastructures, for inputting audio
signals over telephone lines.
22. A credit-card sized apparatus capable of receiving and
processing audio signals, comprising: a) a power source; b) an
input device capable of receiving human voice inputs; and c) a data
processing device capable of processing said human voice
inputs.
23. An apparatus according to claim 22, further comprising a device
for outputting human-audible sounds.
Description
FIELD OF THE INVENTION
[0001] The present invention relates to the fields of voice
recognition and voice verification. More particularly, the
invention relates to a method and apparatus for verifying and
identifying authorized users, for discriminating between them and
unauthorized users, and for using voice input and output to control
and activate user devices.
BACKGROUND OF THE INVENTION
[0002] Photographs, signatures, and identification numbers are
commonly used to identify and verify the identity of individuals.
However, these common identifying means are very easy to
counterfeit, and in fact, the vast increase of business and
financial transactions carried out over electronic means, such as
the Internet and data communication, resulted in greater
vulnerability of merchants, and end users to fraud and
counterfeit.
[0003] Many of the modern ways of doing business enjoy the
availability and speed of electronic communications, but do not
require the presence of the entities involved at the time and place
of transaction. It is therefore very easy for a malicious user to
pretend to be someone else, and to carry out transactions in his
name, once he gains access to some of his identifying details such
as passwords, ID number, credit card numbers, etc. Although using
passwords increase the security of such methods of doing business,
and security layers (e.g. SSL) which utilize encryption provide
more protection against eavesdropping, the main problem of
identification and verification still remain.
[0004] Generally, the strength of a security system is measured by
the number and diversity of the security "factors" that are
required. Three such types of security factors are commonly known
as "something you know" (e.g. a password); "something you have"
(e.g. a magnetic card, smart card); and "something you are" (e.g.
biometric measures such as voice.) For example, a password by
itself may not provide complete security, because a holder may
inadvertently disclose it, or the password can be guessed.
Moreover, a magnetic card can be lost or stolen. Security is
substantially enhanced when a password is used as one factor, and
the authenticated presence of an identification card (e.g.,
magnetic card) is used as another factor.
[0005] The Internet and the World Wide Web (WWW) contain a variety
of different sites for e-commerce and online services. In addition
to news and information, merchants and consumers alike have come to
view the web as a virtually unlimited source for conducting
business, in the form of sales of products, services, and
information. Nevertheless, many computer users are still somewhat
wary of conducting sales transactions over the web especially
because credit cards are involved, along with the associated fear
of widespread and unchecked dissemination of the credit card
numbers. These same users may also be worried about conducting
non-Internet related sales transactions using credit cards. After
all, anyone can swipe a credit card number and use it later for
unauthorized purchases. Assurances from well-meaning web (and
non-web) merchants do not alleviate these concerns, because the
user has to carry a credit card in his person, which can be easily
stolen or lost (and thus found by a thief). Moreover, the custom
methods of conducting e-commerce utilized today are unsafe with
respect to the relative ease in which sensitive users' information
can be burglarized. Since the authentication is usually performed
by a server connected to the a computer network and/or the
Internet, sensitive user information, such as identification and/or
credit card numbers, is vulnerable to hackers attacks and
eavesdropping.
[0006] What is needed, therefore, is a secure purchasing mechanism
that provides users with the peace of mind to make a purchase on
the web (or any other form of electronic purchase or other secure
information access) without the fear of having his credit card
number stolen by someone listening-in at any point in the
transaction.
[0007] Typically, systems that provide security to such online
users do so to the detriment of user convenience. Because the
Internet is a mass-medium, the success of online services depends
largely on preserving user convenience. Without such convenience,
users would become decreasingly tolerant resulting in a loss of
business and social opportunities for both web merchants and
users.
[0008] Traditional microprocessor credit cards, also known as
"smart cards" or "chip cards," provide a good degree of security.
But while smart cards that work in conjunction with dedicated smart
card readers have become prevalent (i.e., bank cards in Europe),
they are ill-suited for deployment as widespread Internet access
control devices. Deploying smart card readers to users in their
homes, for example, and educating users on their installation and
use, is a cumbersome, expensive, and inconvenient process.
[0009] The prior art smart card systems were largely developed for
a disconnected world; that is, maintaining a constant connection
between the smart card and some other entity was considered
expensive, inconvenient, and largely unnecessary. Indeed, the
telephone charge itself was significant enough in cost to warrant
minimizing such connection times.
[0010] One of the weaknesses of digital devices is their ease of
duplication. Since digital devices embody binary information, a
duplicated digital device can produce the same output as the
original one. In order to overcome the duplication problem, the
operation of the electric card can be made conditional on by the
provision of a password, such as the password used in mobile
phones. However, this solution is inconvenient, since the user has
to type such password. Moreover, a password-protected device should
comprise a set of keys, in order to enable to type a password.
Passwords also must be remembered by the user, and hence they are
inconvenient, and not secure, since a user may write them down
instead of remembering them.
[0011] Voice verification (speaker verification) methods improve
the identification capabilities and provide convenient
implementations for the end user. In this method, the voice pattern
of the users is utilized to identify him and to allow access only
to authorized users. Voice recognition is sometimes utilized in
conjunction with voice verification, or even as a stand-alone
implementation, to determine if a predefined word or phrase
(password) is vocally pronounced by an authorized user.
[0012] Usually, in voice verification applications the identity of
individuals is verified by comparing their voice pattern, that is
obtained when a predefined word or phrase is pronounced (password),
with users' voice pattern stored in the authenticating system
database. For such verification, a predefined word or phrase should
be agreed on, and the voice pattern of the individual, to which the
word or phrase is assigned, is processed and stored. In this
verification scheme two of the factors that were discussed above
are utilized, i.e., "something you know" and "something you are".
However, when attempting to identify an individual, his voice
pattern should be compared against the entire database of
authorized individuals' patterns, which is a long process, and thus
not always acceptable for electronic commerce and financial
transactions.
[0013] In U.S. Pat. No. 4,961,229, a biometric method for
authentication/verification of individuals is disclosed, where the
voice patterns of the authorized individuals are utilized for
authentication. This method combines all of the security factors
that were discussed before; "something you know", "'something you
have", and "something you are". More particularly, the identity of
an individual is authenticated by utilizing an additional
identification device (magnetic card for instance) "something you
have", comprising some identifying information of the user (names,
identification numbers, etc.), and user's voice pattern. The
verification process is initiated by enabling the verification
system to access the identification device and interrogate the
user's information, and voice pattern, stored on it. Once the
details of that person are retrieved by the authenticating system,
his voice pattern is compared with the voice pattern stored on his
identification device. In this way the identification process is
substantially shortened, since the pattern recognition test is
performed with only one pattern.
[0014] To verify the presence of a particular individual, the voice
pattern stored on the smart card is compared to his voice at the
time of authentication. This type of security is robust against
forgery, but requires dedicated hardware to input and process vocal
signals, and the direct access to the information stored on a
smart-card, which should be in the possession of each authorized
entity, and which should be somehow connected to the authenticating
system. Therefore, it is not adequate for authenticating individual
users over the Internet or other electronic communication systems,
since it requires smart card readers to be at the possession of the
end-user (microphone & smart-card reader). In a similar
fashion, WO 99/22362 describes a method for the identification of
authorized users, by utilizing a mobile device on which users'
identifying information, and a biometric voice pattern, are
stored.
[0015] It should be understood that all the methods described above
are performed online. Furthermore, the utilization of card readers,
microphones, and other fixed hardware, eventuate in a fixed
authentication system. In other words, none of those methods enable
efficient realization of a mobile authentication system. As will be
appreciated, the efficiency and satisfaction of transactions, such
as e-commerce, will be substantially increased if the identity of
the user can be verified before he interacts with the system. In
this way, the time required for such transaction may be
substantially reduced.
[0016] An additional limitation of those prior art systems that
utilize smart cards and voice verification through software means
on a general purpose computer that is subject to tampering, and is
not portable to devices not possessing that software. The foregoing
problems can be resolved by a voice verification apparatus that is
embedded in the smart card itself, which is mobile, and not readily
reprogrammable or addressable by interlopers.
[0017] Voice recognition is used to recognize vocally pronounced
words, in a predefined language, or alternatively, in any language.
There are some similarities between voice recognition and voice
verification; while in voice verification the voice pattern of an
individual is detected, in voice recognition the pattern of
specific words is detected. Interactive Voice Response (IVR)
utilities enable end-users to select a desired operations/option
from a list of possibilities, by vocally pronouncing their
selection. However, there is no operating application known in the
art, in which a mobile device enables end users to vocal
interaction with such systems.
[0018] All the methods described above have not yet provided
convenient means for the vocal interaction of users with
computerized systems, and satisfactory solutions to the problem of
efficiently authenticating and verifying identity of individuals
over data communication systems and computer networks, such as the
Internet.
[0019] It is an object of the present invention to provide a method
and apparatus for the fast and efficient identification of
individuals, based on the identification of voice patterns.
[0020] It is another object of the present invention to provide a
method and apparatus for enabling end users to vocally interact
with user devices, and to carry out instructions received in the
form of speech.
[0021] It is a further object of the present invention to provide a
method and apparatus for carrying out the offline identification of
individuals, where the identity of a user is verified before he
interacts with, or gain access to, the system.
[0022] It is a still another object of the present invention to
provide a method and apparatus for the vocal identification of
individuals where the identification is performed by a mobile
device carried by the user.
[0023] It is a yet another object of the present invention to
provide a method and apparatus for converting vocal information
into textual and/or digital data.
[0024] It is a still further object of the present invention to
provide a method and apparatus for mobile IVR applications wherein
users are presented with voice prompts (menus and/or instructions),
and react, vocally, accordingly.
[0025] Other objects and advantages of the invention will become
apparent as the description proceeds.
SUMMARY OF THE INVENTION
[0026] The following terms are defined as follows:
[0027] Smart card--a smart card is a card of the approximate
dimensions of a credit card, with thicknesses that can vary,
possessing an embedded microchip that can be loaded with data or
computer programs. Under the present invention, the term "smart
card" can be used interchangeably to refer to any physical token
used for identification or information exchange purposes.
[0028] Voice recognition--the conversion of spoken words into
digital data, by matching digitized speech signal against a
database of waveforms, and converting matching speech signals into
digital or textual data.
[0029] Voice pattern--vocal biometric enabling biological
identification of individuals.
[0030] Voice verification--identification of individuals based on
the recognition of their voice pattern.
[0031] IVR--Interactive Voice Response applications, enabling vocal
interaction utilizing voice recognition methods (e.g., computerized
operators of telephone services).
[0032] In one aspect, the invention is directed to a method for
verifying and identifying users, and for verifying users' identity,
by means of an authentication device capable of transmitting,
receiving and recording audio and ultrasonic signals, and capable
of converting the signals into digital data, and to perform digital
signal processing, the method comprise recording on the
authentication device voice pattern(s) of one or more authorized
user(s), storing on the authentication device user information
providing identifying details of the authorized user(s), inputting
to the authentication device a vocal identification signal from a
user, and comparing the voice pattern of the vocal identification
signal with the recorded voice pattern(s) of the authorized
user(s), and if a match is detected issuing an indication that the
user is identified as an authorized user.
[0033] The method may comprise transmission of a predefined pattern
of audio and/or ultrasonic signals, by the authentication device,
whenever a match of voice patterns is detected. Alternatively, the
method may comprise emitting a predefined pattern of light signals
from a light-emitting device, to indicate a match of voice
pattern.
[0034] According to one preferred embodiment of the present
invention, the authentication device is a credit card sized device
comprising a magnetic strip and/or a smart chip, wherein
authentication is performed by inputting to the authentication
device a vocal identification signal from a user, and comparing the
voice pattern of the vocal identification signal with the recorded
voice pattern(s) of the authorized user(s), and if a match is
detected activating the magnetic strip and/or a smart chip and
allowing a magnetic card reader to read the card information
therefrom.
[0035] In a preferred embodiment of the present invention the
authentication device is utilized to permit the access of user(s)
to a computerized system, the method consists of:
[0036] providing a computerized system including:
[0037] an audio signal input device capable of receiving ultrasonic
signals;
[0038] a sound processing device suitable to receive inputs from
the audio signal input device, and to receive audio and ultrasonic
input signals, and capable of converting the signals into digital
data;
[0039] a data base containing of voice patterns of authorized
users;
[0040] an application capable of receiving digital data inputs from
the sound processing device, of activating other applications, and
of transmitting digital data over network links;
[0041] transmitting from the authentication device an ultrasonic
signal comprising the recorded voice pattern;
[0042] receiving the ultra sonic signal by the audio signal input
device;
[0043] processing the ultrasonic signal and extracting the recorded
voice pattern therefrom; and
[0044] comparing the recorded voice pattern with the voice patterns
stored in the database of authorized users, and if a match is
detected enabling access to the computerized system.
[0045] In yet another preferred embodiment of the present
invention, the authentication device is utilized to permit the
access of user(s) to a computerized system by inputting a vocal
identification signal of a user to an authentication device,
transmitting from the authentication device an ultra sonic signal
comprising the vocal identification signal and user information
stored on the authentication device, receiving the ultrasonic
signal by the audio signal input device, and processing the
ultrasonic signal to extract therefrom the vocal identification
signal and the user information, fetching from a database the voice
pattern of the authorized user associated with the user
information, and comparing the fetched voice pattern to the
transmitted voice pattern to determine if they match, where if a
match is detected enabling access to the computerized system, and
if a match is not detected, disabling the access to the
computerized system. In accordance with another preferred
embodiment of the present invention, the user verification is
performed at a remote server connected to a computer network and/or
the Internet by inputting a vocal identification signal of a user
to an authentication device, transmitting from the authentication
device an ultra sonic signal comprising the vocal identification
signal and user information stored on the authentication device,
receiving the ultrasonic signal by an audio signal input device,
and processing the ultrasonic signal to extract the vocal
identification signal and the user information, transmitting the
vocal identification signal and the user information to the remote
server, via the computer network and/or the Internet over a secure
link, receiving the vocal identification signal and the user
information by the remote server, fetching from a database the
voice pattern of the authorized user associated with the user
information, and comparing the fetched voice pattern with the
transmitted voice pattern to determine if they match, where if a
match is detected, enabling access to the remote server, and if a
match is not detected, disabling the access to the remote
server.
[0046] In accordance with yet another preferred embodiment of the
present invention, the authentication device is utilized to permit
the access of user(s) to a computerized system by inputting a vocal
identification signal of a user to an authentication device,
verifying the user identity on the authentication device, by the
following steps:
[0047] processing the vocal identification signal to obtain the
user's voice pattern;
[0048] comparing the voice pattern to the voice pattern stored on
the authentication device, and transmitting an ultrasonic signal
comprising a match or mismatch indication, and the user
information;
[0049] receiving the ultrasonic signal by an audio signal input
device, and processing the ultrasonic signal to extract the match
or mismatch indication and the user information; and
[0050] enabling access to the computerized system whenever a match
indication is extracted from the ultrasonic signal.
[0051] Preferably, the vocal identification signal, and the user
information, are converted into digital data and modulated into an
ultrasonic signal utilizing Frequency Shift Keying techniques.
Optionally, audio signal input is received through telephony
infrastructures, thereby allowing the identification of users
through said telephony infrastructures. The method may further
comprise an Interactive Voice Response device/application utilized
for allowing access of authorized users to personal information,
and/or manipulating said information.
[0052] According to a preferred embodiment of the present
invention, voice recognition is utilized for the verification of
authorized users, comprising a verification procedure in which the
pronunciation of a predefined word or phrase is checked.
[0053] According to one preferred embodiment of the present
invention, voice recognition is utilized to input into the
authentication device vocal instructions received from the user
by:
[0054] playing a vocal menu, from the authentication device, where
the vocal menu comprises an ordered list of possible options;
[0055] inputting a vocal signal comprising the options selected by
the user to the authentication device: and
[0056] performing the task(s) associated with the selected
option(s).
[0057] The method may further carrying out arithmetic calculation
in combination with the vocal menu, by playing a vocal menu
consisting one or more arithmetic operations, vocally selecting a
desired arithmetic operation, vocally inputting the numeric
value(s) on which said arithmetic operation should be performed,
calculating the result of said arithmetic operation, and vocally
outputting said result.
[0058] The method may further comprise calculating the extra
payments to be paid in addition to a basic payment for a service,
by activating an extra payment calculation function, vocally
inputting the basic payment sum, calculating the extra payment to
be paid according to said sum, and vocally outputting the extra
payment calculation result.
[0059] In accordance with a preferred embodiment of the present
invention, voice recognition is utilized to input vocal
instructions received from the user, to the authentication device,
comprising instructions for launching a selected application, or
for performing selected tasks on a computerized system, by the
following steps:
[0060] inputting to the authentication device an audio signal,
received from the user, comprising instruction to carry out a
desired task;
[0061] performing voice recognition procedures to recognize the
desired task, spoken by the user;
[0062] transmitting an ultrasonic signal comprising instructions
for carrying out the desired task to the computerized system;
[0063] receiving the ultrasonic signal by the audio signal input
device, and processing the ultrasonic signal to extract the
instructions; and
[0064] performing the instructions.
[0065] According to another aspect, the invention is directed to an
authentication device capable of transmitting, receiving and
recording audio and ultrasonic signals, and capable of converting
the signal into digital data, and performing digital signal
processing, the authentication device includes:
[0066] an input device capable of receiving audio and ultrasonic
input signals and of outputting an analog electric signal;
[0067] an analog-to-digital converter suitable to receive analog
electric signals from the input device, and to output equivalent
digital signals;
[0068] a memory device for storing data;
[0069] a press button for activating the device operation;
[0070] a processing unit suitable to receive inputs from the press
button, analog-to-digital converter, and to input and output
digital data from/to the memory device;
[0071] a digital-to-analog converter suitable to receive digital
signals from the processing unit, and to output equivalent analog
signals; and
[0072] an output device capable of receiving analog electric
signals and of transmitting audio and ultrasonic input signals,
that receives analog signals from the digital to analog
converter.
[0073] Optionally, a light-emitting device may be utilized to issue
pattern of light pulses by the processing unit to indicate a match.
The authentication device may comprise a magnetic strip to enable
the authentication device to carry out financial transactions, in
which the magnetic strip is activated by the processing unit
whenever a match of the voice pattern is achieved.
[0074] Alternatively, according to a preferred embodiment of the
present invention, an authentication apparatus for permitting or
denying access to a computerized system, may consist a computerized
system including:
[0075] a sound processing device for receiving audio and ultrasonic
signals, and for converting the signals into digital signals, and
for receiving digital signals and outputting audio and ultrasonic
signals;
[0076] an input device for inputting audio and ultrasonic signals
and for outputting their equivalent analog electric signals;
[0077] means for connecting the output of the input device to the
sound processing device;
[0078] software means for processing digital signals; and
[0079] a database of voice patterns of authorized users.
[0080] Optionally, the input device is connected to telephony
infrastructures, for inputting audio signals over telephone
lines.
[0081] According to another aspect of the present invention, the
invention is directed to an apparatus capable of receiving and
processing audio and ultrasonic signals, comprising a power source,
an input device capable of receiving audio and ultrasonic input
signals, and a data processing device capable of processing audio
and ultrasonic signals.
[0082] According to yet another aspect of the present invention,
the invention is directed to an apparatus capable of outputting
audio and ultrasonic signals, comprising a power source, and an
output device, operating in combination with a data processing
device, capable of outputting audio and ultrasonic input
signals.
BRIEF DESCRIPTION OF THE DRAWINGS
[0083] In the drawings:
[0084] FIG. 1 schematically illustrates a preferred embodiment of
the invention, including a smart card utilized for verification,
and for performing transactions through a computerized system;
[0085] FIG. 2 is a flowchart illustrating the user verification
process according to a preferred embodiment of the invention;
[0086] FIGS. 3a and 3b are flowcharts illustrating access
permission schemes according to the method of the invention;
[0087] FIG. 4 is a flow chart illustrating a process for adding new
users according to a preferred embodiment of the invention; and
[0088] FIG. 5 is a block diagram illustrating hardware
implementation according to a preferred implementation of the
invention.
[0089] FIG. 6 shows a process for launching/starting an application
or service according to the method of the invention.
DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS
[0090] The present invention refers to a method for the
identification of users and for carrying out tasks and transactions
using vocal inputs from the user, storing the user's voice stamp
and other acoustic inputs, and outputting acoustic and/or other
signals to indicate and transmit status and/or commands A user
device, equipped with an audio input and audio output devices, is
utilized to vocally identify authorized user(s), and issue
indications and/or instructions to the user and/or an automated
system.
[0091] The automated system may be embodied by means of a
computerized system, which may be also connected to a computer
network and/or the Internet, or alternatively, by means of a
telephony system such as automated telephony services, mobile
phones or Personal Digital Assistant (PDA) devices.
[0092] FIG. 1 schematically illustrates a preferred embodiment of
the invention, in which a user device 100 is utilized to identify
the user (the owner of the user device), and to interact with the
computer system 110. As will be explained hereinafter, such an
embodiment may be utilized to carry out a variety of operations, in
addition to the verification of the identity of a user.
[0093] The user device 100 is equipped with two transducers, 108
and 109. Transducer 108 is utilized to output audio and ultrasonic
signals (a speaker), and transducer 109 is utilized to input audio
and ultrasonic signals (a microphone). A Central Processing Unit
(CPU) 103 is utilized to perform Digital Signal Processing (DSP)
and for controlling the operation of the user device 100. The
signal inputs from the transducer 108 are provided to the CPU 103
in the form of digital data, via an Analog to Digital Converter
(ADC) 104, and the output signals are provided from the CPU 103, in
the form of digital data, to the transducer 109, via a Digital to
Analog Converter (DAC) 107. Amplifier(s) (not illustrated) may be
utilized to amplify the input and output signals.
[0094] In a preferred embodiment of the invention two dedicated
transducers are utilized for outputting audio signals, a first
dedicated transducer is utilized for telephony communication (to
output sonic signals), and a second transducer is utilized for
communication over the computer network(s) (e.g., Internet). It
should be noted that each transducer may be utilized for the input,
and for the output, of audible signals.
[0095] A memory device 101 is utilized to store identifying
information which is utilized to identify the user (i.e., the owner
of the user device 100), and also for storing software to be
executed by the CPU 103, as well as other details and parameters of
the user. A magnetic strip 106 may be also included on the user
device 100, however its proper operation may be enabled/disabled by
the CPU 103. A possible implementation for such a magnetic strip
(106) may be obtained by utilizing a ferromagnetic loop embedded
onto the user device 100, as disclosed in U.S. Pat. No.
5,434,398.
[0096] It should be noted that the same operation of the user
device may be obtained by utilizing one transducer for inputting
and for outputting signals. A smart chip may be embedded into the
CPU 103, or alternatively, an independent smart chip 116 may be
embedded into the user device 100. A similar user device is
described in U.S. patent application Ser. No. 09/570,399 of the
same applicant herein, wherein one or two transducers are utilized
for the input/output of audio signals. In such embodiments of the
invention each transducer may still be utilized for both inputting
and outputting audio signals. For example, the user device 100 may
be implemented with two transducers, wherein one transducer is
dedicated for the input/output of sonic signals, and a second
transducer, which is dedicated for the input/output of ultrasonic
signals.
[0097] The user device 100 may be a standard credit card (bank
card), or alternatively, it may be a non-standard card, embedded
into a mobile phone, or any other form of consumer electronics (key
chain, PDAs, mobile phones, etc.).
[0098] Voice recognition and voice verification techniques are
utilized to verify the user identification, and for vocally
inputting instructions and/or data to the user device 100. FIG. 2
is a flow chart illustrating the procedure required for user
verification, according to one embodiment of the invention. The
user device 100 is provided to the user only after his voice
pattern (voice stamp) is stored in the memory device 101, step
200.
[0099] The operation of step 200 (designated by a dashed line) may
be performed by the manufacturing and/or distributing credit card
entity, or alternatively, this may be performed over the phone, or
even by adding more keys (push buttons) to the user device, and
enabling more manual operations option. However, it is important to
note that this is performed utilizing the input transducer 109, on
the user device 100, which is the same transducer that will be
utilized for verification during normal operation of the user
device. As will be appreciated by a skilled person, this
substantially contributes to eliminating interfering distortions,
which result from the utilization of different audio input devices.
Thus, the voice stamp (also referred to herein as signature) stored
on the user device's memory in step 200, is authentic and optimal
for the verification task. The signature may comprise a digital
recording of the voice stamp, which in turn may be a predetermined
word/phrase (a password). Alternatively, the signature may comprise
only parameters for identifying the user's speech patterns in
general, or the user's speech patterns related to a predefined
word/phrase which is pronounced by him.
[0100] When verifying the user's identity, (e.g., if he wishes to
make a payment with the user device) a verification procedure is
activated in step 201. In a preferred embodiment, the user will
press the button 102 on the user device 100, and then pronounce a
predetermined word/phrase, which is utilized as his signature. In
effect when pressing on the button 102, the CPU activates the input
circuitries on the user device, which causes the speech of the user
to be collected by the transducer 109. The signal collected through
transducer 109, is amplified and converted into a digital signal by
ADC 104. The CPU 103 retrieves the digital signal from ADC 104, and
in step 203 this digital signal is processed, and the voice pattern
of the user is extracted.
[0101] The verification 206 is carried out by comparing the voice
pattern of the signal retrieved, with the signature stored on the
user device's memory 101. This stage of the verification depends on
the type of verification scheme utilized. If the verification
scheme is based on voice verification, the comparison of step 206
will be based on checking if the voice pattern of the user matches
the voice pattern stored in the user device. This may comprise a
test which checks whether the voice patterns are matching in
general, or alternatively, this test may require that the pattern
of the voice matches also that of the predefined word/phrase. On
the other hand, if voice recognition is utilized, the test may
consist of only checking that the predefined word/phrase are
retrieved in step 202.
[0102] If the voice pattern matches the signature, a TRUE
indication is issued in step 204. The effect of such indication
depends on the type of application for which the user device is
utilized. In one preferred embodiment of the invention the
electronic user device 100 may comprise a Light Emitting Diode
(LED) (not illustrated), which will emit a light pulse (or a
pattern of pulses) in case of a TRUE indication 204. This type of
authentication may be utilized, instead of the custom check of
signatures and identification numbers utilized today.
[0103] Alternatively, the TRUE indication 204 may result in an
activation or manipulation of the magnetic strip 106 of the user
device or the smart chip 116, thus allowing the user to purchase
with it merchandise or services. In this case, however, the
magnetic strip 106 is activated for a limited time period, long
enough to enable reading it by a magnetic strip reader.
[0104] In a preferred embodiment of the invention, a TRUE
indication 204 results in the transmission of a verification signal
through the transducer 108. This verification signal may comprise
the user's signature, and data indicating his details and the
verification results. This transmission is received by an audio
input device 112 (microphone), which is connected to the
computerized system 110, for instance, through a sound card. An
application 115 running on the computerized system 110, processes
the transmission collected by the input device 112, and extract the
signature and other identifying information from the transmission.
As will be described below, the communication with a computerized
system 110 enables a wide range of possible applications,
especially when the computerized system 110 is connected to a
computer network 122, and/or the Internet 121.
[0105] In yet another embodiment of the invention, a TRUE
indication 204 results in the broadcast by transducer 208 of
synthesized speech, or another human audible sound indicating to
humans that the voice is verified.
[0106] The invention may be embodied utilizing a telephone system
instead of the computerized system 110. Such embodiments may be
utilized for banking implementations wherein user identity is
verified over the telephone utilizing the user device 100. Once
user identity is verified, access permission to his account is
permitted, and different transaction can then be carried out. The
user device 100 may be further utilized to communicate with the
banking system over the phone line, and to present the user with
vocal menus/options and enable operations utilizing IVR
utilities.
[0107] According to a preferred embodiment of the method of the
invention, the same audio input device (microphone) is utilized for
generation of the vocal stamp of the user, and for inputting the
user's password (hereinafter referred to also as signature), and
for verifying the user's identity.
[0108] Therefore, the signature is stored on the same device that
performs the authentication, which, according to a preferred
embodiment of the invention, is an electronic card. It is therefore
possible to perform an offline verification of the user. In other
words, the verification may be performed before the user initiates
a transaction, for instance with an e-commerce server 123. It
should be understood that in this case the user device 100 performs
verification of user identification, and the authentication is
actually performed by the server 128.
[0109] If the voice pattern of the user does not match to the
signature stored in the memory 101, the CPU 103, in step 205 issues
a FALSE indication. This FALSE indication 205 may be, for example,
in the forms of one or more of the following;
[0110] Visual indication, for instance a blinking LED, or changing
the color of a hologram (not shown) on the user device 100;
[0111] An audio output, through transducer 108, of a human or
synthesized voice, announcing faulty verification;
[0112] A transmission through transducer 107, indicating to
computerized system 110 that the user with the details found on the
user device, is not authorized to use the device;
[0113] Inactivation of the magnetic stripe 106, or a smart card
chip 116.
[0114] As will be further explained hereinafter, the output of the
user device 100 is not limited to TRUE and FALSE indications, in
the form of visual effects and/or human/machine voice. By utilizing
methods for digital data transmission, the electronic user device
100 may be utilized to output packets of data comprising a variety
of details about the user device 100, the user of the user device,
and other data inputs. In a preferred embodiment of the invention,
digital data is modulated by utilizing Frequency Shift Keying
(FSK), or coded as DTMF or other means, and then transmitted as an
ultrasonic signal, through transducer 109.
[0115] The utilization of audio and ultrasonic signals, for
inputting and outputting data according to the method of the
invention, is greatly advantageous. As described above,
verification results may be relayed in the form of human or machine
(synthesized) voice. On the other hand, the user device 100 can
interact with any computerized system 110 having an audio input
device 112 capable of inputting ultrasonic transmissions, that may
be received by a running application 115, utilizing any regular
sound card (not shown in the figure). Alternatively, as was
described before, sonic transmission may be utilized for telephony
applications (i.e., to perform transactions over telephone systems
utilizing the user device).
[0116] According to a preferred embodiment of the invention, an
application 115 running on the computerized system 110, retrieves
the FSK or other data transmission from the user device 100, and
demodulates or otherwise decodes the digital data, by utilizing
Digital Signal Processing (DSP) techniques, such as those described
in U.S. patent application Ser. No. 09/570,399, and in U.S. patent
application entitled "A METHOD AND SYSTEM FOR REMOTELY
AUTHENTICATING IDENTIFICATION DEVICES", filed on Mar. 22, 2001,
both filed by the same applicant herein. It should be clear that
implementations of the invention are not limited to FSK modulation,
and that other methods of digital data communication may be
exploited (e.g. PSK, DTMF).
[0117] When such embodiment is utilized, access permission
utilities may be easily and efficiently implemented. FIG. 3A and 3B
are flowcharts illustrating two possible access permission
implementations, according to preferred embodiment of the
invention. FIG. 3A illustrates an access permission scheme in which
the verification 300 is performed on the user device 100. The
verification in step 300 is performed as described above, by
receiving the user's voice signal through transducer 109, and
comparing his voice pattern to the voice pattern stored in the user
device 100 (voice verification), or alternatively, by checking if a
predetermined password was pronounced by the user (voice
recognition).
[0118] Such embodiments of the Invention, where user identification
is performed on the user device, are particularly useful in
telephony and IVR applications, and substantially improve the
security. Since the user device performing the operation required
for user verification, there is no need in transmission of user
information, and there is no need in storing same information on a
remote site (e.g., Internet server), where it may be exposed by
hackers and/or eavesdroppers. Moreover, the authentication process
is substantially faster than in methods where authentication is
performed in a remote site, wherein the speed of the authentication
depends on many variable parameters, such as communication speed,
band-width, and reliability.
[0119] Once verification is completed, the results and the user
device's details (user information) are transmitted, in step 301,
through transducer 108, to the audio input device 112. In step 302
the transmission is received and processed by a running application
115, on a computerized system 110. The application 115 checks the
verification results in step 303, and if a TRUE result is returned,
in step 306, access is permitted. The transmission received in step
302 comprises the user device details, that contains information
about the user. Thus the access permission of step 306 may comprise
the performance of additional operations, such as:
[0120] Logging into the user account on the computer system 110,
and thus allowing the user to interact with the computer system
110;
[0121] Playing a "welcome aboard" voice message through speaker
111, connected to the computer system 110;
[0122] Enabling transactions with a remote server 123, connected
through a computer network 122 and/or the Internet 121.
[0123] If a FALSE result is returned, user access is denied (step
305). To improve the security, the number of FALSE access attempts
315 should be limited per user over a predefined period of time. It
should be understood that another way to implement such an access
permission scheme is to check verification 303 after step 300 (on
the user device 100), right after the verification test is
performed. In this way, if access is denied, the process is
terminated without performing steps 301 and 302.
[0124] FIG. 3B illustrates an access permission scheme, which
provides improved security. The access permission scheme described
in FIG. 3A may be bypassed by forging a transmission comprising a
TRUE indication and a user information. To improve the security,
the verification test should be performed by the system to which
access is sought. Thus, in such a scheme the user device 100 is
utilized to record the user's voice (step 310), to the memory
device 101, and to transmit the recorded voice along with user
device details (user information) to the computer system 110.
[0125] The computer system 110 should comprise a database (not
shown) comprising voice patterns (or passwords) of all the entities
having access authorization. After transmission is received (step
312), the user's voice pattern (or his password) is extracted from
the transmission, and his data record, comprising his signature
(and/or password), is fetched from the database. In the next step,
313, the voice patterns (or passwords) are compared to verify the
user's identity. If a match is detected (step 314), access is
permitted, and one or more of the operations described above (as
for step 306 in FIG. 3A) may be performed. Of course, if the
patterns (passwords) do not match (step 305), access is denied.
[0126] When the transaction with a remote server 122 requires a
higher degree of security, for example in banking and commerce
applications, the verification should be performed over a secured
link (e.g. SSL) at the remote server 123. In this way the user
device details and the user's voice (password) are shipped in a
concealed form, and may not be captured by eavesdroppers.
[0127] For the implementation of access permission schemes such as
that described in FIG. 3B, it is not needed to store the user's
signature (or password) on the user device. Additionally, the
magnetic strip 106 is not required in such access permission
implementations (FIG. 3A and 3B). Another preferred embodiment of
the invention is one in which the user device 100 is designed only
to output ultrasonic and audio signals (without an input transducer
109 and ADC 104). In such implementations the user's voice stamp
(or password) and user device details are stored in the user
device's memory 101, and transmitted to the computerized system
110, for verification, whenever button 102 is pressed.
[0128] The method of the invention may be also utilized to
implement Interactive Voice Response (IVR) utilities on the user
device 100. IVR utilities are typically utilized to perform
transactions and to interact with computer systems, as will be
described herein after. Voice recognition schemes may be utilized
to retrieve instruction from the user. In this way, the invention
may be utilized to receive spoken digits or spoken names by the
user device 100, and then to transmit a DTMF or an ultrasonic
transmission to launch the proper service on the computer system
110, for example, to dial a phone (a calling card) number or to
launch a web site or any desired application.
[0129] In another IVR implementation of the invention, the user is
presented with a spoken menu (human or machine voice) played by the
user device through the transducer 108. The user then speaks an
option, and the user device performs the desired tasks. The user
device 100 may be designed to receive information/instructions from
an automated system (e.g., computerized system 110). Such an
embodiment will be referred to herein as a 2-way user device. So
that the Interactive Voice Response can also be a dynamic one (menu
changes) if the card is 2-way. More particularly, the spoken menus
may be modified along with other information stored in the user
device. These modifications may be carried out by receiving
ultrasonic/sonic signals from a computerized system or over a
telephone line.
[0130] In yet another preferred embodiment of the invention, the
IVR utility is utilized to set the user device, by speech, to the
country in which the user is located at a given time. Once the user
device retrieves the country in which it is located, it dials the
local access number of the users calling service provider. The user
device may be updated periodically with the worldwide access
numbers if it is a 2-way user device.
[0131] The invention may be also implemented to provide the user
calculating means that are vocally effectuated. For example, the
user device may be designed to comprise tip calculation, after
paying for some kind of services, the user activates a tip
calculating function (by pressing a button, or by vocally selecting
the desired function from an IVR menu). Once the tip calculating
function is activated, the user announces the invoice sum, which is
collected by the user device. The user device then interprets the
spoken sum, utilizing its voice recognition capabilities, the tip
sum is calculated, and a voice signal (synthesized) is then output
telling the user the sum of the tip.
[0132] Another preferred embodiment of the invention is one in
which `text to speech` utilities are embedded into the electronic
user device 100. In `text to speech` utilities the electronic card
is provided with text information input, through the transmission
of voice or modulated digital data transmitted to the card. The
user device receives this transmission and plays it as an audio
voice (man or machine voice) to the cardholder.
[0133] The invention may be implemented to allow authorization of
temporary users, or to impose certain limitations on user
accessibility to various functions of the user device. For
instance, the invention may be implemented to allow the user to
authorize other users. In such implementation a vocal menu may be
presented to authorized users, once their identity is verified, to
guide them through a process in which a temporary user, or a user
having limited access, is added. A possible process for adding new
users is illustrated in FIG. 4.
[0134] Initially, a new user and the access level required are
defined in step 400. This step may comprise definitions for the
type of usage the new user is authorized, for example, to allow
such a user to access certain computer terminals, and optionally
define a limited period of time in which the new user is allowed to
access certain facilities and/or systems. In the next step, 401,
the new user's voice is required as an input to the user device,
which is then utilized in step 402 for generating and storing the
new user's voice stamp. Optionally, in step 403, some limitation
are defined regarding the use of the device's functions.
[0135] In a preferred embodiment of the invention, the user device
is a credit card having a magnetic strip 106 and/or a smart chip
116. One may want to grant other user (e.g., family members,
employees) permission to purchase with his user device. To do so, a
new user is added, as described in steps 400 to 402, and the in
step 403 the user may define a limit the purchase sum allowed to
some value, thereby limiting the amount of interactions allowed by
the new user.
[0136] The method of the invention may be utilized to launch/start
web sites, computer applications or any online or phone service.
FIG. 6 shows one possible process for launching/starting an
application or service according to the method of the invention.
Once an application program 115 is activated, or phone number of a
desired service provider is dialed, the user announce the "name" of
the service/site to which access is sought (for example, "AT&T"
or "Yahoo"), step 600. Optionally, the user device 100 may be
utilized in step 600A to transmit a verification signal, by
pressing the press button 102, as was described herein above. In
step 601, and the optional step 601A, the audio signals are
received by the computerized system 110 via an audio input or via
the telephony system if the service is accessed via the
telephone.
[0137] After receipt of the audio signal(s) that were transmitted
in step 600, and optionally in step 600A, in step 602 the
transmission is forwarded to server 123, which provides voice
recognition/verification services. The server 123 processes the
transmission and interpret the "name" of the service/site in step
603. Using digital signal processing and voice recognition
techniques, the server identify the site/service the user wish to
launch/start, and the user is then redirected by the server 123.
This may be performed by launching an Internet browser and loading
the requested site, on the computerized system 110, or, if the
service request is performed via the telephone,
activating/connecting the/to requested telephony service.
[0138] If a verification signal was transmitted in the optional
step 600A, the user also gains automatic access his private account
in step 603A. This may be a private account on a server 123 (for
instance, when accessing an Internet site via the Internet).
Alternatively, this the may be bank account services that can be
performed through the telephone.
[0139] The process described in FIG. 6 may be also performed by
utilizing the user device 100, to input the user's vocal request.
In such embodiment the user device is utilized for inputting the
"name" of the requested site/service, and for transmitting an audio
signal to the audio input of a computerized system 115, or to the
audio input of the telephone. The user device 100 may be used to
process and interpret the user's request, and then to forward the
interpretation of the "name" of the requested service/site to the
computerized/telephony system. In this case the services of voice
verification/recognition server 123 are not required (step 602), so
that step 603 may be performed directly after receipt of the user
device transmission.
[0140] It should be clear that data communication between the user
device 100, and the computerized system 110 may be performed by
techniques employing modulation of data on an electromagnetic
carrier(s) (radio transmission, infrared, etc.). However, employing
other methods for data transmission will require the addition of
dedicated hardware, while the preferred method of the invention
employs hardware means existing in almost any personal computer
system today (i.e. sound card having audio input and output).
[0141] FIG. 5 is a flow chart illustrating a hardware
implementation according to a preferred embodiment of the
invention. This hardware implementation is based on Speech
Recognition Microcontroller 506, which performs the operations of
user verification, and the operation required for sending an
ultrasonic/sonic signal through the transducer 504. The transducer
can be any transducer that complies with the size (12.+-.3 mm in
diameter), Frequency response (0.5-5 kHz), and Sensitivity (about
60 db) demands which allows input/output of human voice, and that
can be embedded into the user device.
[0142] The Speech Recognition Microcontroller 506 may be any device
that comprises sampling capabilities of analog input signals, an
ADC for converting the sampled values into digital form and a
memory for storing the digital data. Such device also includes
circuitry for sampling received signals in real time and processing
means required for comparing digital data derived from real time
sampling of received signals to the stored data and outputting the
corresponding results. In addition such device comprises an
external clock input for controlling the timing of operations
wothin said device, so as to eliminate the need for an internal
crystal clock oscillator (which is of unacceptable thickness) and
to allow using an external clock that is implemented by using
electronic (low profile) components that can be easily integrated
into the user device.
[0143] Optionally, the Speech Recognition Microcontroller 506 may
also include DAC (not illustrated), for synthesizing and outputting
an analog voice signal. Such device may be for example, RSC-300/364
manufactured by Sensory (Santa Clara, Calif., USA).
[0144] The battery 500, is a 3 Volt cell battery, is preferably 20
mm in diameter and 0.4 mm in thickness (for example--Panasonic
CR2004, CR2404, or any other type that has a 3 volt thin battery
that can supply around 4 mA). All of the system components, 501 to
506, are electrically disconnected from the battery 500 when the
system is not operating. System's operation is initiated by a
silicon switch 501, which comprises a latching circuitry (not
illustrated) which enables the Speech Recognition Microcontroller
506 to control the system's power supply once the silicon switch
501 is pressed. The multiplexer and amplifier circuitry 505(for
example--Maxim's MAX4598, MAX4618, and MAX 4639), is utilized to
amplify and filter the transducer 504 input signal, and to enable
the transducer 504 to operate both as input and output device. The
code memory device 503 is utilized to store the code for the Speech
Recognition Microcontroller 506, and the data memory device 502 is
utilized to store user voice stamp(s)(for example--Atmel AT27BV512,
ST SST29VE020 for code memory and Microchip 24lc64, 24lc32 for data
memory)
[0145] It should be noted that in a preferred embodiment of the
invention the code utilized for the user device operation is stored
in the the Speech Recognition Microcontroller 506.
[0146] When the silicon switch 501 is pressed, the system power
supply is latched, and the system powered up. The Speech
Recognition Microcontroller 506 outputs an audio (or visual, if the
user device comprise a LED) signal indicating that the user device
is activated. The Speech Recognition Microcontroller 506 sets the
multiplexer and amplifier circuitry 505 receive an input signal via
the transducer 504. The user's voice signal is received by the
transducer 504, amplified and filtered by circuitry 505, and
sampled by the Speech Recognition Microcontroller 506. The Speech
Recognition Microcontroller 506 then filters and amplify the
sampled signal and extract the user's voice pattern parameters. The
user's voice pattern parameters are then compared to the voice
stamp stored on the data memory device 502.
[0147] As illustrated in FIG. 5, the Speech Recognition
Microcontroller 506 is always connected to the transducer 504,
through output line 507. This connection is utilized for outputting
an FSK modulated signal. In another preferred embodiment of the
invention two transducers are utilized, one for outputting
ultrasonic signals, and the other for outputting sonic signals.
[0148] The user device should be designed and configured to enable
minimization of current consumption. In order to reduce the current
consumption one or more of the following may be performed:
[0149] reduction of the working voltage (from 5 or 3 Volts down to
2.2/2.4 Volts);
[0150] shutting down unessential components, for instance,
inactivation of the amplifier when signal input is computed (Can
save around 1 mA); and
[0151] lowering the Speech Recognition Microcontroller working
speed (From a 14.3 Mhz frequency to around 7 Mhz).
[0152] The above examples and description have of course been
provided only for the purpose of illustration, and are not intended
to limit the invention in any way. As will be appreciated by the
skilled person, the invention can be carried out in a great variety
of ways, employing different techniques from those described above,
all without exceeding the scope of the invention.
* * * * *