|
|
Basic Recommendations for Speaker RecognitionThe speaker recognition accuracy of VeriSpeak and MegaMatcher depends on the audio quality during enrollment and identification. Certain constraints should be noted before or during algorithm integration into a speaker recognition system, whereas other can be overcome by enrollment with the same phrase in different environments. At least 2-seconds long voice samples are recommended to assure recognition quality. General SecurityA passphrase should be kept in secret and not pronounced in an environment where other people may hear it if the speaker recognition system is used in a scenario with unique phrases for each user. MicrophonesThere are no particular constraints on models or manufacturers when using regular PC microphones, headsets or the built-in microphones in laptops, smartphones and tablets. However these factors should be noted:
Sound SettingsSettings for clear sound must be ensured, as some audio software, hardware or drivers may have certain means of sound modification enabled by default. For example, the Microsoft Windows OS usually has sound boost enabled by default. At least 11,025 Hz sampling rate with at least 16-bit depth should be set during voice recording. Environment ConstraintsThe VeriSpeak and MegaMatcher speaker recognition algorithm is sensitive to background noise or loud voices in the background that may interfere with the user's voice and affect the recognition results. These solutions may be considered to reduce or eliminate these problems:
User Behavior and Voice ChangesThese natural voice changes do not occur often but may affect speaker recognition accuracy:
The aforementioned voice and user behavior changes can be managed in two ways:
Go to MegaMatcher or MegaMatcher Embedded or VeriSpeak or VeriSpeak Embedded contents |
Products
SDKs for mobile devices:
More products for developers:
End-user products:
|