Technical Specifications
All biometric templates should be loaded into RAM before identification, thus the maximum biometric templates database size is limited by the amount of available RAM.
The technical specifications below are presented for fingerprint, face, iris, voiceprint and palmprint biometric engines.
Fingerprint engine specifications
Fingerprint scanners are recommended to have at least 500 ppi resolution and at least 1" x 1" fingerprint sensors. The specifications are provided for 500 x 500 pixels fingerprint images and templates extracted from these images. Also, the matching algorithm has a special mode for matching different scale fingerprint records, like different image resolutions or age-related changes in finger size.
MegaMatcher fingerprint template extraction and matching algorithm is designed to run on multi-core processors allowing to reach maximum possible performance on the used hardware.
MegaMatcher 13.1 fingerprint engine specifications | |||||
---|---|---|---|---|---|
Embedded / mobile (1) platform |
PC-based (2) platform |
Server platform |
|||
Template extraction components | Mobile Fingerprint Extractor |
Mobile Fingerprint Client |
Fingerprint Extractor |
Fingerprint Client |
Fingerprint Image Processing(3) |
Template extraction speed (fingerprints per minute) |
45 | 50 | 45 | 100 | 3,000 |
Template matching components | Mobile Fingerprint Matcher |
Mobile Fast Fingerprint Matcher |
Fingerprint Matcher |
Fast Fingerprint Matcher(2) |
|
Template matching speed (fingerprints per second) |
3,000 | 200,000 | 40,000 | 200,000 | |
Single flat/plain fingerprint record size in a template (4) (bytes) | 300 - 3,200 (configurable) |
||||
Single rolled fingerprint record size in a template (4) (bytes) | 1,100 - 6,600 (configurable) |
Face engine specifications and recommendations
-
General recommendations for facial recognition:
- Face recognition accuracy of the MegaMatcher algorithm heavily depends on the quality of a face image. Image quality during enrollment is important, as it influences the quality of the face template.
- 32 pixels is the recommended minimal distance between eyes for a face on image or video stream to perform face template extraction reliably. 64 pixels or more recommended for better face recognition results. Note that this distance should be native, not achieved by resizing an image.
- Several images during enrollment are recommended for better facial template quality which results in user experience improvement during recognition.
- Additional enrollments may be needed when facial hair style changes, especially when beard or mustache is grown or shaved off.
- Persons wearing face masks or respirators can be recognized without separate enrollment. Face quality check should be disabled for this scenario.
-
The face recognition engine has certain tolerance to face posture:
-
head roll (tilt) – ±180 degrees (configurable);
- ±15 degrees default value is the fastest setting which is usually sufficient for most near-frontal face images.
-
head pitch (nod) – ±15 degrees from frontal position.
- The head pitch tolerance can be increased up to ±25 degrees if several views of the same face that covered different pitch angles were used during enrollment.
-
head yaw (bobble) – ±90 degrees from frontal position (default value).
- Smaller yaw tolerance values are not recommended to be used except if the target system does not meet the system requirements.
- Several views of the same face can be enrolled to the database to cover the whole ±90 degrees yaw range from frontal position.
-
head roll (tilt) – ±180 degrees (configurable);
-
Face liveness check:
- The face liveness check algorithm was tested by iBeta and proven to be compliant with ISO 30107-3 Biometric Presentation Attack Detection Standards.
- A stream of consecutive images (usually a video stream from a camera) or a single image (in some modes) are required for the face liveness detection.
- When the liveness check is enabled, it is performed by the face engine before feature extraction. If the face in the stream fails to qualify as "live", the features are not extracted.
- Only one face should be visible in these frames.
- 80 pixels is the recommended minimal distance between eyes (IOD) for a face to perform liveness check reliably. 100 pixels or more recommended for smoother performance.
- During passive liveness checks the face should be still and the user has to look directly at the camera with ±15 degrees tolerances for roll, pitch and yaw to experience the best performance.
- Optionally, ICAO compliance check can be used to strengthen the liveness check.
-
Users can enable these liveness check modes:
- Active – the engine requests the user to perform certain actions like blinking or moving one's head. All requested actions should be performed to pass the liveness check. This mode can work with both colored and grayscale images. 5 frames per second or better frame rate required.
- Passive – the engine analyzes certain facial features while the user stays still in front of the camera for a short period of time. Colored images are required for this mode. 10 frames per second or better frame rate is required. Better score is achieved when users do not move at all.
- Passive + Blink – the engine analyzes certain facial features while the user stays still in front of the camera for a short period of time, when the engine requests the user to blink. Colored images are required for this mode. 10 frames per second or higher frame rate required.
- Passive then active – the engine first tries the passive liveness check, and if it fails, tries the active check. This mode requires colored images.
- Simple – the engine requires user to turn head from side to side while looking at camera. This mode can work with both colored and grayscale images. 5 frames per second or better frame rate recommended.
- Single frame passive – the engine uses a neural network to estimate if a face image is not inserted in front of a camera using a paper photo or smartphone screen. This mode does not need any interaction from the user.
The specifications below are provided for the default roll and yaw values.
MegaMatcher face template extraction and matching algorithm is designed to run on multi-core processors allowing to reach maximum possible performance on the used hardware.
MegaMatcher 13.1 face engine specifications | |||||
---|---|---|---|---|---|
Embedded / mobile (1) platform |
PC-based (2) platform |
Server platform |
|||
Template extraction components | Mobile Face Extractor |
Mobile Face Client |
Face Extractor |
Face Client |
Face Image Processing(3) |
Template extraction speed (faces per minute) |
45 | 50 | 45 | 100 | 3,000 |
Template matching components | Mobile Face Matcher | Mobile Fast Face Matcher |
Face Matcher | Fast Face Matcher(2) | |
Template matching speed (faces per second) |
3,000 | 200,000 | 40,000 | 200,000 | |
Single face record size in a template (4) (bytes) | 322 |
Iris engine specifications
Iris capture cameras are recommended to produce at least 640 x 480 pixels images. The specifications are provided for these images.
MegaMatcher iris template extraction and matching algorithm is designed to run on multi-core processors allowing to reach maximum possible performance on the used hardware.
MegaMatcher 13.1 iris engine specifications | |||||
---|---|---|---|---|---|
Embedded / mobile (1) platform |
PC-based (2) platform |
Server platform |
|||
Template extraction components | Mobile Iris Extractor |
Mobile Iris Client |
Iris Extractor |
Iris Client |
Iris Image Processing(5) |
Template extraction speed (irises per minute) |
45 | 50 | 45 | 100 | 3,000 |
Template matching components | Mobile Iris Matcher | Mobile Fast Iris Matcher |
Iris Matcher | Fast Iris Matcher(2) | |
Template matching speed (irises per second) |
3,000 | 200,000 | 40,000 | 200,000 | |
Single iris record size in a template (4) (bytes) | 2,486 |
Voiceprint engine specifications and recommendations
-
General recommendations:
- The speaker recognition accuracy of MegaMatcher depends on the audio quality during enrollment and identification.
- Voice samples of at least 2-seconds in length are recommended to assure speaker recognition quality.
- A passphrase should be kept secret and not spoken in an environment where others may hear it if the speaker recognition system is used in a scenario with unique phrases for each user.
- The text-independent speaker recognition may be vulnerable to attack with a covertly recorded phrase from a person. Passphrase verification or two-factor authentication (i.e. requirement to type a password) will increase the overall system security.
-
Microphones –
there are no particular constraints on models or manufacturers when using regular PC microphones, headsets or the built-in microphones in laptops, smartphones and tablets.
However these factors should be noted:
- The same microphone model is recommended (if possible) for use during both enrollment and recognition, as different models may produce different sound quality. Some models may also introduce specific noise or distortion into the audio, or may include certain hardware sound processing, which will not be present when using a different model. This is also the recommended procedure when using smartphones or tablets, as different device models may alter the recording of the voice in different ways.
- The same microphone position and distance is recommended during enrollment and recognition. Headsets provide optimal distance between user and microphone; this distance is recommended when non-headset microphones are used.
- Web cam built-in microphones should be used with care, as they are usually positioned at a rather long distance from the user and may provide lower sound quality. The sound quality may be affected if users subsequently change their position relative to the web cam.
-
Sound settings:
- Settings for clear sound must be ensured; some audio software, hardware or drivers may have sound modification enabled by default. For example, the Microsoft Windows OS usually has, by default, sound boost enabled.
- A minimum 8000 Hz sampling rate, with at least 16-bit depth, should be used during voice recording.
-
Environment constraints –
the MegaMatcher speaker recognition engine is sensitive to noise or loud voices in the background; they may interfere with the user's voice and affect the recognition results.
These solutions may be considered to reduce or eliminate these problems:
- A quiet environment for enrollment and recognition.
- Several samples of the same phrase recorded in different environments can be stored in a biometric template. Later the user will be matched against these samples with much higher recognition quality.
- Close-range microphones (like those in headsets or smartphones) that are not affected by distant sources of sound.
- Third-party or custom solutions for background noise reduction, such as using two separate microphones for recording user voice and background sound, and later subtracting the background noise from the recording.
-
User behavior and voice changes:
-
Natural voice changes may affect speaker recognition accuracy:
- a temporarily hoarse voice caused by a cold or other sickness;
- different emotional states that affect voice (i.e. a cheerful voice versus a tired voice);
- different pronunciation speeds during enrollment and identification.
-
The aforementioned voice and user behavior changes can be managed in two ways:
- separate enrollments for the altered voice, storing the records in the same person's template;
- a controlled, neutral voice during enrollment and identification.
-
Natural voice changes may affect speaker recognition accuracy:
MegaMatcher voiceprint template extraction and matching algorithm is designed to run on multi-core processors allowing to reach maximum possible performance on the used hardware.
MegaMatcher 13.1 voiceprint engine specifications | |||||
---|---|---|---|---|---|
Embedded / mobile (1) platform |
PC-based platform |
Server platform |
|||
Template extraction components | Mobile Voice Extractor |
Mobile Voice Client |
Voice Extractor(2) |
Voice Client(2) |
Voice Processing(5) |
Template extraction speed (voiceprints per minute) |
45 | 50 | 45 | 100 | 3,000 |
Template matching components | Mobile Voice Matcher | Voice Matcher | Fast Voice Matcher(2) | ||
Template matching speed (voiceprints per second) |
100 | 8,000 | 40,000 | ||
Single voiceprint record size in a template (4) (6) (bytes) | 3,500 - 4,500 |
Palmprint engine specifications
MegaMatcher palm print template extraction and matching algorithm is designed to run on multi-core processors allowing to reach maximum possible performance on the used hardware.
MegaMatcher 13.1 palm print engine specifications | ||
---|---|---|
PC-based platform |
Server platform |
|
Template extraction component | Palm Print Client(2) | Palm Print Image Processing(3) |
Template extraction speed (palm prints per minute) |
15 | 350 |
Template matching component | Palm Print Matcher | Fast Palm Print Matcher(2) |
Template matching speed (palm prints per second) |
800 | 4,000 |
Average single palm print record size in a template (4) (kilobytes) | 33 |
Notes:
(1) Requires to be run on iOS devices or Android devices based on at least Snapdragon S4 system-on-chip with Krait 300 processor (4 cores, 1.51 GHz).
(2) Requires to be run on PC or laptop with Intel Core i7-8700K or similar processor to reach the specified performance.
(3) Requires to be run on server hardware with at least Dual Intel Xeon Gold 6126 processors (2.6 GHz) to reach the specified performance.
(4) MegaMatcher 13.1 allows to store multiple biometric records of the same or different biometric modalities in a template; in this case the template size is the sum of all included biometric records.
(5) Requires to be run on server hardware with at least Intel Xeon Gold 6126 processor (2.6 GHz) to reach the specified performance.
(6) The specifications are provided for 5-second long voice samples; template size has linear dependence from voice sample length.