|
|
SentiSight object learning and recognition processesObject learning process
Object learning
Click to zoom
In order to recognize an object in an image, the appearance of an object must first be memorized. In the learning phase, SentiSight algorithms extract specific object features from a video stream or single image and save them into what is known as an object's model. In many cases there is more information in a video or single image than just the object you want SentiSight to learn, like a background, other objects in the room or a hand holding the object. Therefore, to learn an object, information about the exact location of the object in the image should be provided. SentiSight supports 2 methods of object learning: manual and automatic. Manual object learning is suitable for most situations. A user must perform these steps for manual object learning in the SentiSight 2.1 SDK:
Automatic object learning is suitable for lightweight movable objects. This learning procedure is based on detecting an object by excluding a static background and object's holder (usually a hand) from an image. A user must perform these steps for automatic object learning in the SentiSight 2.1 SDK:
Therefore, the automatic method requires to use live video or to provide separate videos or image sets of background, holder and object. Also, the other background elements could be learned together with the object if the object is hardly separable from the background. This can affect the ability of the algorithm to recognize the unique qualities of the object and may result in the object being misclassified with other objects that have the same background. Manual object learning should be used for objects that cannot be moved or if there are no way to provide separate media with objects background and/or holder. Thus, automatic learning provides less amount of user interaction with the system, but it is not as precise as manual learning. Also manual learning is suitable for wider range of cases. Object recognition process
Object recognition
Click to zoom
Object recognition requires no user interaction apart from providing a video file with the object or pointing a camera to the scene where the learned object is presented or will appear. When the object appears in the vision field, SentiSight tries to recognize it. If the object is recognized by SentiSight, object's name (ID) and coordinates are returned. The SentiSight algorithm creates a model with possible views from different sides, in different 3D poses and in different lighting conditions in object learning stage. This object's model improves recognition capability. SentiSight's object recognition is comparably fast – around 10 frames per second for a single object model (320 X 240 resolution). However for tasks when an even faster response is needed, the SentiSight 2.1 library has a tracking mode that enables tracking speeds up to 20 frames per second. Tracking is initialized if an object is recognized and located, then tracks the object until it changes somewhat in appearance, at which point tracking is reinitialized by recognition. The tracking feature is sensitive to complex backgrounds, and tracking is more difficult with homogenous objects SentiSight algorithm capabilities and requirementsSentiSight is designed to be as universal as possible and is able to perform fully automatic and manual object learning. Some of the potential applications for SentiSight include security systems, vision systems for robots, machine vision (like parts recognition in production lines), search engines that recognize objects in picture files, road signs recognition, etc. The SentiSight 2.1 algorithm has these capabilities for advanced visual-based object learning and recognition:
These conditions may alter algorithms performance:
All performance evaluations were made using a PC with 2.4 GHz Intel Core2 Duo CPU Reliability Tests and Technical SpecificationsSentiSight 2.1 was tested with object images from many cameras. At 0.1% False Acceptance Rate (FAR), the recognition rate is from 70% to more than 99% depending on object structural appearance, transparency, etc. For objects with well defined intenal structure, the recognition rate is 98% - 99% at 0.1% FAR.
|
Products
AFIS or multi-biometric face-fingerprint and optionally iris and palm print identification for large-scale systems.
More products:
|
||||||||||||||