| |

 |
Learning
a Model |
- Target: a target
is a form that you wish to recognize (e.g. a car, a face,
a weapon…). SpikeNet allows you to search for multiple
targets simultaneously.
- Input image:
The Input Image is the image on which your perform the search
using the models in the database.
- Source image: Image used to create a
model. You can use the entire image or only a selection.
- Region: You can
use various tools to select a region for learning. The default
selection tool is rectangle but other tools can be selected
using the Region Selection Tool option of the Edit menu:
rounded rectangle, ellipse, free-hand. You can also fix
a determined size using the select fixed size option in
the Edit menu.
- Learning a model: Process to create models.
Select a region then double click within that region. The
model is automatically added to the model list and displayed
in the “Model” window.
- Model: a model is a region (an object,
a face, a car, a wing of a plane…) selected from a
source image that you wish to locate on other images. You
can create any number of models for each target that you
wish to recognize and add them to the Model List. You can
save these models in a Models File (.snm) which can be loaded
subsequently and used in the recognition.
- Models file: File containing models (.snm)
- Model Identificator (Model ID): Identificator
used for specifying a model within the database. It is unique
and given by the system during the addition of the model
to the database.
- Model Logical Name:
Name given to a model. By default, the name is that of the
source image file used to create the model. If you create
a model using a source image file named “airplane”,
the first model you create will be named “airplane
1”, the second, “airplane 2”, and so on.
Each Model has a unique Logical Name which you can change
by editing the Model Parameters.
- Model Generator:
When you create a model, you can use the Model Generator
to specify a range of orientations and zoom factors, together
with the step size between models. Click on the Learn option
in the Model menu. By default the Zoom and Rotate factors
boxes are left unchecked. If you check them, SNVision MB
will generate multiple models to cover the specified range
of zoom factors and/or orientations.
- Model Info: Properties of a model. Model
properties can be accessed by clicking on the "+"
sign to the left of the model name in the Model List Window.They
include:
- Model Identificator
- Model size: width and height of the model
- Heigh and width references (size of the original selection
on the source image)
- Model Logical Name
- Color: A colored shape (circle, cross, point, zone) is
used to locate the model when it is recognized on an image.
This color can be changed when you edit the model.
- Zoom: zoom factor applied
- Rotate: rotation factor applied
- Threshold: threshold value
- Detail: detail value
- Model List: Window displaying all models
by name. The checked models are enabled.
- Model parameters: Default search settings
are assigned to the model that you learn and then search.
Model search settings can affect the speed and robustness
of your application. It is recommended that you begin with
the default settings, and then adjust the individual settings
one by one in order to train the system to search for the
model in the most efficient manner.
- Threshold: Adjusts
the sensitivity of the model. Increasing the threshold will
increase the sensitivity of the model, generating hits that
better match your model. The higher the threshold value
is, the more similar to the model an object has to be in
order to be detected
Lowering the threshold will lower the sensitivity increasing
the amount of error. This can be useful when you have used
a single model (a face for example) and want the system
to respond even when the view is not identical to that of
the model used.
- Detail: Quantity
of information contained in a model. Increasing the detail
value will improve the signal to noise ratio, but with the
penalty of increasing the processing time.
- Enabled (Model): By default all models
are enabled. But you can disable models within your list.
This can be useful if you have created models for different
targets and want the recognition process to focus only on
specific targets.
- Color: Color of the shape used to locate
the model on the input image.
- Name: Model name
 |
Recognizing a model |
- Recognition: Searches for model(s) in
the Input image and writes the results in a Results table.
The recognition process will find all the occurrences of
each enabled model. But, you can limit the number of hits
(detections) produced. Click on the “Recognize”
button to start the search.
- Recognition Parameters
Default settings are assigned when you first initiate the
recognition process. You can adjust all recognition parameters
in order to improve the performance and robustness of the
recognition.
- Spatial filter: Filter used to prevent
multiple recognitions within the same region. Only the detection
with the higher Quality score remains.
- Propagation: information extracted from
the input image.
When you increase the propagation value, more information
about the image is provided thus improving the signal to
noise ratio of the recognition process. But as more information
is provided, the processing time goes up and you increase
the number of false positives. The propagation value should
be kept below 30.
- Minimum Contrast:
SpikeNet’s algorithm is designed to be extremely good
at detecting objects even when the contrast in the image
is very low. One problem that this can cause is that in
an image where the only interesting objects are at high
contrasts, unwanted detections may occur in regions of very
low contrast. To avoid this sort of problem, you can raise
the Minimum Contrast value that is normally fixed at 0.
This rules out the false detections occurrences in regions
with very low contrast and greatly speeds up the processing
time. Only the regions in which the contrast is above the
Minimum Contrast value are subject to processing.
- Resolution:
The resolution refers to the spacing of the pixels in an
image. The higher the resolution, the more pixels in the
image. Decreasing the resolution will improve the processing
time and strengthen the robustness of the recognition because
you reduce the image detail. Make sure that you modify the
image resolution before learning a model.
- Maximum hits: When you start a recognition
process, you can set the expected number of occurrences
of the model to be found in the Input Image and shown in
the Recognition Results Table. The Maximum Hits value applies
to all models in the list.
- Multiscale: The multiscale option enables
you to process an image at several different scales simultaneously.
This provides an alternative way of achieving size invariant
recognition to that provided by the zoom options that can
be used during model learning (see Model Generator).
- Mask: Image used for
defining a region of interest within an image thus reducing
the processing time and the number of false positives in
the hidden regions (outside the mask).
To create a mask, select a region using the Region Selection
Tools in the Edit menu, and click on Add mask in the Processing
menu. You may save the image mask for later use (.pgm file)
by clicking on Save Mask in the Processing Menu.
 |
Recognition Results |
- Hits: Occurrences (detections) of the
enabled model(s) found on an input image
- Results: Each time the recognition process
is initiated the Results window provides the following information
about the recognition:
- Rank: Detections (hits) listed by descending
order of quality
- ID: Model Identificator
- Model Logical Name
- Model X and Y coordinates: When an occurrence
of the model is found, the X and Y coordinates of the actual
found position are provided. The reference point is at the
center of the model.
- Quality: The quality
is a confidence factor. Quality scores range from 0 to 2000
with higher values corresponding to higher quality.
- Frame: Image number in a video sequence.
The status bar of the main window also provides information
on the recognition results:
- Hit Count: Number of hits that
were found in the active Input Document (Image, Video
Sequence, Video Capture).
- Processing time in ms
|
| |
|
|
|