Next: Photobook
Up: Systems
Previous: MIR
Figure 18:
NETRA. Result of querying on shape with the complex description.
 |
Department of Electrical and Computer Engineering, University of California,
Santa Barbara, CA.
http://maya.ece.ucsb.edu/Netra/.
A demo of the system is available at webaddress
http://maya.ece.ucsb.edu/Netra/netra.html.
[Ma97], [MM99].
Images in the database are segmented into regions of homogeneous color.
Of those regions, the following features are extracted: color, texture, shape,
and spatial location.
On the basis of a training set of images, the RGB color space is quantized,
and represented by a color codebook of 256 colors, the centroids of the
quantization cells.
The colors of an image region are also quantized, giving a color feature vector
,
with ci the index into the color code book,
and pi the fraction of that color in the region,
.
The number n is the number of colors used to represent the region, which is
different for each region.
Texture is represented by a feature vector ft containing the normalized
mean and standard deviation of a series of Gabor wavelet transforms of the
image:
,
with s the number of scales, and k the number of directions.
There are three feature vectors used to represent the shape of regions.
The first, fK, is based on the curvature function of the contour, giving
the curvature at each point on the contour.
The second, fR is based on the centroid distance function, giving at each
contour point the distance to the centroid of the region.
The third, fZ, is the complex coordinate function, representing each contour
point as a complex number with real component equal to the x-coordinate, and
the imaginary component equal to the y-coordinate.
On 64 samples of each of these functions, the fast Fourier transform (FFT) is
applied, of which the real (amplitude) component of the coefficients is used,
the numbers
.
The feature vectors are as follows:
,
,
.
There are 2,500 images from the Corel photo collection, organized in 25
categories, with 100 images in each category.
You can select any one of them as the query image.
All images in the database have been segmented into homogeneous regions.
You can click on one of the regions and select one of the four image attribute
color, spatial location, texture, and shape.
Instead of using an image example, you can also directly specify the color and
spatial location.
The spatial location querying tool utilizes two bounding boxes to define the
area of interest.
The inner box is used to define the preferred area, and the box outside is
used to constrain the objects to be within this area.
Thus, if the object has any its bodies exceeding this outside box, they will
not be considered.
Consider two color feature vectors, fcA of region A, and fcB of region B.
For each color ci in fcA, the closest color ckB in fcB is found, and
the distance
d(ciA,fcB) is calculated as the weighted Euclidean distance in
RGB space:
d(ciA,fcB)=|piA-pkB| d(ciA,ckB).
The distance between the two color feature vectors is now
+
.
The distance between two texture feature vectors is the L1-distance.
The distance between two shape feature vectors is the Euclidean distance.
Indexing is based on the SS-tree [WJ96].
Color, texture, and shape are indexed separately.
The first feature the user specifies is used to retrieve about 100 candidates.
Then this feature and the possible other features together are used to order the
retrieval result.
The matched images are linearly ordered, see figure 18.
An initial prototype of NETRA is used in ADLADL (see above) to
search on texture.
Next: Photobook
Up: Systems
Previous: MIR
Remco Veltkamp
2001-03-08