Computer Vision
• Optical Flow

Due: November 6

Code

Questions

  1. Generalize the line fitting technique covered in class, which allows fitting a single line to a set of points, to allow fitting multiple lines to a set of points. Your solution should use the Expectation-Maximization (EM) framework and answer the following questions:

  2. Imagine that you are the lead researcher for the Image Search group at Google. Outline an image retrieval system that takes an image as input and returns a rank-ordered list of images from the Internet that are most similar. Your solution should make use of the image segmentation and texture analysis methods we covered in class. Clearly specify the way you define distance between two images (this must be a scalar value).
The first step would be to separate the input image into some set of usable images. This would be achieved by computing the Laplacian of the input image scaled by powers of two. The so-called Laplacian pyramid could then be convolved on each level with a subset of the textons proposed by Olhausen and Field. This would produce N number of Laplacian pyramids, where N would be the number of textons. N is hard to determine without knowledge of the results of this operation, but it should be related to the amount of variance in "texture" space the input image has. The textons can be thought of as eigentextures and so depending on the input image, fewer eigentextures may be needed to decompose it. The resulting images can be thought of as vectors in texture space. These vectors can then be searched against a database of precomputed texture vectors. The "engine" under the hood here is this: assuming that we can recover the eigentextures present in the world, we can project any input texture into this space and the Euclidean distance in texture space should be small when a match is present. Therefore, we would look to minimize the Euclidean distance in texture space between the input convolved Laplacian pyramids and the database's convolved Laplacian pyramids. The Laplacian pyramid approach is rather secondary to the mechanism behind the matching. It ensures that scaled forms of the same texture can be accurately found too. Unfortunately this approach would require a good deal of texture present in all images so that each vector in texture space would be unique.
 False-Colorization
V = 1 :: Relatively no error is shown
γ = 0.7
γ = 0.5
γ = 0.2 :: Low error is accurately visualized and the overall scaling is adequate
Original Image
False-Colored Image

 

Optical Flow

Gradient Magnitude
Gradient in t-Direction
Gradient in Y-Direction
Gradient in t-Direction

 

Optical Flow
Stability
Error


Optical Flow
Stability
Error


Optical Flow
Stability
Error


Optical Flow
Stability
Error


Optical Flow
Stability
Error


Optical Flow
Stability
Error


Optical Flow
Stability
Error


Optical Flow
Stability
Error


Optical Flow
Stability
Error


Optical Flow
Stability
Error


Optical Flow
Stability
Error


Optical Flow
Stability
Error


Optical Flow
Stability
Error


Optical Flow
Stability
Error


Optical Flow
Stability
Error


Optical Flow
Stability
Error


Optical Flow
Stability
Error


Optical Flow
Stability
Error


Hand Image
Flowers Image
Image
N=10 Error
N=10 Stability
N=20 Error
N=20 Stability
N=40 Error
N=40 Stability