关键词 > COMP/ENGN4528

COMP/ENGN4528 Computer Vision Mid-semester test, First Semester 2022

发布时间：2023-08-01

Hello, dear friend, you can consult us at any time if you have any questions, add WeChat: daixieit

College of Engineering and Computer Science

Mid-semester test, First Semester 2022

COMP/ENGN4528 Computer Vision

Question Booklet

Reading time: 15 minutes

Writing time: 1 hour

Uploading time: 15 minutes

There are 3 questions in total.

(Q1-Q3)

Each has multiple parts.

Please name your submission as

COMP/ENGN4528_midsem_u1234567.docx

(or PDF as you prefer)

Q1.(26 points) Multiple Choices 26 Marks. There is at least one correct answer for each question. You can get the full mark only if all correct answers are selected. No mark will be granted for partially selected answers. (2 marks each)

1. (2 points) Which filter(s) could be used for removing salt-and-pepper noise?

A. Gaussian Filter

B. Bilaterial Filter

C. Median Filter

D. Sobel Filter

2. (2 points) Please select the linear filter(s).

A. Gaussian filter

B. Median filter

C. Bilateral filter

D. Sobel filter

3. (2 points) Suppose a camera has a calibration matrix follows: 10(.)5 105 0(0))

( 0 0 1)

What does this tell you about the camera?

A. This is not a valid camera matrix.

B. The focal length is 1.5.

C. The camera has been rotated.

D. The camera has been translated.

E. None of the above.

4. (2 points) Which of the following are advantages of the HSV colour-space with respect to the RGB colour-space ?

A. Brightness information is separated into a single channel.

B. It uses less memory to encode each pixel.

C. Its fast to compute, so good for compression.

D. The Hue and Saturation components of HSV are less sensitive to lighting variation.

E. It is a perceptually uniform space - distances in the colour space itself correspond to differences in human perception of colour.

5. (2 points) Which of the following is a (are) good property(ies) of an interest point?

A. Strong 2D structure

B. Two eigenvalues of a local auto-correleation matrix (M) are both large.

C. Edges make good interest points.

D. The derivatives are small.

E. There are bright colours.

6. (2 points) Which of the following are not advantages of the SIFT feature over the Harris corner?

A. Response is invariant to image rotation

B. Response is invariant to image scaling

C. Response in invariant to image image translation

D. Less computation required for computation of a SIFT feature.

E. SIFT provides distinctive features for matching across images.

7. (2 points) Which of the following are properties of a Gaussian Filter?

A. It is separable.

B. Pixel weight is scaled according to the nearness of the pixel to the centre of the filter. C. The only parameter is the sigma of the Gaussian.

D. Applying it always increase the overall image brightness.

E. Its application sharpens edges in the image.

F. Cascading the filter with size σ = a followed by σ = b has the same effect as applying a single filter with size σ = (a + b)

8. (2 points) Suppose you have a simple fully connected linear network, with bias, that takes a 32x32 greyscale image as input. The input is connected directly to a classification space with 10 output units. How many free parameters are there in the network?

A. 10240

B. 10250

C. 10260

D. 320

E. 330

F. 340

9. (2 points) Suppose we have a softmax function at the end of our network that has 4 outputs correspond- ing to 4 classes. Which of the following are possible outputs from the function?

A. 0.1 1 0.3 0.7

B. 0.3 0.4 0.2 0.1

C. 1 0 1 0

D. 0.4 0.1 0.4 0.1

E. 0.5 0 1.5 0

F. 0.5 0 -0.5 0

G. 0.5 0 0.7 -0.2

H. None of the above

10. (2 points) Which loss function is the best suited to classification problems? Note that y is the estimated value, y* defines the ground truth. (Please select multiple answers if you consider them to be equally good).

A. 2

B. y * log(y) + (1 − y*)log(1 − y)

C. |(y − y*)|

D. None of these

11. (2 points) Suppose we have a 7x7 convolutional layer, followed by a ReLU, then sequentially another two 3x3 convolutional layers, each of them is followed by a ReLU. Note that the stride is equal to 1 for all convolutional layers. What is the effective receptive field of a node after the third convolutional layer?

A. 7x7

B. 9x9

C. 10x10

D. 3x3

E. 11x11

F. None of the above.

12. (2 points) Suppose a network has an input image of 240x240. A convolution layer of 5x5 with no padding, and a stride of two is applied. This is followed by a ReLU, and then a pooling layer of 2x2, stride 2. What is the size of the output of the pooling layer?

A. 60x60

B. 59x59

C. 118x118

D. 240x240

E. 120x120

F. None of the above.

13. (2 points) Which of the following does regularization in Neural Networks help with?

A. keeping the weights small.

B. preventing overfitting.

C. making some of the weights zero.

D. None of the above.

Q2. (34 points) Short Answer Questions. (No answer should be more than half a page.)

1. (3 points) Please provide one example each for a point operation and a neighborhood operation.

2. (5 points) Please describe the differences between Gaussian filter and Bilateral filter. What are the advantages and disadvantages of the bilateral filter?

3. (3 points) How does SIFT extract scale invariant features?

4. (3 points) What are the advantages of a separable filter?

5. (3 points) What are the advantages and disadvantages of forward warping and inverse warping for image transformation?

6. (5 points) Why is Harris corner detector rotation invariant?

7. (5 points) Please discuss whether bilateral filtering can remove salt and pepper noise. Please provide the reasoning.

8. (7 points) Compare the activation function shown in Fig. 1 with a Relu function. Discuss the differences and the behavior of this activation function when training a network.

Figure 1: Activation Function Comparison

Figure 2: Image filter (a) and Image patch (b)

Q3. (40 points) Calculation questions. Please only provide the final answer.

'（214(230) 31(22) 25(9) 12(4) 2(7))'

1. (5 points) Given an input 5x5 image: ' 109 2 1 3 2 ' ,

( 21 2 4 3 3)

suppose a Deep CNN uses a filter (correlation) that it has learned: l016 0(0).(.)2(3) 01(.)4」to this image patch.

[ 2 0.5 0.6l

Then it is followed by a ReLU activation function. What will be the result for the centre pixel of this image?

2. (5 points) Suppose that Fig. 2 (b) is an image, and (a) is a filter.

Calculate the result if you were to apply the filter as a convolution operator for the red (upper left) and the blue square (bottom right) (Note: you are going to obtain one value for each region).

3. (15 points) Calculate the value of the blue patch in Fig. 3 using bilateral filtering. Assume the Domain

'（0(0).(.)0375(0318) 0(0).(.)0443(0375) 0(0).(.)0469(0397) 0(0).(.)0443(0375) 0(0).(.)0375(0318))'

kernel is of size 5×5, the standard deviation σd = 3, provided as ' 0.0397 0.0469 0.0495 0.0469 0.0397 '

(0.(.)0318 0.(.)0375 0.(.)0397 0.(.)0375 0.(.)0318)

the Range kernel is of size 5 × 5 and the standard deviation σr = 50. Please 1) provide the range filter associated to the pixel high-lighted in the Fig. 3, and 2) show the filtered value for the high-lighted pixel.

4. (15 points) Back propagation through the computational graph (see Fig. 4). The current value for w0 = 0.2, w1 = 0.2, w2 = 0.3, x0 = 2, x1 = 3. p and q define the intermediate variables that are calculated during training, at the specified points in the computation graph. l is the output of the computational graph. Please provide the gradient and based on the back-propagated gradient calculation.

Figure 3: Image Patch

Figure 4: Computational Graph. * is the multiplication operator, + is addition operator, exp denotes the exponential operator, ’+1’ defines the sum of the input with 1 and ’-1’ defines the subtraction of 1 frominput, defines the reciprocal operator.