Assignment 1 Chapter 1 1.5 For the following colors in the digital RGB coordinate, determine their valuse in the YCbCr coordinate. (a)(255, 255, 255) (b)(0, 255, 0) (c)(255, 255, 0) (d)(0, 255, 255) Y Cb Cr 0.257 = -0.148 0.439 0.504 -0.291 -0.368 (a) 0.098 0.439 -0.071 R G B (b) Y (c) 235.045 169.51 Cb = 128 = 165.74 Cr 128 16.055 + 16 128 128 Y Cb Cr = (d) 144.52 Y 53.795 Cb 34.16 Cr = 210.055 Y 16.055 Cb 146.105 Cr Please calculate the needed bit-rate (bits/s) of a non-coded video stream with following formats: 1. Color Coordinate: YCbCr 2. Frame size: 352 x 288 3. Frame rate: 30 fps 4. Sampling format: (a) 4:4:4 (b) 4:2:2 (c) 4:2:0 (a) 352*288*(1+1+1)*30*8=727990720 bits/sec. (b) 352*288*(1+1/2+1/2)*30*8=48660480 bits/sec. (c) 352*288*(1+1/4+1/4)*30*8=36495360 bits/sec. Chapter 2 2.5 Consider a horizontal bar pattern on a TV screen with 100 cycle/picture-height. If the picture-height is 1 meter, and the viewer sits at 3 meters from the screen, what is the equivalent angular frequency in cpd? What if the viewer sits at 1 meter or 5 meters distance? In either case, would the viewer be able to perceive the vertical variation properly? When viewer sits at 3 meter from the screen, fθ=5.236 (cycle/degree) When viewer sits at 1 meter from the screen, fθ=1.7453 (cycle/degree) When viewer sits at 5 meter from the screen, fθ=8.7266 (cycle/degree) The maximum sensitivity occurs at about 3~5 cpd, so d = 3 m will be best 2.6 Consider an object that has a flat, homogeneously textured surface with maximum spatial frequency of (fx,fy)=(3,4) cycle/meter, and that is moving at a constant speed of (vx,vy)=(1,1) meter/second. What is the temporal frequency of the object surface at any point? What are the results for the following speeds (in meter/second):(4,-3),(4,0),(0,1)? When (fx, fy) = (3, 4), (vx, vy) = (1, 1) ft,max = -(vx fx+vy fy) = -(1*3+1*4) = -7 cycle/second When (fx, fy) = (3, 4), (vx, vy) = (4, -3) ft,max = -(vx fx+vy fy) = -(4*3+(-3)*4) = 0 cycle/second When (fx, fy) = (3, 4), (vx, vy) = (4, 0) ft,max = -(vx fx+vy fy) = -(4*3+0*4) = -12 cycle/second When (fx, fy) = (3, 4), (vx, vy) = (0, 1) ft,max = -(vx fx+vy fy) = -(0*3+1*4) = -4 cycle/second Assignment 2 Chapter 5 5.2 Show that, under orthographic projection, the projected 2-D motion of a planar patch undergoing translation, rotation, and scaling (because of camera zoom) can be described by an affine function. x=X , y=Y (5.1.3) X ' r 1 r 2 r 3 X Tx Y ' = Y + Ty r 4 r 5 r 6 Z ' r 7 r 8 r 9 Z Tz (5.5.12) orthographic projection 將(5.1.3)代入(5.5.12) planar patch 將 aX+bY+cZ=1 代入(5.5.12) 則 x’=r1X+r2Y+r3Z+Tx = r1x+r2y+r3 ( =( r1- 1 ax by )+Tx c r 3a r 3b r3 )x+( r2- )y+( Tx+ ) c c c dx(x, y)=x’-x=( r1- =( Tx+ r 3a r 3b r3 )x+( r2- )y+( Tx+ )-x c c c r3 r 3a r 3b )+( r1- -1)x+( r2- )y c c c r3 r 3a =a0, r1- -1=a1, c c 則 dx(x, y)= a0+a1x+a2y 令 Tx+ r2- 而 y’= r4X+r5Y+r6Z+Ty = r4x+r5y+r6 ( 1 ax by )+Ty c r 3b =a2 c =( r4- r 6a r 6b r6 )x+( r5- )y+( Ty+ ) c c c dy(x, y)=y’-y=( r4- =( Ty+ r 6a r 6b r6 )x+( r5- )y+( Ty+ )-y c c c r6 r 6a r 6b )+( r4- )x+( r5- -1)y c c c r6 r 6a =b0, r4- = b1, c c 則 dy(x, y)= b0+b1x+b2y 令 Ty+ dx( x, y ) a0 a1x a2y 推得 = dy ( x, y ) b0 b1x b2y r5- r 6b -1= b2 c (Affine Motion) Chapter 6 6.1 Describe the advantages and disadvantages of different motion representation methods (pixel-based, block-based, mesh-based, region-based, and global). See sec 6.2.1 6.9 In Section 6.9.2, we derived the number of operations required by the HBMA, if the search range at every level is R/2L-1. What will be the number if one uses a search range of ±1 pel in every level, except at the first level, where the search range is set to R/2L-1? Is this parameter set-up appropriate? L ( M / 2 L 1 ) 2 (2 R / 2 L 1 1) 2 (M / 2 L l ) 2 (2 R / 2 L 1 1) 2 l 2 (M / 2 L 1 2 ) (2 R / 2 L 1 1) 12M 2 (1 41 L ) 2 For higher levels, 1 is too small for sequences with high motions. Please discuss how to solve this problem.
© Copyright 2026 Paperzz