Solutions to the Spring 2016 CAS Exam S There were 45 questions in total, of equal value, on this 4 hour exam. There was a 15 minute reading period in addition to the 4 hours. The Exam S is copyright 2016 by the Casualty Actuarial Society. The exam is available from the CAS. The solutions and comments are solely the responsibility of the author. CAS Exam S prepared by Howard C. Mahler, FCAS Copyright 2016 by Howard C. Mahler. Howard Mahler [email protected] www.howardmahler.com/Teaching Solutions Spring 2016 CAS Exam S, HCM 6/28/16, Page 1 1. Alice, Bob and Chris are hired by a firm to drive three vehicles - a bus, a taxi and a train (only one driver is needed for each). Each employee has different skills and requires different amounts of training on each vehicle. The cost to train each employee i for vehicle j, Ci,j, is an independent exponential random variable with mean of 200. To minimize the total cost of training, the firm uses the following procedure to assign the employees to their vehicle: • Alice is assigned to the vehicle which minimizes her training cost, CAlice,j. • Bob is then assigned one of the two remaining vehicles which minimizes CBob,j. € € € • Chris is then assigned the remaining vehicle. Calculate the firm's expected total cost of training these three employees, if it uses this assignment algorithm. A. Less than 200 B. At least 200, but less than 300 C. At least 300, but less than 400 D. At least 400, but less than 500 E. At least 500 1. C. E[Training costs of Alice] = E[minimum of 3 exponentials] = θ/3. E[Training costs of Bob] = E[minimum of 2 exponentials] = θ/2. E[Training costs of Chris] = E[exponential] = θ. Expected total training costs: θ(1/3 + 1/2 + 1) = (200)(11/6) = 366.67. Comment: An example of a “Greedy Algorithm for Assignment” as discussed in Example 5.7 of An Introduction to Probability Models, by Ross. Solutions Spring 2016 CAS Exam S, € € HCM 6/28/16, Page 2 2. You are given the following information: • Vehicles arrive at the drive-through bank at a Poisson rate of 30 per hour. • On average, 10% of these vehicles require cash transactions. Calculate the probability that at least 3 vehicles require cash transactions between 10 AM and noon. A. Less than 0.915 B. At least 0.915, but less than 0.925 C. At least 0.925, but less than 0.935 D. At least 0.935, but less than 0.945 E. At least 0.945 2. D. The number of cash transactions over two hours is Poisson with mean: (2)(10%)(30) = 6. Prob[at least 3] = 1 - f(0) - f(1) - f(2) = 1 - e-6 (1 + 6 + 62 /2) = 93.80%. Solutions Spring 2016 CAS Exam S, € € HCM 6/28/16, 3. You are given: • The number of claims, N(t), follows a Poisson process with intensity: λ(t) = t/2, 0 < t < 5 λ(t) = t/4, t ≥ 5 • By time t = 4, 15 claims have occurred. Calculate the probability that exactly 16 claims will have occurred by time t = 6. A. Less than 0.075 B. At least 0.075, but less than 0.125 C. At least 0.125, but less than 0.175 D. At least 0.175, but less than 0.225 E. At least 0225 3. B. The number of claims from time 4 to time 6 is Poison with mean: 5 6 ∫ t / 2 dt + ∫ t / 4 dt = (52 - 42)/4 + (62 - 52)/8 = 3.625. 4 5 Prob[exactly one claim between time 4 and 6] = 3.625 e-3.625 = 9.66%. € € Page 3 Solutions Spring 2016 CAS Exam S, € € € € HCM 6/28/16, Page 4 4. You are given the following information about the rate at which calls come into a customer service center: • The customer service center is open 24 hours a day. • Calls are received from four regions. • The rate at which calls arrive is 10 calls per hour and follows a Poisson distribution. • The probability that a call will be from a given region is independent of other calls and is shown in the table below. Region Probability 1 0.2 2 0.3 3 0.1 4 0.4 Calculate the probability that in a one-hour time period at least one call will have been received from each region. A. Less than 0.25 B. At least 0.25, but less than 0.30 C. At least 0.30, but less than 0.35 D. At least 0.35, but less than 0.40 E. At least 0.40 4. E. The number of calls from each region are independent Poissons with means: 2, 3, 1, and 4. Prob[at least one call from each region] = Prob[> 0 from Region 1] Prob[> 0 from Reg. 2] Prob[> 0 from Reg. 3] Prob[> 0 from Reg. 4] = (1 - e-2) (1 - e-3) (1 - e-1) (1 - e-4) = 50.98%. Solutions Spring 2016 CAS Exam S, HCM 6/28/16, Page 5 5. You are given the following information on a health insurance policy: • Claims occur according to a Poisson process with mean λ = 10 per week. • Claim amounts follow the Pareto distribution with probability density function f(x) = € € (4) (10004 ) ,0<x (x + 1000)5 Calculate the probability that the total claim amount in 52 weeks exceeds 200,000 using the Normal approximation. € than 0.02 A. Less B. At least 0.02, but less than 0.03 C. At least 0.03, but less than 0.04 D. At least 0.04, but less than 0.05 E. At least 0.05 5. B. Looking in the tables attached to the exam, the Pareto Distribution has α = 4 and θ = 1000. The Pareto has mean: θ/(α-1) = 1000/(4-1) = 333.333. The Pareto has second moment: 2 θ2 (2) (10002) = = 1/3 million. (α - 1)(α - 2) (4 -1) (4 - 2) Thus the mean aggregate loss is: (52)(10)(333.333) = 173,333. The variance of aggregate loss is: (52)(10)(1/3 million) = 173.333 million. € 52 weeks exceeds 200,000 is approximately: Probability that the total € claim amount in 200,000 - 173,333 1 - Φ[ ] = 1 - Φ[2.026] = 2.14%. 173.333 million € Solutions Spring 2016 CAS Exam S, € € € HCM 6/28/16, Page 6 6. You are given the following information: • The lifetimes of all light bulbs follow the exponential distribution. • A new incandescent light bulb has a hazard rate of 0.20 per year. • The expected lifetime of an LED light bulb is twice the expected lifetime of a 2-year-old incandescent light bulb. Calculate the hazard rate per year for an LED light bulb. A. Less than 0.05 B. At least 0.05, but less than 0.07 C. At least 0.07, but less than 0.09 D. At least 0.09, but less than 0.11 E. At least 0.11 6. D. The hazard rate for an Exponential Distribution is constant and equal to 1/mean. The expected lifetime of a 2-year old incandescent light bulb is: 1/0.2 = 5, same as that of a new one. Thus LED light bulbs have mean: (2)(5) = 10, and hazard rate: 1/10. Solutions Spring 2016 CAS Exam S, € € € HCM 6/28/16, Page 7 7. You are given the following information: • Consider a system of three independent components, each of which functions for an amount of time (in months) uniformly distributed over (0, 1). • Under current design, the system will fail if any of the 3 components fail. • A new design was made such that the system will fail if 2 or more components fail. Calculate the increase in expected system life in months under the new system design. A. Less than 0.195 B. At least 0.195, but less than 0.215 C. At least 0.215, but less than 0.235 D. At least 0.235, but less than 0.255 E. At least 0.255 7. D. Under the current design, the expected lifetime is the minimum of three uniform distributions over (0, 1): 1/(3+1) = 1/4. Under the new design, the expected lifetime is the 2nd order statistic from three uniform distributions over (0, 1): 2/(3+1) = 1/2. The increase in expected system life is: 1/2 - 1/4 = 1/4. Solutions Spring 2016 CAS Exam S, HCM 6/28/16, Page 8 8. In an airport, you are given the following information: • Taxis arrive in accordance with a Poisson process with rate of λT = 2 per minute. • Customers arrive in accordance with a Poisson process with rate λC = 3 per minute. € € € € • A taxi will wait no matter how many taxis are present. • An arriving customer that does not find a taxi waiting will leave. Calculate the proportion of arriving customers that find taxis. A. Less than 0.2 B. At least 0.2, but less than 0.4 C. At least 0.4, but less than 0.6 D. At least 0.6, but less than 0.8 E. At least 0.8 8. D. Let Pi = proportion of the time there are i taxis present. The balance equations are: 2P0 = 3P1 . ⇒ P1 = (2/3) P0 . 2P1 = 3P2 . ⇒ P2 = (2/3) P1 = (2/3)2 P0 . 2P€ 2 = 3P3 . ⇒ P3 = (2/3) P2 = (2/3)3 P0 . In general, Pi = (2/3)i P0 . € Since € the proportions add to one: 1 = P0 (1 + 2/3 + (2/3)2 + (2/3)3 + ...} = 3P0 . ⇒ P0 = 1/3. The proportion of arriving customers that find taxis is: 1 - P0 = 2/3. € Comment: This is an example of a birth and death process. The rate at which the process goes from state i to state i+1 must be the same as the rate at which the process goes from state i+1 to state i. The proportions follow a Geometric Distribution with β = 2. For a Birth-Death Process that has the rate of births λ and rate of deaths µ each constant, P0 = 1 - λ/µ. In this case: 1 - P0 = λ/µ = 2/3. Solutions Spring 2016 CAS Exam S, € € € HCM 6/28/16, 9. You are given the following information: • A firm has modeled the annual movement of its workers using a Markov chain with the following three states: 0. Fully Employed 1. Temporarily Disabled 2. Retired • The associated transition matrix is: ⎛0.7 0.1 0.2⎞ ⎜ ⎟ P = ⎜0.4 0.3 0.3⎟ ⎜ ⎟ ⎝0.0 0.0 1.0⎠ • Tom is Fully Employed with the firm this year. Calculate the probability that Tom will be Fully Employed in two years. € A. Less than 0.52 B. At least 0.52, but less than 0.54 C. At least 0.54, but less than 0.56 D. At least 0.56, but less than 0.58 E. At least 0.58 9. B. (1, 0, 0) P = (0.7, 0.1, 0.2). (0.7, 0.1, 0.2) P = (0.53, 0.10, 0.37). Page 9 Solutions Spring 2016 CAS Exam S, € € € € HCM 6/28/16, Page 10 10. You are given the following information: • A taxi driver provides service for two cities, C1 and C2, only. • If the taxi driver is in the city C1, the probability that he has to drive the customer to city C2 is 0.35. • If the taxi driver is in the city C2, the probability that he has to drive the customer to city C1 is 0.75. • The expected profit for each trip is: • Within C1: $6 • Within C2: $5 • Between C1 and C2: $10 € Calculate the long-term expected profit per trip. € A. Less than $7.50 € B. At least $7.50, but less than $8.00 C. At least $8.00, but less than $8.50 D. At least $8.50, but less than $9.00 E. At least $9.00 10. B. Let the states of a Markov chain be the city in which the taxi starts a trip. 1 ⎛0.65 0.35 ⎞ The transition matrix for this Markov chain is: ⎜ ⎟ . 2 ⎝0.75 0.25 ⎠ The equations for the stationary distribution are: 0.65π1 + 0.75π2 = π1 . 0.35π1 + 0.25π2 = π2 . π1 + π2 = 1. ⇒ π1 = 15/22, and π2 = 7/22. €€ The average profit when the taxi starts in city 1 is: (0.65)(6) + (0.35)(10) = 7.4. The average profit when the taxi starts in city 2 is: (0.25)(5) + (0.75)(10) = 8.75. The long term average profit per trip is: (15/22)(7.4) + (7/22)(8.75) = 7.83. Comment: Similar to 3, 1/01, Q.22. With only two states: π1 = P21/(P21 + P12) and π2 = P12/(P21 + P12). π 1 = 0.75/(0.75 + 0.35) = 15/22, and π2 = 0.35/(0.75 + 0.35) = 7/22. Solutions Spring 2016 CAS Exam S, HCM 6/28/16, Page 11 11. You are given the following information: A three-state Markov Chain, with the following transition probability matrix, is used to model the movement of policyholders between three States: States 0 1 2 0 0.5 0.5 0.0 1 0.4 0.4 0.2 2 0.3 0.4 0.3 Calculate the stationary percentage of policyholders in State 2. A. Less than 13% B. At least 13%, but less than 23% C. At least 23%, but less than 33% D. At least 33%, but less than 43% E. At least 43% 11. A. The balance equations are: 0.5π0 + 0.4π1 + 0.3π2 = π0. 0.5π0 + 0.4π1 + 0.4π2 = π1. 0.2π1 + 0.3π2 = π2. ⇒ π1 = 3.5 π2. From the first equation: 0.5π0 = 0.4π1 + 0.3π2 = 1.7 π2. ⇒ π0 = 3.4 π2. Also π0 + π1 + π2 = 1. Thus, 3.4 π€2 + 3.5 π2 + π2 = 1. ⇒ π2 = 1/(3.4 + 3.5 + 1) = 1/7.9 = 0.1266. € ⇒ π0 = 3.4/7.9 = 0.4304. π1 = 3.5/7.9 = 0.4430. € € Solutions Spring 2016 CAS Exam S, € € € € € € HCM 6/28/16, 12. You are given the following information: • A 3-year term insurance policy on a life age (80) provides for a death benefit, payable at the end of the year of death. • The death benefit is 1000t, t ∈ {1, 2, 3}, where t is the year of death. • This policy is purchased by a single premium, P, at time 0. • If (80) lives to age 83, the single premium is returned without interest. • Mortality rates follow€the Illustrative Life Table. • i = 0.10 Calculate P using the equivalence principle. A. Less than 830 B. At least 830, but less than 850 C. At least 850, but less than 870 D. At least 870, but less than 890 E. At least 890 12. E. APV of future benefits is: 1000 v q80 + 2000 v2 2 q80 + 3000 v3 3 q80 = (1000/1.1) 3,914,365 - 3,600,038 3,600,038 - 3,284,542 + (2000/1.12 ) 3,914,365 3,914,365 3,284,542 - 2,970,496 = 387.06. 3,914,365 € € APV of future premiums including the possible return of the single premium is: 2,970,496 P - P v3 3 q80€= P (1 /1.13 ) = 0.42985 P. 3,914,365 + (3000/1.13 ) Thus, 0.42985 P = 387.06. ⇒ P = 900.45. € € Page 12 Solutions Spring 2016 CAS Exam S, HCM 6/28/16, Page 13 13. Let Y1 , ... ,Yn denote a random sample from the uniform distribution on the interval: (θ, θ + 1). You are given: • There are two unbiased estimators for θ, θ^ 1 and θ^ 2 θ^ 1 = Y - 1/2 € θ^ 2 = Y(n) - n/(n+1), where€Y(n) = max(Y 1 , ... ,Yn ) € € € • The efficiency of θ^ 1 relative to θ^ 2 is given by eff( θ^ 1 , θ^ 2 ) = € € € € 12 n2 (n + 2) (n +1)2 • s is the minimum sample size required to make θ^ 2 more efficient than θ^ 1 € € € € • s is less than 10 € Calculate Var( θ^ 2 | n = s). A. Less than 0.007 B. At least 0.007, but less than 0.009 C. At€least 0.009, but less than 0.011 D. At least 0.011, but less than 0.013 E. At least 0.013 € € Solutions Spring 2016 CAS Exam S, HCM 6/28/16, Page 14 13. C. In order for θ^ 2 to be more efficient than θ^ 1 , the efficiency of θ^ 1 relative to θ^ 2 has to be less 12 n2 than one. We try values of n, eff( θ^ 1 , θ^ 2 ) = . (n + 2) (n +1)2 € € € € For n = 7, eff( θ^ 1 , θ^ 2 ) = 1.02083. For n = 8, eff( θ^ 1 , θ^ 2 ) = 0.948148. € € So the minimum sample size that works is 8. € Var[€ θ^ 1 ] €= Var[ Y - 1/2] = Var[ Y ] = Var[Y] € /€n. However, Y is uniformly distributed over an interval of width one. ⇒ Var[Y] = 1/12. ^ ^ ⇒ Var[ € θ 1 ] = 1/(12n). € For n = 8, Var[ θ 1 ] = 1/96. € Now for n = 8, Var[ θ^ 2 ] / Var[ θ^ 1 ] = eff( θ^ 1 , θ^ 2 ) = 0.948148. € € ⇒ Var[ θ^ ] = 0.948148/96 =€0.009877. 2 € € Var[ θ^ € € € Alternately, 2 ] = Var[Y(n) - n/(n+1)] = Var[Y(n)]. € Var[Y(n)] is the same as the variance of the maximum of a sample of size n from a uniform on (0, 1). € This maximum follows a Beta Distribution with a = n = 8 and b = 1. € a(a +1) (8)(9) Mean = a/(a+b) = 8/9. Second Moment = = = 0.8. (a + b)(a + b + 1) (9)(10) Variance = 0.8 - (8/9)2 = 0.009877. Comment: Var[ θ^ 1 ] = € € Var[ θ^ 2 ] is the variance of a Beta Distribution with a = n and b = 1: € ⎛ n ⎞2 ⎛ n ⎞2 n(n +1) € n n - ⎜ ⎟ = - ⎜ ⎟ = . (n +1)(n + 1+1) ⎝ n + 1⎠ n + 2 ⎝ n + 1⎠ (n + 2) (n +1)2 € 12 n2 ⇒ eff( θ^ 1 , θ^ 2 ) = Var[ θ^ 2 ] / Var[ θ^ 1 ] = , as given in the question. (n + 2) (n +1)2 € € € € € € 1 . 12n € € € € € Solutions Spring 2016 CAS Exam S, € € € HCM 6/28/16, Page 15 14. You are given the following information: • Loss severity follows an exponential distribution with mean θ. • Small claims are handled directly by the insured, and no information about losses less than 10,000 is available to you. • You observe the following four claim payments net of the 10,000 deductible: 11,000 10,500 12,000 15,000 Calculate the Maximum Likelihood Estimate of θ. A. Less than 2,000 B. At least 2,000, but less than 2,050 C. At least 2,050, but less than 2,100 D. At least 2,100, but less than 2,150 E. At least 2,150 14. E. For the Exponential Distribution, θ^ = (sum of the payments) / (number of uncensored values) € = (11,000 + 10,500 + 12,000 + 15,000) / 4 = 12,125. Comment: Based on the given choices, I suspect the writer of this question at some point intended the data to be loss amounts prior to the effect of the deductible rather than claim payments net of the 10,000 deductible. In the case of the given data being loss amounts rather than payments: θ^ = (1000 + 500 + 2000 + 5000)/4 = 2,125, corresponding to choice D. € Solutions Spring 2016 CAS Exam S, HCM 6/28/16, Page 16 15. You are given three normal distributions with the same mean and varying variances, 1 1 1 , , . θ 2θ 4θ You select one random draw from each of the distributions respectively: 0, 4, 8 € The € probability € density of the normal distribution is: f(x I µ, σ2) = 1 (x - µ)2 exp[]. 2σ2 σ 2π Determine the Maximum Likelihood Estimate of θ. A. Less than 0.04 € B. At least€0.04, but less than 0.06 C. At least 0.06, but less than 0.08 D. At least 0.08, but less than 0.10 E. At least 0.10 15. B. ln[f(x)] = -(x- µ)2 /(2σ2) - lnσ + constants. Ignoring constants, the loglikelihood is: -µ2θ/2 + lnθ /2 - (4- µ)2 θ + ln[2θ] /2 - (8- µ)2 2θ + ln[4θ] /2 = -θ {µ2/2 + (µ - 4)2 + 2(µ - 8)2 } + 1.5 ln[θ] + ln[2]/2 + ln[4]/2. Set the partial derivative with respect mu equal to zero: 0 = -θ {µ + 2(µ - 4) + 4(µ - 8)}. ⇒ µ = 40/7. Set the partial derivative with respect theta equal to zero: 0 = -{µ2/2 + (µ - 4)2 + 2(µ - 8)2 } + 1.5/θ. € ⇒ θ = 1.5 / {(40/7)2 /2 + (40/7 - 4)2 + 2(40/7 - 8)2 } = 0.0505. € Solutions Spring 2016 CAS Exam S, HCM 6/28/16, Page 17 16. You are given: • A density function, f(x) = 5θ x3 exp[-θx4 ] • A random sample of size n from this distribution Calculate the Rao-Cramer lower bound for the variance of an unbiased estimator of θ. € A. € € € θ2 n B. 1 θn C. D. 16. A. ln[f(x)] = ln5 + lnθ + 3lnx -θx4 . € € € ∂ ln[f(x)] = 1/θ - x4 . ∂θ 5θ2 n- 1 n E. 2 θ € ∂2 ln[f(x)] = -1/θ2. ∂θ2 Rao-Cramer lower bound is: € θ2 4n -1 ∂2 ln[f(x)] n E[ ] ∂θ2 = θ2/n. Comment: The Rao-Cramer lower bound goes down as 1/n, eliminating choices D and E. This is a Weibull Distribution, parameterized differently than in the tables attached to the exam. € Solutions Spring 2016 CAS Exam S, HCM 6/28/16, Page 18 17. Determine the Fisher Information of n independent samples from a geometric distribution with mean β. A. 1/β B. n/β C. n/β - n/(β + 1) D. nβ/(β + 1) E. β(β+1)/n 17. C. The method of moments is equal to maximum likelihood for the Geometric. ^ ^ β = X . Var[ β ] = Var[ X ] = Var[X] /n = β(1+β)/n. ^ Fisherʼs Information is: 1/Var[ β ] = n/ {β(1+β)} = n/β - n/(β + 1). € € Alternately, € f(x) = βx/(1+β)x+1. € ln[f(x)] = x lnβ - (x+1) ln[1+β]. € ∂ ln[f(x)] = x/β - (x+1)/(1+β). ∂β ∂2 ln[f(x)] = -x/β2 + (x+1)/(1+β)2 . ∂β2 € E[ € ∂2 ln[f(x)] ] = -β/β2 + (β+1) /(1+β)2 = -1/β + 1/(1+β). ∂β2 Fisherʼs Information is: -n E[ € ∂2 ln[f(x)] ] = n/β - n/(β + 1). ∂β2 Comment: Fisherʼs Information is proportional to n, eliminating choices A and E. € Solutions Spring 2016 CAS Exam S, HCM 6/28/16, Page 19 18. You are testing whether the number of claims in year 2 is independent of the number of claims in year 1. You are given the following contingency table: Claims in Year 2 Claims in Year 1 0 1+ 0 20 21 1+ 30 21 Calculate the Chi-Square test statistic. A. Less than 0.5 B. At least 0.5, but less than 1.0 C. At least 1.0, but less than 1.5 D. At least 1.5, but less than 2.0 E. At least 2.0 18. B. Here is the calculation of the contributions to the Chi-Square Statistic of 0.924: Yr Two 0 Yr Two 1+ Sum Yr One 0 Yr One 1+ 20 30 21 21 41 51 Sum 50 42 92 Yr One 0 Yr One 1+ Sum Yr One 0 Yr One 1+ Sum Expected Number 22.2826 18.7174 27.7174 23.2826 Sum 41.0000 51.0000 50.0000 42.0000 92.0000 Contribution to Chi-Square 0.2338 0.2784 0.1880 0.2238 Sum 0.5122 0.4118 0.4218 0.9240 0.5022 For example, in the first row and first column, the expected number is: (50)(41)/92 = 22.2826. The contribution from the first row and first column is: (20 - 22.2826)2 / 22.2826 = 0.2338 Comment: There is: (2 - 1)(2 - 1) = 1 degree of freedom. Using a computer, the p-value is 33.6%. Solutions Spring 2016 CAS Exam S, HCM 6/28/16, Page 20 19. To test whether taking a driver's education course improves driving skills, you set up an experiment. Participants are given a pretest of their driving skills and are tested on their skills again after the course. The data is contained in the table below. User ID Pretest Posttest Difference 1 62 60 -2 2 93 94 1 3 97 100 3 4 77 80 3 5 69 75 6 6 72 78 6 7 99 100 1 8 55 57 2 9 90 91 1 10 66 63 -3 11 76 83 7 12 51 49 -2 13 74 84 10 14 52 61 9 15 97 99 2 16 54 52 -2 Sum 1184 1226 42 Sum of Squares 91,940 98,616 352 Assume the scores follow a Normal distribution. Calculate the statistic to test if there is a significant difference in the two scores. A. Less than -3 B. At least -3, but less than -1 C. At least -1, but less than 1 D. At least 1, but less than 3 E. At least 3 Solutions Spring 2016 CAS Exam S, HCM 6/28/16, Page 21 19. D. The differences Wi are Normal with unknown variance. We apply a t-test. W = 42/16 = 2.625. S W2 = (16/15) (352/16 - 2.6252 ) = 16.1167. € t= € W SW 2 / n = 2.625 16.1167 / 16 = 2.615. Comment: The t-statistic has 16 - 1 = 15 degrees of freedom. For a two-sided test, the p-value is 1.95%. € In general, the sign of the t-statistic depends on the order in which one takes the differences; as most people would, I have used the differences given in the question. Solutions Spring 2016 CAS Exam S, € € HCM 6/28/16, Page 22 20. You are given the following information: • The Underwriting Department tells you that 40% of homeowners who stated they have alarm systems have cancelled their alarm service. • A survey of 1,000 homeowners who stated they have alarm systems shows that 428 homeowners have cancelled their service. • You test the following hypotheses: H0 ; The Underwriting Department's claim is accurate. H1 : The Underwriting Department's claim is inaccurate. € Calculate the minimum significance level at which you would reject the null hypothesis. A. Less than 0.005 B. At least 0.005, but less than 0.010 C. At least 0.010, but less than 0.025 D. At least 0.025, but less than 0.050 E. At least 0.050 20. E. If H0 is true, then the mean is 400, and the variance is: (0.4)(0.6)(1000) = 240. Using the continuity correction, for the two-sided test the p-value is: 427.5 - 400 2{1 - Φ[ ]} = (2){1 - Φ[1.775]} = (2)(1 - 0.9621) = 7.58%. 240 Alternately, Z = 428 - 400 240 = 1.807. € For the two-sided test, the p-value is: (2){1 - Φ[1.807]} = (2)(1 - 0.9646) = 7.08%. € Solutions Spring 2016 CAS Exam S, HCM 6/28/16, Page 23 21. Given • X1 , X2 , ... , X10 are random samples from a Normal distribution with mean µx and variance 7. • Y1 , Y2 , ... , Y8 are random samples from a Normal distribution with mean µy and variance 9. € • X and Y are independent • W = X - Y is a Normal distribution with mean µx - µy € • H0 : µx = µy € € € • H1 : µx > µy For W = 2, what is the result of the hypothesis test using the Normal distribution? A. Reject H0 at the 0.025 level. € B. Reject H0 at the 0.050 level, but not at the 0.025 level. C. Reject H0 at the 0.075 level, but not at the 0.050 level. D. Reject H0 at the 0.100 level, but not at the 0.075 level. E. Do not reject H0 at the 0.100 level. 21. C. Var[W] = 7/10 + 9/8 = 1.825. Z = 2/ 1.825 = 1.480. The p-value of a one-sided test is: 1 - Φ[1.480] = 6.9%. € Solutions Spring 2016 CAS Exam S, HCM 6/28/16, Page 24 22. You are given the following: • The time it takes an insurance company to adjust a claim follows a Normal distribution with µ = 90 and σ = 30. (Time is in minutes) • The insurance company is considering purchasing software from a vendor that claims the software will reduce adjusting time by ten minutes per claim but without any impact on variation. Before purchasing the software, the insurance company will test the software on a sample of 100 claims and measure the time to adjust claims. • The insurance company will purchase the software only if the improvement is shown to be at least ten minutes and the sample result is subject to no more than a 5% Type I error. Calculate the probability of a Type II error if the vendor's claim is true. A. At least 1%, but less than 2% B. At least 2%, but less than 3% C. At least 3%, but less than 4% D. At least 4%, but less than 5% E. At least 5% € € € 22. D. Let H0 : µ = 90. Let H1 : µ ≤ 80. 5% = Prob[Type I error] = Prob[ X < c | µ = 90] = Φ[ c - 90 302 / 100 ]. c - 90 = -1.645. ⇒ c = 90 - (3)(1.645) = 85.065. € 3 € 85.065 - 80 Prob[Type II error] = 1 - Prob[ X < 85.065 | µ = 80] = 1 - Φ[ ] = 1 - Φ[1.688] = 4.57%. € 302 / 100 ⇒ € € Comment: Personally, I found it a little unclear exactly how to translate the given language into hypothesis testing. “if€ the vendor's claim is true” is redundant and/or confusing. € Solutions Spring 2016 CAS Exam S, HCM 6/28/16, Page 25 23. To test if the distribution of the grades of two classes which have the same teacher are the same, 7 students from Class A and 9 from Class B are randomly selected. The sum of the ranks for Class A is 75. Calculate the Mann-Whitney U test statistic. A. Less than 17 B. 17 C. 18 D. 19 E. At least 20 23. A. U = n1 n2 + n1 (n1 +1)/2 - (sum of the ranks for the first sample of size n1 ) = (7)(9) + (7)(8)/2 - 75 = 16. Comment: Looking in the tables attached to the exam: F(16) = 5.7%. Thus for a two-sided test, the p-value is: (2)(5.7%) = 11.4%. It is somewhat arbitrary which one takes as the first sample; here it makes sense to take A as the first sample since it is both mentioned first and is the smaller sample. While the value of U does, the result of the hypothesis test does not depend on this choice. The total ranks are: (16)(17)/2 = 136. Thus the sum of the ranks for Class B is: 136 - 75 = 61. Solutions Spring 2016 CAS Exam S, HCM 6/28/16, Page 26 24. You are given: • Y1 , Y2 , and Y3 denotes a random sample from an exponential distribution f(y) = e-y/5 / 5, 0<y<∞ • The second-order statistic is Y(2) Calculate E[Y(2)]. € A. Less than 3.0 B. At least 3.0, but less than 4.0 C. At least 4.0, but less than 5.0 D. At least 5.0, but less than 6.0 E. At least 6.0 24. C. For a sample of size N from an Exponential Distribution with mean θ: ⎧ 1 1 1 ⎫ E[X(r)] = ⎨ + + ... + ⎬ θ. ⎩N N -1 N + 1-r ⎭ Here θ = 5 and E[Y(2)] = (1/3 + 1/2) (5) = 4.1667. Alternately, Prob[ E[Y(2)] > y] = Prob[exactly two values > y] + Prob[3 values > y] = € 3F(y)S(y)2 + S(y)3 = 3(1 - e-y/5)(e-y/5)2 + (e-y/5)3 = 3 e-y2/5 - 2e-y3/5. Integrating from 0 to infinity: E[Y(2)] = (3)(5/2) - (2)(5/3) = 4.1667. Solutions Spring 2016 CAS Exam S, € € € HCM 6/28/16, Page 27 25. Two models are used to determine annual losses for a group of policies. • Ten policies are selected at random Policy Model Y Model Z A 500 400 B 500 525 C 500 650 D 750 925 E 750 950 F 1000 1225 G 1250 1125 H 1250 1300 I 1250 1500 J 1500 1575 • H0 : MedianY = MedianZ • H1 : MedianY < MedianZ Calculate the smallest value for significance level for which H0 can be rejected using the Signed-Rank Wilcoxon test for a matched pairs experiment. A. Less than 0.010 B. At least 0.010, but less than 0.025 C. At least 0.025, but less than 0.050 D. At least 0.050, but less than 0.100 E. At least 0.100 25. C. The differences of Z minus Y: -100, 25, 150, 175, 200, 225, -125, 50, 250, 75. Rank these by absolute value: 25, 50, 75, -100, -125, 150, 175, 200, 225, 250. The sum of the negative ranks is: 4 + 5 = 9. Consulting the table attached to the exam for N = 10 for a one-sided test: 8 < 9 ≤ 10. Therefore, reject at 5% and not at 2.5%. Comment: If the alternative were true, we would expect lots of large absolute value positive differences in Z - Y, and thus we would reject if the sum of the ranks for the negative differences is small (and there the sum of the positive ranks is large). For a one-sided test, using the tables attached to the exam, one can not reject at 10%; thus the choices in the question could have been better. Solutions Spring 2016 CAS Exam S, HCM 6/28/16, Page 28 26. You are given: • X1 , X2 , ... , Xn , is a random sample from a N(µ, σ2) • σ2 = 200 • The prior distribution is a normal N(µ0, σ02) distribution • µ0 = 70 and σ02 = 10 • There are 5 observations from this distribution, Xt: 53 22 61 86 70 Calculate the lower bound of the 95% Bayesian credibility interval for µ. A. Less than 55 B. At least 55, but less than 60 C. At least 60, but less than 65 D. At least 65, but less than 70 E. At least 70 26. C. The sum of the observed loss is 292. Using greek letters for parameters of the prior Normal, the posterior distribution is also Normal, L σ 2 + µ s2 (292)(10) + (70)(200) with mean = = = 67.68, 2 2 Cσ + s (5)(10) + 200 σ2 s2 (10)(200) and variance = = = 8. C σ2 + s 2 (5)(10) + 200 € € Thus a posterior 95% confidence interval for µ is: 67.68€± 1.960 8 = (62.14 , 73.22). € Alternately, K = 200/10 = 20. Thus, the Buhlmann Credibility is: Z = 5/(5+20) = 1/5. The posterior distribution is also Normal, with mean € = (1/5)(292/5) + (1 - 1/5)(70) = 67.68, and variance = (1 - Z)(σ02) = (4/5)(10) = 8. Proceed as before. Comment: The third bullet would have been better if it had said: “The prior distribution of µ is a normal N(µ0, σ02) distribution.” Solutions Spring 2016 CAS Exam S, HCM 6/28/16, Page 29 27. For ABC Insurance Company, you are given the following information: • All insureds come from one of three population classes. • The claim frequency of each insured follows a Poisson process. • The claims information is as follows: € € € Population Expected time Probability of being Claim Class between claims in class cost I 12 months 0.25 $5,000 II 18 months 0.25 $5,000 III 36 months 0.50 $5,000 Calculate the expected loss in year 2 for an insured that had no claims in year 1. A. Less than 2,400 B. At least 2,400, but less than 2,500 C. At least 2,500, but less than 2,600 D. At least 2,600, but less than 2,700 E. At least 2,700 27. C. For the three classes, λ = 1, 1/ 1.5 = 2/3, and 1/3. The chance of the observation is: e− λ. We apply Bayes Analysis: A B A Priori Population Chance of Class This Class 4-sided 6-sided 8-sided Overall 0.250 0.250 0.500 C Chance of the Observation 0.36788 0.51342 0.71653 D E F Prob. Weight = Posterior Product Chance of Mean of Columns This Type Frequency B&C of Class 0.09197 0.12835 0.35827 15.90% 22.18% 61.92% 1.0000 0.6667 0.3333 0.57859 1.000 0.5133 For example, e-1 = 0.36788. (0.25)(0.36788) = 0.09197. 0.09197/0.57859 = 15.90%. (15.90%)(1) + (22.18%)(2/3) + (61.92%)(1/3) = 0.5133. Thus the estimated future loss is: ($5000)(0.5133) = $2567. Solutions Spring 2016 CAS Exam S, € € € HCM 6/28/16, Page 30 28. You are given the following information: • There are five observations, y1 , y2 , y3 , y4 , y5 , from a gamma distribution: 6.85 10.3 20.89 15.06 6.26 • Var(Y) = 37.67 • The gamma distribution G(µ, ν) is a member of the exponential family with: a(θ) = -In(-θ) E(Y) = µ Var(Y) = µ2/ν Calculate the maximum likelihood estimate of the dispersion parameter, ϕ, for this distribution. A. Less than 0.00 B. At least 0.00, but less than 1.00 C. At least 1.00, but less than 2.00 € D. At least 2.00, but less than 3.00 E. At least 3.00 28. B. The dispersion parameter for the Gamma Distribution is 1/α. They intend for us to find the maximum Iikelihood conditional on Var(Y) = 37.67. Therefore, we want: αθ2 = 37.67. € When fitting an exponential family via maximum Iikelihood, the fitted mean is equal to the sample mean. Therefore we want: αθ = (6.85 + 10.3 + 20.89 + 15.06 + 6.26) / 5 = 11.872. Dividing the two equations in two unknowns: θ = 37.67 / 11.872 = 3.173. ⇒ α = 11.872/3.173 = 3.742. ⇒ 1/α = 1/3.742 = 0.267. Comment: The dispersion parameter is always positive, eliminating choice A. de Jong and Heller use ν for what is α in the Tables attached to the exam. € a Gamma via maximum likelihood conditional on the variance; One does not usually fit this is not covered in the syllabus readings. This differs from fitting via maximum likelihood with both parameters unknown. (Using a computer, the maximum likelihood fit is α = 4.886 and θ = 2.430.) 37.67 is the sample variance. Thus this weird method of fitting a Gamma is equivalent to setting the sample mean equal to the theoretical mean and the sample variance equal to the theoretical variance. The method of moments would also set the sample mean equal to the theoretical mean but would instead set the biased estimate of the variance equal to the theoretical variance. Solutions Spring 2016 CAS Exam S, HCM 6/28/16, Page 31 29. You are given the following information for a fitted GLM: Response variable Occurrence of Accidents Response distribution Binomial Link Logit Parameter df ^ β Intercept 1 x Driver's Age 2 1 1 € 0.288 2 1 0.064 3 0 0 Area 2 A 1 -0.036 B 1 0.053 C 0 0 Vehicle Body 2 Bus 1 1.136 Other 1 -0.371 Sedan 0 0 The probability of a driver in age group 2, from area C and with vehicle body type Other, having an accident is 0.22. Calculate the odds ratio of the driver in age group 3, from area C and with vehicle body type Sedan having an accident. A. Less than 0.200 B. At least 0.200, but less than 0.250 C. At least 0.250, but less than 0.300 D. At least 0.300, but less than 0.350 E. At least 0.350 € € 29. E. In order to solve for the unknown intercept, we use the given probability of accident for a driver in age group 2, from area C and with vehicle body type Other. 0.22 = exp[x + 0.064 - 0.371] / {exp[x + 0.064 - 0.371] + 1}. ⇒ exp[x + 0.064 - 0.371] = 0.22 / (1 - 0.22) = 0.28205. ⇒ x + 0.064 - 0.371 = ln[0.28205] = -1.2657. ⇒ x = -0.9587. Thus for a driver in age group 3, from area C and with vehicle body type Sedan, the odds (ratio) is: π / (1 - π) = exp[-0.9857 + 0 + 0 + 0] = 0.3834. Comment: The probability of having an€accident for a driver in age group 3, from area C and with vehicle body type Sedan is: 0.3834 / (1 + 0.3834) = 0.277. Note that 0.277 / (1 - 0.277) = 0.383. π / (1 - π) is called the odds, but is also sometimes called the odds ratio. Solutions Spring 2016 CAS Exam S, HCM 6/28/16, 30. You are given the following information for a fitted GLM: Response variable Occurrence of Accidents Response distribution Binomial Link Logit Parameter df ^ β se Intercept 1 -2.358 0.048 Area 2 Suburban € 0 0.000 Urban 1 0.905 0.062 Rural 1 -1.129 0.151 Calculate the modeled probability of an Urban driver having an accident. A. Less than 0.01 B. At least 0.01, but less than 0.05 C. At least 0.05, but less than 0.10 D. At least 0.10, but less than 0.20 E. At least 0.20 30. D. € Exp[-2.358 + 0.905] = 0.190. 1 + Exp[-2.358 + 0.905] Page 32 Solutions Spring 2016 CAS Exam S, HCM 6/28/16, 31. You are given the following information for a fitted GLM: Response variable Claim size Response distribution Gamma Link Log Dispersion Parameter 1 ^ Parameter df β Intercept 1 2.100 €4 1 1 1 0 1 7.678 4.227 1.336 0.000 1.734 Vehicle Class Convertible Coupe Sedan Truck Minivan Station wagon Utility 6 1 1 0 1 1 1 1 1.200 1.300 0.000 1.406 1.875 2.000 2.500 Driver Age Youth Middle age Old 2 1 0 1 2.000 0.000 1.800 Zone 1 2 3 4 5 Calculate the predicted claim size for an observation from Zone 3, with Vehicle Class Truck and Driver Age Old. A. Less than 650 B. At least 650, but less than 700 C. At least 700, but less than 750 D. At least 750, but less than 800 E. At least 800 31. D. exp[2.100 + 1.336 + 1.406 + 1.800] = 766.63. Page 33 Solutions Spring 2016 CAS Exam S, HCM 6/28/16, 32. You are given the following information for a fitted GLM: Response variable Claim size Response distribution Gamma Link Log Dispersion parameter 1 ^ Parameter df β Intercept 1 2.100 €4 1 1 1 0 1 7.678 4.227 1.336 0.000 1.734 Vehicle Class Convertible Coupe. Sedan Truck Minivan Station wagon Utility 6 1 1 0 1 1 1 1 1.200 1.300 0.000 1.406 1.875 2.000 2.500 Driver Age Youth Middle age Old 2 1 0 1 2.000 0.000 1.800 Zone 1 2 3 4 5 Calculate the variance of a claim size for an observation from Zone 4, with Vehicle Class Sedan and Driver Age Middle age A. Less than 55 B. At least 55, but less than 60 C. At least 60, but less than 65 D. At least 65, but less than 70 E. At least 70 Page 34 Solutions Spring 2016 CAS Exam S, HCM 6/28/16, Page 35 32. D. For an observation from Zone 4, with Vehicle Class Sedan and Driver Age Middle age, the mean is: exp[2.1] = 8.166. Thus the variance is: φ V[µ] = (1) (µ2) = 8.1662 = 66.7. Comment: We are given that the dispersion parameter φ is 1. For the Gamma Distribution, V[µ] = µ2. Solutions Spring 2016 CAS Exam S, HCM 6/28/16, Page 36 33. You are given the following information for a GLM of customer retention: Response variable Retention Response distribution Binomial Link Logit ^ Parameter df β Intercept 1 1.530 Number of Drivers 1 >1 €1 0 1 0.000 0.735 Last Rate Change <0% 0%-10% >10% 2 0 1 1 0.000 -0.031 ·0.372 Calculate the probability of retention for a policy with 3 drivers and a prior rate change of 5%. A. Less than 0.850 B. At least 0.850, but less than 0.870 C. At least 0.870, but less than 0.890 D. At least 0.890, but less than 0.910 E. At least 0.910 33. D. € exp[1.530 + 0.735 - 0.031] = 90.33%. 1 + exp[1.530 + 0.735 - 0.031] Solutions Spring 2016 CAS Exam S, HCM 6/28/16, Page 37 34. Determine which of the following statements are true. I. The deviance is useful for testing the significance of explanatory variables in nested models. II. The deviance for normal distributions is proportional to the residual sum of squares. III. The deviance is defined as a measure of distance between the saturated and fitted model. A. I only B. II only C. III only D. All but III E. All 34. E. All three statements are true. See pages 72-73 of de Jong and Heller. Solutions Spring 2016 CAS Exam S, HCM 6/28/16, Page 38 35. You are given the following information about three candidates for a Poisson frequency GLM on a group of condominium policies: Log Model Variables in the Model DF Likelihood AIC BIC 1 Risk Class 5 -47,704 95,418 95,473.61182 2 Risk Class + Region -47,495 3 Risk Class + Region + Claim Indicator 10 -47,365 94,750 € € € € • Insureds are from one of five Risk Class: A, B, C, D, E • Condominium policies are located in several regions • Claim Indicator is either Yes or No • All models are built on the same data Calculate the absolute difference between the AIC and the BIC for Model 2. A. Less than 85 B. At least 85, but less than 95 C. At least 95, but less than 105 D. At least 105, but less than 115 E. At least 115 35. C. Since Model 2 has one fewer parameter than model 3, model 2 has 9 degrees of freedom. AIC = (-2) (maximum loglikelihood) + (number of parameters)(2). BIC = (-2) (maximum loglikelihood)) + (number of parameters) ln(number of data points). Therefore, 95,473.61182 = (-2)(-47,704) + (5) ln(n). ⇒ n = 500,000. For Model 2, AIC = -47,495 + (9)(2). For Model 2, BIC = -47,495 + (9) ln(500,000). The absolute difference between the AIC and € the BIC for Model 2 is: | (9) ln(500,000) - 18 | = 100.1. Solutions Spring 2016 CAS Exam S, HCM 6/28/16, Page 39 36. You are given the following two graphs comparing the fitted values to the residuals of two different linear models: Determine which of the following statements are true. I. Graph 1 indicates the data is homoscedastic II. . Graph 1 indicates the data is heteroskedastic III. Graph 2 indicates the data is non-normal A. I only B. II only C. III only D. I and III E. II and III 36. E. Graph one shows an increasing variance with fitted value. Homoscedasticity would be constant variance, so statement 1 is false; statement 2 is true. The residuals in Graph 2 are not symmetric around zero; there are more extreme positive values than there are extreme negative values. This indicates that the residuals are not normally distributed. Statement 3 is true. Comment: In Graph 2 it is not clear the meaning of the horizontal lines. Note that Graph 2 does not show normalized residuals. In that case one could compare to a Standard Normal Distribution. A Normal Q-Q Plot would have been much more useful than Graph 2. Solutions Spring 2016 CAS Exam S, HCM 6/28/16, Page 40 37. Determine which of the following GLM selection considerations is true. A. The model with the largest AIC is always the best model in model selection process. B. The model with the largest BIC is always the best model in model selection process. C. The model with the largest deviance is always the best model in model selection process. D. Other things equal, when the number of observations > 1000, AIC penalizes more for the number of parameters used in the model than BIC. E. Other things equal, when number of observations > 1000, BIC penalizes more for the number of parameters used in the model than AIC. 37. E. The model with the smallest AIC is usually the best model in model selection process, all other things being equal. Statement A is not true. The model with the smallest BIC is usually the best model in model selection process, all other things being equal. Statement B is not true. The model with the smallest deviance is usually the best model in model selection process, all other things being equal. Statement C is not true. AIC = (-2) (maximum loglikelihood) + (number of parameters)(2). BIC = (-2) (maximum loglikelihood)) + (number of parameters) ln(number of data points). The penalty for AIC is (number of parameters)(2). The penalty for BIC is (number of parameters) ln(number of data points). So the penalties are equal for: 2 = ln(number data Points). ⇒ number of data points = e2 = 7.4. Thus, other things equal, when number of observations ≥ 8, BIC penalizes more for the number of parameters used in the model than AIC. Thus statement E is true. € it is likely that one of them is true. Comment: Since statements D and E are opposites, Solutions Spring 2016 CAS Exam S, € € € HCM 6/28/16, Page 41 38. You are testing the addition of a new categorical variable into an existing GLM, and are given the following information: • A is the change in AIC and B is the change in BIC after adding the new variable. • B > A + 25 • There are 1500 observations in the model. Calculate the minimum possible number of levels in the new categorical variable. A. Less than 3 B. 3 C.4 D. 5 E. More than 5 38. E. Change in AIC is: (2) (number of parameters added). Change in BIC is: ln(1500) (number of parameters added). We want: ln(1500) (number of parameters added) > (2) (number of parameters added) + 25. ⇒ Number of parameters added > 4.7. ⇒ Number of added parameters is at least 5. ⇒ Minimum possible number of levels in the new categorical variable is: 5 + 1 = 6. € € € Solutions Spring 2016 CAS Exam S, HCM 6/28/16, Page 42 39. Determine which of the following statements about GLMs are true. I. The saturated model has the highest possible deviance. II. Deviance follows a χ2 distribution for all models in the exponential family. III. Deviance is a useful measure of goodness of fit for all models in the exponential family. A. I only B. II only C. III only D. I, II and III E. None of the above 39. E. The saturated model has the smallest possible deviance. Statement 1 is false. Deviance approximately follows a χ2 distribution when the scale parameter does not have to be estimated such as for the Poisson, but not for those cases such as the Gamma Distribution in which the deviance parameter has to be estimated. Statement 2 is false. Deviance is not a useful measure of goodness of fit for Bernoulli and Binomial models. Statement 3 is false. Comment: See pages 72 and 107 of de Jong and Heller. Solutions Spring 2016 CAS Exam S, HCM 6/28/16, Page 43 40. You are given the following results from a fitted GLM on the frequency of accidents: ^ Parameter df β se Intercept 1 -11.2141 0.1826 0.0000 1.0874 0.3162 Location Rural City €1 0 1 Calculate the Wald statistic for testing the null hypothesis of βcity = 0. A. Less than 10.5 B. At least 10.5, but less than 11.5 C. At least 11.5, but less than 12.5 D. At least 12.5, but less than 13.5 E. At least 13.5 ⎛ fitted parameter ⎞ 2 40. C. Wald Statistic = ⎜ ⎟ = (1.0874/0.3162)2 = 11.826. ⎝ standard error ⎠ Comment: We would compare this statistic to a Ch-Square Distribution with 1 degree of freedom. Using a computer, the p-value is 0.06%. € Solutions Spring 2016 CAS Exam S, HCM 6/28/16, Page 44 41. A Poisson regression model with log link is used to estimate the number of diabetes deaths. The parameter estimates for the model are: Response variable Number of Diabetes Deaths Response distribution Poisson Link Log ^ Parameter df β p-value Intercept 1 -15.000 <0.0001 -1.200 0.000 <0.0001 Gender: Female Gender: Male €1 1 Age 1 0.150 <0.0001 Age2 1 0.004 <0.0001 Age × Gender: Female Age × Gender: Male 1 0 0.012 0.000 <0.0001 Calculate the expected number of deaths for a population of 100,000 females age 25. € A.€Less than 3 B. At least 3, but less than 5 C. At least 5, but less than 7 D. At least 7, but less than 9 E. At least 9 41. C. 100,000 exp[-15 - 1.2 + (0.15)(25) + (0.004)(252 ) + (0.012)(25)] = 100,000 e-9.65 = 6.44 deaths. Solutions Spring 2016 CAS Exam S, HCM 6/28/16, Page 45 42. You are building a model of claim counts, and in order to evaluate the risk of overdispersion, you calculate the following: • The log-likelihood of the Poisson regression is l p = 350. € € • The log-likelihood of the Negative Binomial regression is l NB. At a significance level of 0.5%, you reject the hypothesis that a Poisson response is more € appropriate than a Negative Binomial response. € Calculate the smallest possible value of l NB. A. Less than 350 B. At least 350, but less than 352 € C. At least 352, but less than 354 D. At least 354, but less than 356 E. At least 356 42. C. The test statistic is: (2) ( l NB - 350). We compare to the (2)(0.5%) = 1% critical value for the Chi-Square Distribution with one-degree of freedom: 6.63. 6.63 = (2) ( l NB - 350).€ ⇒ l NB = 353.315. Comment: To compare a Poisson Regression and an otherwise similar Negative Binomial Regression, the test statistic is twice the difference in loglikelihoods. For significance level α we € compare to the€2α€tail of the Chi-Square Distribution with one-degree of freedom. Will not be on the syllabus going forward. Solutions Spring 2016 CAS Exam S, HCM 6/28/16, Page 46 43. You are given the following information: • The time series {Xt} is stationary in mean and variance. • The first 8 observed values of x are as follow: t xt € 1 2 3 4 5 6 7 8 € 13 10 12 9 8 8 7 5 Calculate the sample auto covariance function, ck, for lag k = 2. A. Less than 1.80 B. At least 1.80, but less than 1.90 C. At least 1.90, but less than 2.00 D. At least 2.00, but less than 2.10 E. At least 2.10 43. B. Mean is: 72/8 = 9. (13 - 9)(12 - 9) + (10 - 9)(9 - 9) + (12 - 9)(8 - 9) + (9 - 9)(8 - 9) + (8 - 9)(7 - 9) + (8 - 9)(5 - 9) c2 = 8 = 1.875. € Solutions Spring 2016 CAS Exam S, HCM 6/28/16, Page 47 44. You are given the following ARMA(p, q) model: 5 1 1 xt = xt-1 - 2xt-2 + xt-3 + wt - wt-1 2 2 2 The model is parameter redundant. Which of the following statements is false? € € A.€The model is non-stationary. B. The model is ARMA(3,1). C. The model is invertible. D. The model can be restated as ARMA(2,0). E. None of the above. 44. E. This is an ARMA(3,1) model. Statement B is true. The characteristic equation is: 0 = 1 - 2.5B + 2B2 - 0.5B3 . We can factor this: 0 = (1 - B/2) (1 - 2B + B2 ). Thus the roots are 2, 1, and 1. Since not all of these roots have absolute value greater than one, the series is not stationary. Statement A is true. φ(B) = 1 - B/2. Setting φ(B) equal to zero: 0 = 1 - B/2. ⇒ B = 2. Since this root has absolute value greater than one, the series is invertible. Statement C is true. The series can be written as: θ(B) xt = φ(B) wt. ⇔ (1 - 2.5B + 2B2 - 0.5B3 ) xt = (1 - B/2) wt. € ⇔ (1 - B/2) (1 - 2B + B2 ) xt = (1 - B/2) wt. ⇔ (1 - 2B + B2 ) xt = wt. € Thus we have restated the original model € as ARMA(2,0). Statement D is true. Comment: A somewhat difficult and long question. € Solutions Spring 2016 CAS Exam S, € € € HCM 6/28/16, Page 48 45. You are given the following information: • x and y are two time series that are stationary. • The graphs below are generated using the ACF (auto correlation function) function of R. • The dashed lines above and below zero indicate the range within which the ACF results are considered not significantly different than zero. • The symbol wt is white noise with zero mean. € Which of the following statements displays the model structure that best describes series x and series y? A. xt = 0.9xt-1 + wt and yt = 0.9yt-1 + wt + 0.6wt-1 - 0.3wt-2 B. xt = -0.9xt-1 + wt and yt = wt + 0.6wt-1 - 0.3wt-2 C. xt = 0.9xt-1 + wt and yt = wt + 0.6wt-1 D. xt = -0.9xt-1 + wt and yt = wt + 0.6wt-1 - 0.3wt-2 + 0.4wt-3 E. xt = 0.9Xxt-1 + wt and yt = wt + 0.6wt-1 - 0.3wt-2 Solutions Spring 2016 CAS Exam S, HCM 6/28/16, Page 49 45. B. For an AR(1) model, with |α| < 1, the autocorrelations decline geometrically in absolute value; for α < 0 the autocorrelations alternate sign. The first correlogram follows this pattern, indicating an AR(1) model such as xt = -0.9xt-1 + wt is appropriate. For an MA(2) model, the autocorrelations beyond lag two are zero. The second correlogram follows this pattern, indicating an MA(2) model such as yt = wt + 0.6wt-1 - 0.3wt-2 is appropriate. Comment: For an AR(1) model: ρk = αk. Using the R function ARMAacf, one can compute the autocorrelations. For the AR(1) series, xt = -0,9xt-1 + wt, the output from ARMAacf(ar=c(-0.9), lag.max = 10) is: 0 1.0000000 1 2 3 -0.9000000 0.8100000 -0.7290000 6 7 8 0.5314410 -0.4782969 0.4304672 For the MA(2) series, yt = wt + 0.6wt-1 - 0.3wt-2, the output from ARMAacf(ma= c(.6,-.3), lag.max = 4) is: 0 1 2 1.0000000 0.2896552 -0.2068966 END OF THE EXAMINATION 4 5 0.6561000 -0.5904900 9 10 -0.3874205 0.3486784 3 0.0000000 4 0.0000000
© Copyright 2026 Paperzz