Pearson Group Assignment 3 Coffee, Stress and Health Example

Pearson Group
Assignment 3
Coffee, Stress and Heart Example
Model Candidates
• Heart ~ Coffee
• Heart ~ Stress
• Heart ~ Coffee + Stress
Data Ellipses
Data ellipses show areas of concentration of the data and relationships between variables
Red ellipses for r=1 (1 standard deviation)
Green ellipses for r=2 (2 standard deviations)
Conditional Model
BetaStress = 1.1993
BetaCoffee = -0.4091
Marginal Models
BetaStress = 0.89317
BetaCoffee = 1.1082
95% Confidence Ellipses and Intervals
for the
Coefficients for Stress and Coffee
• Green
Ellipse:
Scheffé
• Red Ellipse
and Its
'Shadow'
Intervals:
Bonferroni
• Blue
Interval:
Bonferroni
(marginal
model)
Scheffé vs. Bonferroni
Scheffé CI



Larger than Bonferroni CI with minimal
number of parameters
Allows for data snooping
Based on F-statistic
Bonferroni CI


Incorporates a penalty for examining several
coefficients simultaneously
Based on t-statistic
Conditional Model
Coefficent for Coffee
 Bonferonni 95% CI: (-1.02494, 0.20646)
 Scheffé 95% CI:
(-1.19090, 0.37288)
– Both intervals include 0, thus not significant
Coefficient for Stress
 Bonferonni 95% CI: (0.72586, 1.67264)
 Scheffé 95% CI:
(0.59790, 1.80061)
– Both intervals do not include 0, thus are highly
significant
Conditional Model



In this case, either confidence interval we
use will give us the same conclusion
If we move the ellipses upward introducing
measurement error on Stress, we can make
the coefficient of Coffee significant according
to Bonferroni’s method.
Although, it is still insignificant according to
Scheffé method.
Conclusions







Both Coffee and Stress alone are excellent predictors of Heart for this data set
(positive linear relationship)
Scenario 1: Coffee (consumption) may be a cause of Heart (condition) with
(occupational) Stress as a mediating factor: use Heart ~ Coffee
Scenario 2: Stress may be a common cause for both Coffee and Heart: use
Heart ~ Coffee + Stress
Scenarios 1 and 2 and other possible scenarios cannot be decided from this
small data set with highly confounded variables
To determine the proper scenario, we need to collect more data, less
confounded if possible, and consult the relevant medical literature and health
researchers for helpful insights
Statistician take into consideration the significance of all possible models, and
then decide on an appropriate model
Statistician must try to insure all important factors and predictors are
considered in the model fitting process