Cheng-Hsin General Hospital Study Designs and Applied Statistics Methods in the Clinical Research 振興醫院教學研究部 董道興 副研究員 2008.2.26 Cheng-Hsin General Hospital Part I: Basic Study Designs Cheng-Hsin General Hospital 研究設計的重要性 • 好的研究設計較能得到可信的研究結果 • 好的研究設計較能提供正確的因果關係 Cheng-Hsin General Hospital 研究設計之分類 • 觀察性研究 1.個案研究與描述性研究(Case-series or Descriptive) 2.橫斷研究(Cross-sectional studies) 3.世代研究(Cohort or Prospective studies) 4.病例對照研究(Case-control or Retrospective studies) 5.歷史性世代研究 (Historical cohort studies) Cheng-Hsin General Hospital • 實驗性研究(Experimental studies) 1. 控制性試驗(Controlled trials) A. Parallel or concurrent controls a. Randomized b. Not randomized B. Sequential controls a. Self-controlled b. Crossover C. External controls (including historical) 2.無控制組 • 統合性研究(Meta-analysis) Cheng-Hsin General Hospital 觀察性研究 • 個案研究 1. 針對某些特定病徵的病人進行描述 2. 無對照組(control group) 3. 無研究假設 (research hypotheses) Cheng-Hsin General Hospital Cheng-Hsin General Hospital • 橫斷研究(Cross-sectional studies) 有病 研究樣本 沒病 Time 研究時間 No direction of inquiry Question: What is happening? Cheng-Hsin General Hospital 橫斷研究之特點 • 目的 1.描述健康狀態 2.分析因子與事件之相關 • 研究設計 同時間收集因子與疾病等變項 • 資料收集方法 1.病歷紀錄 2.觀察 3.面訪及電訪 4. 問卷填寫 • 優點 1. 簡單易行 2. 資料收集範圍廣闊 3. 較具經濟性 4. 可短時間內得到結果 5. 較易獲得大量樣本 • 缺點 1.無法判定因果關係(causal relationship) 2.推論易流於表面 1991-1993 A community-based screening regime for type 2 diabetes (n=12,489) Cheng-Hsin General Hospital The Study Procedure Screening for type 2 diabetes (1991-1993) WHO (1999) criteria 1994-1998 Type 2 diabetics (n=1123) DR screening for Type 2 diabetics in 1999-2002 (n=971) Screening for diabetic retinopathy (1999-2002) 725 Type 2 diabetics received DR screening Economic evaluation of diabetic retinopathy screening (2003) Sight year measurement Utility survey Economic evaluation 1. Cost-effectiveness analysis 2. Cost-utility analysis 3 .Cost-benefit analysis Migrated out or death (n=152) Loss to follow-up (n=246) Epidemiologic information 1.Prevalence of DR 2.Incidence of DR 3.Risk factors of DR 4.Natural history of DR Willingness-to-pay survey Cheng-Hsin General Hospital Cheng-Hsin General Hospital • 世代研究(Cohort study) 暴露組 (exposed) 有病 研究世代 沒病 Direction of inquiry 有病 Question: What will happen? 非暴露組 (unexposed) 研究起始點 沒病 Time Cheng-Hsin General Hospital 世代研究之特點 • 目的 探討暴露組和非暴露組之間發生率(incidence)是否相同 • 研究設計 1.觀察世代可選擇一組或兩組 2.長期觀察研究世代之得病情形 • 優點 1.可計算發生率 2.可了解因果關係 3.可以調查稀有暴露世代的得病情形 4.可減少選樣性偏差的發生 5.可探討多重疾病(multiple diseases) • 缺點 1.觀察時間過長 2.費用較高 3.失去追蹤導致結果產生偏差(bias) Cheng-Hsin General Hospital • 主要測量指標 有暴露 沒暴露 有病 沒病 a c b d a+c b+d a+b c+d a+b+c+d 相對危險性(relative risk, RR) [a/(a+b)]/ [c/(c+d)] 相差危險性(attributable risk, AR) [a/(a+b)]-[c/(c+d)] Cheng-Hsin General Hospital Cheng-Hsin General Hospital Cheng-Hsin General Hospital • 成果評值(Outcome assessment) 1.功能狀態(Functional status) 2.生活品質(Quality of life) 3.滿意度(Patient satisfaction) 4.存活時間(survival time) 5.經濟評估(Economic evaluation) A.Cost-effectiveness analysis –CEA B.Cost-utility analysis –CUA C.Cost-benefit analysis –CBA Cheng-Hsin General Hospital Cheng-Hsin General Hospital Cheng-Hsin General Hospital • 病例對照研究(Case-control studies) 暴露(exposed) 病例組(cases) 未暴露(unexposed) 暴露(exposed) 對照組(controls) 未暴露(unexposed) Time 研究起始時間 Direction of inquiry Question: What happened? Cheng-Hsin General Hospital 病例對照研究法之特徵 • 目的 探討病例組與對照組其暴露分布是否相同 • 研究設計 選擇一組有病者為病例組, 無病者為對照組, 比較病例組與 對照組間過去的暴露經驗 • 病例組與對照組之選擇 1.定義清楚所選擇的病例組個案 2.選擇具有代表性的對照組個案 Cheng-Hsin General Hospital • 優點 1.迅速便宜 2.所須樣本數較小 3.適合於稀有疾病之研究 4.可快速得到結果 5.可以探討多重暴露對單一疾病的相關 • 缺點 1.不易得到過去完整的暴露經驗 2.不易選取合適的對照組 3.僅能用估計的方式得到發生率 4.時序性不易確定 5.可能產生選樣性偏差(selection bias)或 回憶性偏差(recall bias) Cheng-Hsin General Hospital 病例組 對照組 暴露 未暴露 a c b d a+c b+d a+b c+d 1.藉著比較case及control組的exposed與unexposed的比值 (ratio),即比較a/b和c/d就可知exposure和disease是否有關 2.若case組暴露率較control組高 (a/b>c/d),表示暴露和發病 有正相關 3.case 組的exposure odds=a/b control 組的exposure odds=c/d 則odds ratio(OR)=(a/b)/(c/d)=ad/bc 如果OR值越高,表示暴露與發病越有正相關. Cheng-Hsin General Hospital Cheng-Hsin General Hospital • 歷史性世代研究(Historical cohort studies) 暴露組 (exposed) 研究世代 有病 沒病 有病 Direction of inquiry 對照組 (unexposed) 沒病 Time 研究起始時間 Cheng-Hsin General Hospital 種類 觀察性研究設計之比較 過去 橫斷研究 病例對照研究 現在 未來 Subjects selected, data gathered on exposure status and outcome Information obtained on past exposure 世代研究 Case chosen with outcome Controls chosen without outcome Subjects selected and classified as to exposure 歷史性世代研究 Subjects classified as to exposure from existing records Subjects identified who were exposed in the past Outcome measured outcome measured Cheng-Hsin General Hospital 實驗性研究 (Experimental studies) • Experimental studies that involve humans are called clinical trials • Controlled trials are studies in which the experimental drug or procedure is compared with another drug or procedure, sometimes a placebo and sometimes the previously accept treatment • Uncontrolled trials are studies in which the investigators’ experience with the experimental drug or procedure is described, but the treatment is not compared with another treatment Cheng-Hsin General Hospital • 有對照組的實驗研究 實驗組 有結果 研究樣本 無結果 對照組 有結果 無結果 Time 研究起始時間 介入(intervention) Cheng-Hsin General Hospital • 實驗組與對照組 1.接受實驗步驟的對象 所組成的團體稱為實驗組,未接受實 驗步驟者則稱為對照組 2.實驗組的介入因素包括:治療性、預防性、干預性、社區 性試驗對照組則使用安慰劑(placebo) Cheng-Hsin General Hospital • 隨機分派(Random Allocation) 1.研究對象簽了同意書之後,根據random allocation來分組, 通常採用隨機號碼表 2.Randomization=Random Allocation 指每一個研究對象被分到任一組的機率是一樣的 3.以隨機分派的方法來分配實驗組與對照組可以提高兩組 的可比較性(comparability),而避免自我選擇(selfselection)的誤差, 以建立資料分析結果的效度(validity) 4.隨機分派方式: (1)簡單隨機分派(simple randomization) (2)分區隨機分派(Block randomization) (3)分層隨機分派(Stratified randomization) Cheng-Hsin General Hospital • 盲目程序( Blinding procedure) 1.在評估實驗結果時,必須考慮研究者與參與者的bias,為了避免bias, 最好的方法就是double blind 2.單盲程序(single blind):實驗者知道,被實驗者不知道被分到哪一組 雙盲程序(double blind):實驗者和被實驗者都不知道會分到哪一組 三盲程序(triple blind):實驗者、被實驗者和資料分析者都不知道 3.雙盲或三盲程序有時不易做到: (1)副作用(side effect) (2)有些trial是home VS hospital (3)醫師必須知道patient是屬於哪一組,便於照顧病人的情況 Cheng-Hsin General Hospital • Randomized clinical trials Provided the strongest evidence for concluding causation • Nonrandomized trials Do nothing to prevent bias in patient assignment • Trials with self-controls 1.Only one group in which patients are assessed before and after the intervention 2.Hawthorne effect Cheng-Hsin General Hospital • 交叉試驗(Crossover study) with outcome experimental subjects experimental subjects subjects meeting entry criteria with outcome without outcome without outcome with outcome with outcome controls controls without outcome without outcome Time onset of study intervention washout period intervention Cheng-Hsin General Hospital • Trials with external controls subjects with outcome without outcome with outcome Results from previous study without outcome Time onset of study intervention in subjects only Cheng-Hsin General Hospital Cheng-Hsin General Hospital 類實驗研究 (Quasi-experimental study) Cheng-Hsin General Hospital 統合性研究(Meta-analysis) • Meta-analysis uses published information from other studies and combines the results so as to permit an overall conclusion • Meta-analysis is similar to review articles, but additionally includes a quantitative assessment and summary of the findings • Meta-analysis is especially appropriate when the studies that have been reported have small numbers of subjects or come to different conclusions Cheng-Hsin General Hospital 實驗設計評估 • How to evaluating the study design the medical literatures? 1.In a clinical trial (1) How are subjects recruited? (2) Are subjects randomly assigned to the study group? (3) Is there a control group? (4) Are appropriate therapies included? (5) Is the study blind? Double blind? (6) How is compliance evaluated? (7) If some cases are censored, is a survival method such as Kaplan-Meier or the Cox model used? Cheng-Hsin General Hospital 2. In a cohort study (1) How are subjects recruited? (2) Are subjects randomly selected from an eligible pool? (3) How rigorously are subjects followed? How many dropouts does the study have and who are they? (4) If some cases are censored, is a survival method such as Kaplan-Meier or the Cox model used? 3. In a case-control study (1) Are subjects randomly selected from an eligible pool? (2) Is the control group a good one? (3) Are records reviewed independently by more than one person? Cheng-Hsin General Hospital 4. In a cross-sectional study (1) Are the questions unbiased? (2) Are subjects randomly selected from an eligible? (3) What is the response rate? 5. In a meta-analysis (1) How is the literature search conducted? (2) Are the criteria for inclusion and exclusion of studies clearly stated? (3) Is an effort made to reduce publication bias? (4) Is there information on how many studies are need to change the conclusion? Cheng-Hsin General Hospital Part II: Basic Statistics Methods Cheng-Hsin General Hospital Types of clinical data analysis • Time-unrelated analysis • Time-related analysis Cheng-Hsin General Hospital Time-unrelated analysis • Categorical scale 1. Proportion Z test & 2 test 2. Logistic regression • Interval scale 1. Independent T-test (Two-sample) 2. ANOVA 3. Linear Regression Cheng-Hsin General Hospital Time-related analysis • Balanced repeated measurement 1.Categorical scale (1) Generalized Estimation Equation (GEE) (2) Mixed model 2.Interval scale (1) Pair T-test (2) Mixed model • Survival analysis Cheng-Hsin General Hospital Survival analysis • Non-parametric method Kaplan-Meier (Life-Table) method • Semi-parametric method Cox regression model • Parametric method Cheng-Hsin General Hospital Two-Sample Problems • Continuous variable 1.Independent T-test 2.Paired T- test • Binary variable 1.Proportion Z Test or Chi-square test 2.McNemar’s Test Cheng-Hsin General Hospital Independent T-test & Paired T-test Example 1 : Evaluation of effectiveness of community rehabilitation care (n=30) Pre 83 95 75 Post 77 80 70 62 59 Cheng-Hsin General Hospital Statistical evaluation Mean Pre 78.20 (SD=14.31) Post 70.26 (SD= 18.04) Paired T T=-3.09 P=0.0079 Independent T T=-1.33 P=0.19 Cheng-Hsin General Hospital Correlation & Repeated measurement Paired T test enhance statistical efficiency by considering ...... the correlation between Pre- and Post(the same individual) Cheng-Hsin General Hospital 2 test & Proportion Z test Example 2 : • The study on the assessment of mental health program • Intervention: Experimental & Control • Outcome : Favorable (F) & Unfavorable (U) Cheng-Hsin General Hospital Two Data Sets 1. Data 1 : Test Control F 40 16 U 20 48 F 10 2 U 2 4 2. Data 2 : Test Control Cheng-Hsin General Hospital Statistical evaluation • Data 1 2 =(O-E)2/E 2(1) =21.79 P=0.001 Proportion test: Diff=P1-P2 =0.667-0.25=0.417 (95%CI: 0.26-0.58) • The intervention group has a more significant favorable response than the control group Cheng-Hsin General Hospital Fisher Exact Test • Data 2: 2(1) =4.50 P=0.034 (right??) Fisher’s Exact Test : One-Tail P=0.057 Two-Tail P=0.107 Cheng-Hsin General Hospital Correlated Data (McNemar’s Test) Example 3 : A match study on the efficacy of attending CRC screening with respect to health promotion (matched by sex and age) Outcome: whether to attend CRC screening Cheng-Hsin General Hospital General 2 test Health Promotion + - + 43 33 - 16 26 Attend 2(1)=3.70 P=0.055 (right??) Cheng-Hsin General Hospital McNemar’s test Health Promotion Yes + + 27 16 43 No - 6 10 16 33 26 59 2(1) =4.54 P=0.033 Cheng-Hsin General Hospital Measuring agreement between two people Example 4: Two reviewers on mammography Second Yes Yes 20 No 5 No 3 200 First Cheng-Hsin General Hospital Kappa statistics Kappa =(Observed-Expected agreement) /(1-expected agreement) =(O-E)/(1-E) Kappa=0.70 (95% CI: 0.50-0.89) Cheng-Hsin General Hospital How to interpret kappa? Byrt (1996) 0.93-1.00 0.81-0.92 0.61-0.80 0.41-0.60 0.21-0.40 0.01-0.20 0.00 Excellent agreement Very good agreement Good agreement Fair agreement Slight agreement Poor agreement No agreement Cheng-Hsin General Hospital Mutivariate analysis • Continuous variable 1. Linear regression 2. Mixed model (repeated measurement) • Binary variable 1. Logistic regression 2. GEE model (repeated measurement) Cheng-Hsin General Hospital Linear regression Example 5 : Is the effect of diet and exercise program on the SBP effective? Independent variable: diet, exercise Outcome: SBP Cheng-Hsin General Hospital Multiple Linear Regression Y= a+b1*diet+b2*exer+b3*diet*exer + (Interaction term) b’s SE T P-value diet: 1.03 2.22 0.47 0.64 exer: 0.78 0.13 6.21 0.0001 diet*exer : 0.30 0.20 1.54 0.13 Cheng-Hsin General Hospital Logistic regression Example 6: A study was performed on 53 patients with prostate cancer to collect data on several variables considered predictive of nodal involvement Cheng-Hsin General Hospital Definitions of predictors • Age • Acid: level of serum acid phosphate in King-Armstrong units (0.46-1.26) • X-ray examination: (0=negative 1=positive) • Grade: Pathological grade (0=less serious 1=more serious) • Size: size of tumor (0=small, 1=large) • Nodalinv: Laprectomy results (0=No involvement;1=Involvement) Cheng-Hsin General Hospital Model selection X-ray, Size and Acid are three significant factors for predicting nodal involvement Cheng-Hsin General Hospital Model Prediction expβx P 1 expβ x Odds Ratio (OR)=exp() Cheng-Hsin General Hospital Probability prediction of nodal involvement (risk classification) • High-risk Acid= 1.26 , X-ray with positive finding, and large tumor = 0.96 • Low-risk Acid=0.48 , X-ray with negative finding, and small tumor=0.05 Cheng-Hsin General Hospital Generalized Estimation Equation (GEE method) • Indications: The regression model for correlated binary outcome • Example 7 A clinical trial comparing two treatments for a respiratory disease Cheng-Hsin General Hospital Definitions of variables • Outcomes: Respiratory status (0=poor 1=good)four visits • The main variable: Treatment (1=Active 0=placebo) • Other explanatory variables: center, age, sex, and baseline respiratory status Cheng-Hsin General Hospital Characteristics of GEE model Working correlation matrix to accommodate correlation between outcomes Cheng-Hsin General Hospital Survival Analysis • Outcome variable: survival time • Starting pointsendpoints : Survival time • Censoring problem : i.e Right censoring: study ends or lost to follow-up Cheng-Hsin General Hospital Kaplan-Meier method Example 8: Prognosis for women with breast cancer: A histochemical marker called Helix pomatia agglutinin (HPA) is used to assess whether tumor have already been metastasized Cheng-Hsin General Hospital Definition of Variables 32 Patients • Survival time: from treatment to death • Censoring: 1=dead 0=censoring • Prognostic factor: HPA (1=positive 0=negative) Cheng-Hsin General Hospital Cox regression model • Proportional hazard model: One of common multiple regression models for survival analysis. • No baseline hazard estimation • h(t,x1,x2,…,xn)=h0(t)exp(b1x1+b2x2+…+bnxn) Cheng-Hsin General Hospital Importance of the Cox Model • Provided the only valid method of predicting a time-dependent outcome • Producing survival curves that are adjusted for confounding factors. • The Cox model can be extended to the case of multiple events for a subject • Cox model could estimate RRexp() Cheng-Hsin General Hospital Parametric regression models Accelerated failure time (AFT) model • To accommodates left censoring and interval censoring • Distribution 1.exponential distribution 2.weibull distribution 3.log-normal distribution 4.log-logistic distribution 5.Gamma distribution Cheng-Hsin General Hospital Other methods for multiple variables • Discriminant analysis It assumes that the independent variables follow a multivariate normal distribution, so it must be used with caution if some X variables are nominal • Log-Linear analysis All the variables, both independent and dependent, are measured on a nominal scale • Factor analysis All variables are considered to be independent variables Cheng-Hsin General Hospital • Cluster analysis The object is to determine a classification or taxonomic scheme that accounts for variance among the subjects • Multivariate analysis of variance (MANOVA) Involved multiple dependent variables as well as multiple independent variables • Canonical correlation analysis When both the independent variables and the outcomes are numerical and the research question focuses on the relationship between the set of independent & dependent variables Cheng-Hsin General Hospital Summary of conceptual framework for questions involving two variables Independent variable Dependent variable nominal nominal binary numerical nominal (more than two values) numerical Method chi-square t-test numerical one-way ANOVA numerical regression; correlation Cheng-Hsin General Hospital Summary of conceptual framework for questions involving two or more independent variables Independent variable Dependent variable nominal nominal nominal and numerical nominal and numerical dichotomous nominal (two or more values) nominal numerical numerical numerical nominal and numerical nominal with confounding numerical only censored Method Log-linear Logistic regression Discriminant analysis ANOVA Multiple regression Cox regression numerical ANCOVA nominal Mantel-Haenszel --Factor analysis & Cluster analysis Cheng-Hsin General Hospital Thank you for your attention!
© Copyright 2026 Paperzz