Practical Aspects of Sampling

Practical Aspects of
Sampling
An Overview
Why Sample?
Why Sample?
Samples
are taken to obtain
information about populations.
Sample
estimators are computed to
estimate parameters of the the
population from which the sample was
drawn.
Advantages
Complete
enumeration of all
sample units in the
entire universe is
often unnecessary
to obtain reasonably
accurate results.
Advantages
An
examination of the
entire population is
often too costly, too
time-consuming, and
impractical (if not
impossible).
Advantages
In
the case of
destructive testing, the
sample elements or
units must be destroyed
or must be consumed to
obtain necessary
measurements.
Precision
The
standard error [se] is a measure of
precision. A smaller se, other things
remaining the same, means more
precision
.....that is, less variance in the sampling.
Sample Size
for a mean –
n = z2 2 / e2
where:
– e, the sampling error, is the difference
between sample mean and population mean
[e is expressed in units]
Sample Size
for a proportion n = [z2 p (1 – p)] / e2
where:
– e, the sampling error, is the difference
between sample proportion and population
proportion
[ e is expressed in percentage points]
Sample Size
Errors
Sampling
(internal) Error
The fact that a sample was taken, the
sample statistic is expected to deviate
from the population parameter.
Errors
Non-Sampling
(external) Error
The practical considerations in taking a
sample.
recording errors
processing errors
Errors
Bias
Most insidious to detect ....
poorly defined universe
inadequate sampling design
improperly worded questions
distorted answers
convenience sampling
Errors
The
sampling error refers to the
extent to which the sample values on
some variable of importance to the
research differ from those of the
population from which it was drawn.
Types of Random
Samples
Simple Random
 with
replacement
 without
replacement
Stratified
When
the population
is heterogeneous
overall, but within it
there are
homogeneous
populations (strata)
the population is
stratified.
Systematic
Selecting
a random
sample, as opposed
to the simple random
selection technique.
Select the K-th item.
Draw every I-th item.
Cluster
Another
modified
random sample design
-- requires that the
sample unites be
grouped in clusters in
the universe.
Not grouped by
homogeneous strata in
the population.
Multistage
The
selection procedure takes place in
a hierarchy of stages.
–
–
–
–
–
first
second
third
.....
last
primary sample unit
second sample unit
tertiary sample unit
final (or ultimate) sample unit
Multistage - An Example
The president of Supermarkets, Inc.
decided to sample purchases at 150
stores in the US.
The first stage is to select, on the basis of
clustering (save travel time), 15 of the
150 stores.
Multistage - An Example
The researcher recommends that cash register
files be randomly selected at each of the
150 stores. [second stage]
Then select every 20th purchase in a file using
a random start. [final stage]
Comparison of Survey
Sampling Designs
Simple Random
 How
to Select
– assign numbers to
elements using
random numbers
table
 Strengths/Weaknesses
– basic, simple, often
costly
– must assign a number
to each element in
target population
Stratified
 How
to Select
– divide population
into groups that are
similar within and
different between
variable of interest
 Strengths/Weaknesses
– with proper strata, can
produce very accurate
estimates.
– less costly than
simple random
sampling
– must stratify target
population correctly
Stratified
 One
of the main reasons for using a
stratified sample is that stratifying has
the effect of reducing sampling error
for a given sample size to a level lower
than that of an simple random sample
of the same size.
Stratified
This is so because of a very simple
principle: the more homogeneous a
population is on the variables being studied,
the smaller the sample size needed to
represent it accurately.
 Stratifying makes each sub-sample more
homogeneous by eliminating the variation
on the variable that is used for stratifying.

Systematic
 How
to Select
– select every K-th
element are from a
list after a random
start
 Strengths/Weaknesses
– produces very accurate
estimates when
elements in population
exhibit order
– used when pop. size
not known
– simplifies selection
process
Cluster
 How
to Select
– randomly choose
clusters and sample
all elements within
each cluster
 Strengths/Weaknesses
– with proper clusters,
can produce accurate
estimates
– useful when sample
frame not available or
travel costs high
– must cluster target
population correctly
Convenience





In Dining Commons at
dinner
In Student Union
In classes in which
you are enrolled
Data available in
Library
friend knows
somebody ...
References
 Monette,
Duane R., et al. Applied Social
Research New York: Holt, Rinehart and
Winston, 1986.
 Levine,
David, et al. Statistics for
Managers, Second Edition. Upper Saddle
River, NJ: Prentice-Hall, 1999.
Mini-Cases
Work as a team…
… decide best sampling technique
Scenario 1
 You
have been hired by the County of
Sacramento to estimate the percentage of
registered voters that favor issuing a bond in
order to finance the construction of a new
bike trail along the Sacramento River.
Given that you want no more than a 0.04
error margin, at the 95% confidence level,
state how you would conduct such a survey
using a simple random sample
Scenario 1
(continued)
 When
going over your sampling design
with the county Parks Director, you are
asked whether a stratified sample would be
appropriate? What is your reply? Why?
 What
about a systematic sample?
Travel Vouchers
Fly the Friendly Skis
Scenario #2
o
The State of California has hired you to
estimate the number of travel vouchers for
legislators that have been filed incorrectly.
The vouchers have been filed as they are
processed.
o
Which sampling technique would you
employee?
Light Rail
Scenario #3

Light Rail has hired you to determine
whether passengers like the convenience of
using light rail system.

Which sampling technique would you
employee?
Trucks
Scenario #4
 Marketers,
Inc., has hired you to determine
why so many young drivers, both male and
female, prefer owning a pickup truck as
compared to an automobile.
 Which
sampling technique would you
employee?
Questions?