Revised Version (correcting some errors in the version posted 12/17/01)
Part I: True of False?
a) If we reject the null hypothesis
using a significance level
, then there is less than a 5% probability that the null hypothesis is
really true.
FALSE. Confidence is not the same thing as probability.
b) If the correlation between two variables X and Y is r=0.4, then 40% of the variation in Y comes from variation in X
FALSE. Even with a generous reading of the words ``comes from'', the
proportion should be
, not r=0.4.
c) To double the accuracy of a z-test, you have to quadruple the size of the sample (leaving all other parameters fixed).
TRUE. The accuracy goes as the square root of the sample size.
d) The power of a statistical test is the probability of rejecting the null hypothesis, given that the alternative hypothesis is true.
TRUE.
e) For a given set of data, the decision to reject the null hypothesis may depend on what the alternative hypothesis is.
TRUE. The threshold for rejection is different for a 1-sided or 2-sided alternative. In fact, in an earlier exam the answer depended on which alternative you were using.
f) When doing 2-sided hypothesis testing, the chi-squared test for a
grid is equivalent to a z test for the difference of proportions.
TRUE. Any answer you get using one test you could get from the other.
For the next 4 questions, let
be the average value of some variable x taken from a random sample
of size n, and let s be the corresponding sample standard
deviation.
g) If x has a normal distribution with mean
and standard deviation
, then
has a normal distribution with mean
and standard deviation
.
FALSE. The standard deviation is
.
h) If n is large, then the distribution of
is approximately normal, no matter what the distribution of x looks
like.
TRUE. This is the Central Limit Theorem.
i) If x has a normal distribution with mean
, then
has a t-distribution with n-1 degrees of freedom.
FALSE.
, not
. (This is the problem that I had intended to be true, but mistyped).
j) If the distribution of x is skewed or has outliers, then the
standard deviation of
is NOT equal to
, where
is the (population) standard deviation of x.
FALSE. It is ALWAYS true that the standard deviation of
is
times the standard deviation of x, regardless of the value of n
or the distribution of x. This result is not the same as the Central
Limit Theorem, which talks about the shape of the distribution of
, and which requires n to be large.
k) When testing the difference of two means, the standard error for
is a weighted average of the standard error for
and the standard error for
.
FALSE. The standard error for
is
, which is larger than either
or
.
l) In an SRS, every collection of k individuals has an equal change of being selected.
TRUE.
m) If there is a strong correlation between two variables, then one must affect the other.
FALSE. Correlation is not causation.
n) A statistical consulting firm does tests and constructs 95% confidence intervals for a large number of customers. Assuming they do their calculations correctly, their confidence intervals will contain the true value of the quantities they are measuring about 95% of the time.
TRUE. 95% confidence means that the method works about 95% of the time, which is precisely what is stated.
o) For a given standard of accuracy, sampling a large population requires more data than sampling a small population.
FALSE. Accuracy depends on sample size, not population size.
Part II:
1. z-tests versus t-tests. (5 points) Suppose we are comparing two populations (say, men vs. women at UT). We take a random sample of each population to compare attributes. When comparing the mean values of some trait (e.g. weight) for the two populations, we use a t-statistic. However, when comparing the proportion of each population that has some trait (e.g., is registered to vote), we use a z-statistic. Explain why.
Answer: A t statistic is used when you don't know what
the standard deviation
is. However, when computing proportions you know the standard deviation
in terms of
and
. Even though you don't know
and
exactly, the formula
is not very sensitive to changes in p, and has an upper bound of
.
2. Life expectancy (5 points)
The male members of my family, and of my wife's family, have tended to die much younger than the female members. My father, my grandfathers, my wife's father and my wife's grandfathers died at the ages of 56, 70, 58, 65, 62, and 69. My mother, my grandmothers, and my wife's grandmothers lived to be 83, 86, 81, 93, and 82, while my wife's mother is still alive at 70. What statistically significant conclusions can we draw about my daughter's life expectancy vs. my sons'?
Answer: NONE! THE SAMPLE IS NOT RANDOM!
Without a random sample, none of the statistical tests we learned about mean anything. This is probably the MOST important lesson for would-be statisticians. Anecdotal evidence, such as I gave you in this problem, should never be relied upon for statistical analysis.
I am sad to report that the vast majority of the class got this problem wrong.
3. Shapes of distributions. a) The graph below shows two curves.
On is the distribution of a quantity x in a large population. The
other is the distribution of
from random samples of size 10 taken from that population. Which is which?
How can you tell? (You MUST justify your answer to get full credit).
Answer: Curve 1 is the underlying distribution of x. That
is, x is equally likely to be found anywhere between 1 and 6. The
average value of 10 measurements, however, is bound to be fairly close
to 3.5. It's given by an approximatly normal distribution (curve 2), and
has standard deviation a factor of
smaller than that of curve 1.
b) On the same graph, sketch the distribution of the mean
of random samples of size 40. Be as precise as possible. In particular,
indicate what the shape is supposed to be, and how the height and width
of this curve compare to the others.
Answer: This is a normal distribution with a standard deviation
exactly half that of curve 2, since
. In other words, it will be twice as high and half as wide as curve 2,
and will still be centered at 3.5.
4. Vaccinations Measles vaccines are given to millions of American children every year. This prevents the million-or-more cases per year that used to occur before vaccination was started, but does lead to occasional side effects. In any given year, we should expect an average of 121 cases of severe complications from the vaccine. [These numbers are reasonably close to the truth, by the way].
a) What kind of probability distribution (e.g., uniform, exponential, binomial, poisson, normal, student t, chi squared, etc.) should describe the number of cases in a given year? What are the values of the relevant parameters for this distribution?
Answer: This is a Poisson distribution with
. If you prefer, you can call it a binomial distribution with n=several
million and p=121/n. But that's essentially the same as Poisson.
b) What is the probability of getting 130 or fewer cases in a given year?
Answer: Use the normal approximation.
,
.
z=(130.5-121)/11 = 0.864 (don't forget the continuity correction!)
and
.
c) Suppose that, starting in 2002, a new vaccine will be used that is intended to be safer than the old vaccine. If there are 100 cases next year, can we conclude, with 95% confidence, that the new vaccine really is safer? [This question is purely hypothetical. To the best of my knowledge, there is no new vaccine.]
Answer: Yes! The null hypothesis is that the two vaccines are
equally safe (
), while the alternate hypothesis is that the new vaccine is safer (
). We compute using the null hypothesis:
z = (100.5-121)/11 = -1.864, F(z) = 0.031. In other words, if the null hypothesis is true, then there is only a 3.1% chance of getting 100 or fewer cases. With 95% confidence, we reject the null hypothesis in favor of the alternative that the new vaccine is safer.
5. 2-way grid problem
A study was done of study habits of students in a large class, and the possible effect this has on grades. 28 students were tracked, of which some study with friends, some study with a tutor, and some study alone. All these students received either an A, B or C in the course, according to the following table:
a) Compute the expected number of counts, assuming that the two features are unrelated.
Answer: The expected count for a square is the row total times the column total divided by the overall total (28). So our expected counts are:
b) Compute the chi-squared statistic.
Answer:
c) How many degrees of freedom are there?
Answer: df=(3-1)(3-1)=4.
d) Can we reject, with 90% confidence, the null hypothesis that one's mode of study has nothing to do with what grade one gets?
Answer: For 4 degrees of freedom and 90% confidence, the critical
value of
is 7.78. Since 5.76 < 7.78, we CANNOT reject the null hypothesis.
6. Cholesterol levels
A study attemps to measure the difference in cholesterol levels between
people in different countries. A random sample of 51 American men between
the ages of 40 and 50 showed an average cholesterol count of
, with standard deviation s=42. A random sample of 51 Japanese men
between the ages of 40 and 50 showed an average cholesterol count of
, with standard deviation s=37.
a) Compute a 90% confidence interval for the true mean cholesterol level for American men aged 40-50.
Answer:
, df=50, so
, so
.
b) Compute a 95% confidence interval for the true mean cholesterol level for Japanese men aged 40-50.
Answer:
, df=50,
, so
.
c) Compute a 99% confidence interval for the difference between the mean cholesterol levels in the USA and Japan (for men between 40-50).
Answer:
. Conservatively, we take df=50, so
. Thus
7. Proportions
a) How much exit polling data do you need to predict the result of a referrendum to within 2% with 95% confidence? (You may assume that you have a random sample and that everybody you poll answers your questions truthfully).
Answer:
, so we need
. But
and
, so this works out to n =2401.
b) An exit survey of 1000 individuals shows that 542 voted in favor of the proposition and 458 voted against. With what degree of confidence can you say that the proposition passed?
Answer:
is that the election is tied (p=0.5).
is that the proposition passes (p > 0.5). Note that this is a 1-sided
alternative.
. z = 0.042/0.0158 = 2.658. F(z)=0.996, so we can
reject
in favor of
with 99.6% confidence. In other words, we can say the proposition passed
with 99.6% confidence.
8. Correlations
You are given a sample of 18 data points
from a large population. Among these, you find that
,
,
,
, and r = 0.45.
a) Find the equations of the best-fit regression line
.
Answer:
.
, so our best-fit line is y = 7.72 + 1.56 x.
b) What fraction of the variation in y is ``explained'' by the variation in x?
Answer:
.
c) Can we conclude, with 95% confidence, that x and y are correlated?
Answer: Do a t-test for the correlation.
With n-2=16 degrees of freedom, the critical value is
, so we CANNOT conclude, with 95% confidence that there is a correlation.
(That's assuming a 2-sided alternative. If you use a 1-sided alternative,
then the critical value is only 1.746, and you CAN conclude that there's
a correlation).
d) Construct a 90% confidence interval for the slope
of the best-fit line of the entire population. (You may assume that y
can indeed be written as
, where
is a normal random variable that is independent of x).
Answer: The key is finding the standard error, which equals
. However,
, and
. Thus SE = 0.772. Since
, we have
.