1. A patient is classified as having gestational diabetes if their average glucose level is above 140 milligrams per deciliter (mg/dl) one hour after a sugary drink is ingested. Rebecca’s doctor is concerned that she may suffer from gestational diabetes. There is variation both in the actual glucose level and in the blood test that measures the level. Rebecca’s measured glucose level one hour after ingesting the sugary drink varies according to the Normal distribution with μ=140+# mg/dl and σ=#+1 mg/dl, where # is the last digit of your GCU student ID number. Using the Central Limit Theorem, determine the probability of Rebecca being diagnosed with gestational diabetes if her glucose level is measured:

a. Once?

b. n=#+2 times, where # is the last digit of your student ID?

c. n=#+4 times, where # is the last digit of your student ID?

d. Comment on the relationship between the probabilities observed in (a), (b), and (c). Explain, using concepts from lecture why this occurs and what it means in context.


For each part, insert your sketch of the required area under the normal curve. In addition, include a screenshot of your Excel computation to find this area.

i. Insert screenshot and figure for part (a)


ii. Insert screenshot and figure for part (b)


iii. Insert screenshot and figure for part (c)


iv. Comment on the relationship among the probabilities in parts (a),(b), and (c).



2. Suppose next that we have even less knowledge of our patient, and we are only given the accuracy of the blood test and prevalence of the disease in our population. We are told that the blood test is 9# percent reliable, this means that the test will yield an accurate positive result in 9#% of the cases where the disease is actually present. Gestational diabetes affects #+1 percent of the population in our patient’s age group, and that our test has a false positive rate of #+4 percent. Use your knowledge of Bayes’ Theorem and Conditional Probabilities to compute the following quantities based on the information given only in part 2:

a. If 100,000 people take the blood test, how many people would you expect to test positive and actually have gestational diabetes?

b. What is the probability of having the disease given that you test positive?

c. If 100,000 people take the blood test, how many people would you expect to test negative despite actually having gestational diabetes?

d. What is the probability of having the disease given that you tested negative?

e. Comment on what you observe in the above computations. How does the prevalence of the disease affect whether the test can be trusted?


Fill in the conditional probability table here, then answer the questions in each part below.





i. Answer part (a) here.


ii. Answer part (b) here.


iii. Answer part (c) here.


iv. Answer part (d) here.


v. Comment on how prevalence of the disease affects your ability to trust the test. Discuss what factors would lead you to trust the blood test, or not trust the blood test.



3. As we have seen in class, hypothesis testing, and confidence intervals are the most common inferential tools used in statistics. Imagine that you have been tasked with designing an experiment to determine reliably if a patient should be diagnosed with diabetes based on their blood test results. Create a short outline of your experiment, including all the following:

a. A detailed discussion of your experimental design. Detailed experimental design should include the type of experiment, how you chose your sample size, what data is being collected, and how you would collect that data.



b. How is randomization used in your sampling or assignment strategy? Remember to discuss how you would randomize for sampling and assignment, what type of randomization are you using?



c. The type of inferential test utilized in your experiment. Include type of test used, number of tails, and a justification for this choice.



d. A formal statement of the null and alternative hypothesis for your test. Make sure to include correct statistical notation for the formal null and alternative, do not just state this in words.


e. A confidence interval for estimating the parameter in your test. State and discuss your chosen confidence level, why this is appropriate, and interpret the lower and upper limits.



f. An interpretation of your p-value and confidence interval, including what they mean in the context of your experimental design. Answer each part below. State your significance level, interpret your p-value, and make a decision on the null.


Given Data Values and Mean/Standard Deviation
Data Value (x) 1.1200
Standard Deviation 1.0000
Mean 0.0000
Z-Score 1.1200
Left Probability 0.8686
Right Probability 0.1314
Given Left Probability
Left Probability 0.0500
Standard Deviation 1.0000
Mean 0.0000
Z-Score -1.6449
Right Probability 0.9500
Data Value (x) -1.6449


Parameters for Binomial Distribution
n 5922
p 0.0268490375
Count P(X<=k) P(X=k) P(X<k) P(X>=k)
177.66 0.9296 0.0111 0.9185 0.0815


Z-Test for 1 Proportion
phat 0.1892780558
Hypothesized value 0.2
n 6995
Number of Tails 1
Significance Level 0.01
Test Statistic (Z) -2.2418544478
P-Value 0.0124853905
Proportion Confidence Interval
phat 0.1893
n 6995
Confidence Level 0.98
1-phat 0.8107
zstar 2.3263
Lower Limit 0.1784
Upper Limit 0.2002
Sample Size Given Margin of Error
pstar 0.5
(1-pstar) 0.5
MOE 0.05
Confidence Level 0.95
zstar 1.9599639845
n 384.1458820694

Z-Test (1 sample)

Hypothesis Test
Sample Mean 1
Hypothesized Value 0.5
Sample Size 1009
Significance Level 0.05
Number of Tails 1
Z-statistic 0.3529417817
P-Value Right Tail 0.3620660433
P-Value Two Tail 0.7241320867
Confidence Intervals for Known Sigma
Sample Mean(xbar) 1
Sample Size (n) 1009
STDEV (sigma) 45
Confidence Level 0.9
zstar 1.644853627
Lower Limit -1.3302053093
Upper Limit 3.3302053093

T-Template (1-sample)

Significance Test for t (One Sample)
xbar -9.1
Hypothesized Value 0
Sample Standard Deviation 12.51177
n 10
Significance Level (alpha) 0.01
Number of Tails 2
df 9
Test Statistic (t) -2.2999724825
P-Value(Left) 0.0470015052
Critical Value 3.2498355416
Confidence Interval for t (One Sample)
Sample Mean (xbar) -9.1
Sample STDEV (s) 12.51177
n 10
Confidence Level 0.99
df 9
tstar 3.2498355416
Lower Limit -21.958198806
Upper Limit 3.758198806