# MAT 274 BENCHMARK FORMAT AND STYLE TEMPLATE

1. A patient is classified as having gestational diabetes if their average glucose level is above 140 milligrams per deciliter (mg/dl) one hour after a sugary drink is ingested. Rebecca’s doctor is concerned that she may suffer from gestational diabetes. There is variation both in the actual glucose level and in the blood test that measures the level. Rebecca’s measured glucose level one hour after ingesting the sugary drink varies according to the Normal distribution with μ=140+# mg/dl and σ=#+1 mg/dl, where # is the last digit of your GCU student ID number. Using the Central Limit Theorem, determine the probability of Rebecca being diagnosed with gestational diabetes if her glucose level is measured:

a. Once?

b. n=#+2 times, where # is the last digit of your student ID?

c. n=#+4 times, where # is the last digit of your student ID?

d. Comment on the relationship between the probabilities observed in (a), (b), and (c). Explain, using concepts from lecture why this occurs and what it means in context.

For each part, insert your sketch of the required area under the normal curve. In addition, include a screenshot of your Excel computation to find this area.

i. Insert screenshot and figure for part (a)

ii. Insert screenshot and figure for part (b)

iii. Insert screenshot and figure for part (c)

iv. Comment on the relationship among the probabilities in parts (a),(b), and (c).

2. Suppose next that we have even less knowledge of our patient, and we are only given the accuracy of the blood test and prevalence of the disease in our population. We are told that the blood test is 9# percent reliable, this means that the test will yield an accurate positive result in 9#% of the cases where the disease is actually present. Gestational diabetes affects #+1 percent of the population in our patient’s age group, and that our test has a false positive rate of #+4 percent. Use your knowledge of Bayes’ Theorem and Conditional Probabilities to compute the following quantities based on the information given only in part 2:

a. If 100,000 people take the blood test, how many people would you expect to test positive and actually have gestational diabetes?

b. What is the probability of having the disease given that you test positive?

c. If 100,000 people take the blood test, how many people would you expect to test negative despite actually having gestational diabetes?

d. What is the probability of having the disease given that you tested negative?

e. Comment on what you observe in the above computations. How does the prevalence of the disease affect whether the test can be trusted?

Fill in the conditional probability table here, then answer the questions in each part below.

v. Comment on how prevalence of the disease affects your ability to trust the test. Discuss what factors would lead you to trust the blood test, or not trust the blood test.

3. As we have seen in class, hypothesis testing, and confidence intervals are the most common inferential tools used in statistics. Imagine that you have been tasked with designing an experiment to determine reliably if a patient should be diagnosed with diabetes based on their blood test results. Create a short outline of your experiment, including all the following:

a. A detailed discussion of your experimental design. Detailed experimental design should include the type of experiment, how you chose your sample size, what data is being collected, and how you would collect that data.

b. How is randomization used in your sampling or assignment strategy? Remember to discuss how you would randomize for sampling and assignment, what type of randomization are you using?

c. The type of inferential test utilized in your experiment. Include type of test used, number of tails, and a justification for this choice.

d. A formal statement of the null and alternative hypothesis for your test. Make sure to include correct statistical notation for the formal null and alternative, do not just state this in words.

e. A confidence interval for estimating the parameter in your test. State and discuss your chosen confidence level, why this is appropriate, and interpret the lower and upper limits.

f. An interpretation of your p-value and confidence interval, including what they mean in the context of your experimental design. Answer each part below. State your significance level, interpret your p-value, and make a decision on the null.

## NormalCalculator

 Given Data Values and Mean/Standard Deviation Data Value (x) 1.1200 Standard Deviation 1.0000 Mean 0.0000 Z-Score 1.1200 Left Probability 0.8686 Right Probability 0.1314 Given Left Probability Left Probability 0.0500 Standard Deviation 1.0000 Mean 0.0000 Z-Score -1.6449 Right Probability 0.9500 Data Value (x) -1.6449

## BinomialCalculator

 Parameters for Binomial Distribution n 5922 p 0.0268490375 Count P(X<=k) P(X=k) P(X=k) 177.66 0.9296 0.0111 0.9185 0.0815

## ProportionTemplate

 Z-Test for 1 Proportion phat 0.1892780558 Hypothesized value 0.2 n 6995 Number of Tails 1 Significance Level 0.01 Test Statistic (Z) -2.2418544478 P-Value 0.0124853905 Proportion Confidence Interval phat 0.1893 n 6995 Confidence Level 0.98 1-phat 0.8107 zstar 2.3263 Lower Limit 0.1784 Upper Limit 0.2002 Sample Size Given Margin of Error pstar 0.5 (1-pstar) 0.5 MOE 0.05 Confidence Level 0.95 zstar 1.9599639845 n 384.1458820694

## Z-Test (1 sample)

 Hypothesis Test Sample Mean 1 Hypothesized Value 0.5 Sample Size 1009 STDEV 45 Significance Level 0.05 Number of Tails 1 Z-statistic 0.3529417817 P-Value Right Tail 0.3620660433 P-Value Two Tail 0.7241320867 Confidence Intervals for Known Sigma Sample Mean(xbar) 1 Sample Size (n) 1009 STDEV (sigma) 45 Confidence Level 0.9 zstar 1.644853627 Lower Limit -1.3302053093 Upper Limit 3.3302053093

## T-Template (1-sample)

 Significance Test for t (One Sample) xbar -9.1 Hypothesized Value 0 Sample Standard Deviation 12.51177 n 10 Significance Level (alpha) 0.01 Number of Tails 2 df 9 Test Statistic (t) -2.2999724825 P-Value(Left) 0.0470015052 Critical Value 3.2498355416 Confidence Interval for t (One Sample) Sample Mean (xbar) -9.1 Sample STDEV (s) 12.51177 n 10 Confidence Level 0.99 df 9 tstar 3.2498355416 Lower Limit -21.958198806 Upper Limit 3.758198806