Inferential Statistics Coursera Quiz Answers | Networking Funda

Get All Weeks Inferential Statistics Coursera Quiz Answers

Quiz 1: Practice Quiz Answers

Q1. Suppose we are interested in studying how much chocolate is consumed by Coursera students, measured in grams per week. After surveying 500 students, we calculate an average of 175 grams per week with a standard deviation of 195 grams per week. Which of the following is not necessarily true?

[expand title=View Answer] A histogram of the samples will be skewed to the right.[/expand]

Q2. Which of the following is false?

[expand title=View Answer] Standard error computed based on a sample standard deviation will always be lower than the standard deviation of that sample.[/expand]

Q3. The ages of pennies at a particular bank follow a nearly normal distribution with mean 10.44 years with standard deviation 9.2 years. Say you take random samples of 30 pennies, find the mean age in each sample, and plot the distribution of these means. Which of the following are the best estimates for the center and spread of this distribution?

[expand title=View Answer]
mean = 10.44,
standard error = 9.2/\sqrt{30} = 1.68
[/expand]
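As a quick check of the arithmetic, the standard error of the mean is the population standard deviation divided by the square root of the sample size. A minimal Python sketch (the variable names are illustrative, not from the course):

```python
import math

sigma = 9.2   # population standard deviation of penny ages (years)
n = 30        # pennies per sample

standard_error = sigma / math.sqrt(n)
print(round(standard_error, 2))  # 1.68
```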

Q4. Which of the following is true about sampling distributions?

[expand title=View Answer]
1. Sampling distribution of the mean is always right skewed since means cannot be smaller than 0.
2. Shape of the sampling distribution is always the same shape as the population distribution, no matter what the sample size is.
3. Sampling distributions get closer to normality as the sample size increases.
[/expand]

Q5. To get an estimate of consumer spending in the U.S. following the Thanksgiving holiday, 436 randomly sampled American adults were surveyed. Their daily spending for the six-day period following Thanksgiving averaged $84.71. A 95% confidence interval based on this sample is ($80.31, $89.11). Which of the following are true?

I. We are 95% confident that the average spending of the 436 American adults in this sample is between $80.31 and $89.11.

II. If we collected many random samples of the same size and calculated a confidence interval for daily spending for each sample, then we would expect 95% of the intervals to contain the true population parameter.

III. We are 95% confident that the average spending of all American adults is between $80.31 and $89.11.

[expand title=View Answer] II and III [/expand]

Q6. Which of the following is false about confidence intervals? All else held constant.

[expand title=View Answer] as the sample mean increases, the margin of error stays constant. [/expand]

Inferential Statistics Week 01 Quiz Answers

Q1. Researchers studying anthropometry collected body girth measurements and skeletal diameter measurements, as well as age, weight, height and gender, for 507 physically active individuals. The histogram below shows the sample distribution of heights in centimeters, and the table shows sample statistics calculated based on this sample. Which of the following is not necessarily true?

[expand title=View Answer] The population mean is 171.1 cm. [/expand]

Q3. For the standard deviation σ or s and the standard error SE, which of the following is the correct set of descriptions?

[expand title=View Answer]
s: variability in sample data

SE: variability in point estimates from different samples of the same size and from same population

σ: variability in population data
[/expand]

Q1. We want to estimate the average coffee intake of Coursera students, measured in cups of coffee. A survey of 1,000 students yields an average of 0.55 cups per day, with a standard deviation of 1 cup per day. Which of the following is not necessarily true?

[expand title=View Answer] The sample distribution is right skewed. [/expand]

0.55 is a point estimate for the population mean.

Q2. Researchers studying anthropometry collected various body and skeletal measurements for 507 physically active individuals. The histogram below shows the sample distribution of heights in centimeters. If the 507 individuals are a simple random sample – and let’s assume they are – then the sample mean is a point estimate for the mean height of all active individuals. What measure do we use to quantify the variability of such an estimate? Compute this quantity using the data from this sample and choose the best answer below.

[expand title=View Answer] standard error = 0.019 [/expand]

Q3. Students are asked to count the number of chocolate chips in 22 cookies for a class activity. They found that the cookies on average had 14.77 chocolate chips with a standard deviation of 4.37 chocolate chips. After collecting the data, a student reports the standard error of the mean to be 0.93 chocolate chips. What is the best way to interpret the student’s result?

[expand title=View Answer] 0.93 chocolate chips is a measure of the variability we’d expect in calculations of the mean number of chocolate chips if we took repeated random samples of 22 cookies. [/expand]
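The interpretation above can be illustrated with a small simulation. This is only a sketch: it assumes the cookie population is normal with the sample's mean and standard deviation, which the question does not state, and the seed and number of repetitions are arbitrary.

```python
import math
import random
import statistics

rng = random.Random(0)          # fixed seed so the run is repeatable
mu, sigma, n = 14.77, 4.37, 22  # treated as population values (assumption)

# Draw many random samples of 22 cookies and record each sample mean.
sample_means = [
    statistics.fmean(rng.gauss(mu, sigma) for _ in range(n))
    for _ in range(5000)
]

simulated_se = statistics.stdev(sample_means)  # spread of the sample means
formula_se = sigma / math.sqrt(n)              # s / sqrt(n)

print(round(formula_se, 2))    # 0.93
print(round(simulated_se, 2))  # close to the formula value
```

The standard deviation of the simulated means lands near s/\sqrt{n}, which is what "variability in the mean across repeated samples of 22" refers to.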

Q4. Four plots are presented below. The plot at the top is a distribution for a population. The mean is 60 and the standard deviation is 18. Also shown below is a distribution of

(1) a single random sample of 500 values from this population,

(2) a distribution of 500 sample means from random samples of each size 18,

(3) a distribution of 500 sample means from random samples of each size 81.

Determine which plot (A, B, or C) is which.

[expand title=View Answer]
(2) 500 samples, n = 18 – Plot A

(3) 500 samples, n = 81 – Plot B
[/expand]

Q5. The General Social Survey (GSS) is a sociological survey used to collect data on demographic characteristics and attitudes of residents of the United States. In 2010, the survey collected responses from over a thousand US residents. The survey is conducted face-to-face with an in-person interview of a randomly-selected sample of adults. One of the questions on the survey is “For how many days during the past 30 days was your mental health, which includes stress, depression, and problems with emotions, not good?”

Based on responses from 1,151 US residents, the survey reported a 95% confidence interval of 3.40 to 4.24 days in 2010. Given this information, which of the following statements would be most appropriate to make regarding the true average number of days of “not good” mental health in 2010 for US residents?

[expand title=View Answer] For all US residents in 2010, based on this 95% confidence interval, we would reject a null hypothesis stating that the true average number of days of “not good” mental health is 5 days. [/expand]

Q. A random sample of 100 runners who completed the 2012 Cherry Blossom 10 mile run yielded an average completion time of 95 minutes. A 95% confidence interval calculated based on this sample is 92 minutes to 98 minutes. Which of the following is false based on this confidence interval?

[expand title=View Answer]
The margin of error of this confidence interval is 3 minutes.

We are 95% confident that the true average finishing time of all runners who completed the 2012 Cherry Blossom 10 mile run is between 92 minutes and 98 minutes.

Based on this 95% confidence interval, we would reject a null hypothesis stating that the true average finishing time of all runners who completed the 2012 Cherry Blossom 10 mile run is 90 minutes.

95% of the time the true average finishing time of all runners who completed the 2012 Cherry Blossom 10 mile run is between 92 minutes and 98 minutes
[/expand]

Q6. Suppose we collected a sample of size n = 100 from some population and used the data to calculate a 95% confidence interval for the population mean. Now suppose we are going to increase the sample size to n = 300. Keeping all else constant, which of the following would we expect to occur as a result of increasing the sample size?

I. The standard error would decrease.
II. Width of the 95% confidence interval would increase.
III. The margin of error would decrease.

[expand title=View Answer] I and III [/expand]
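A short sketch of why both quantities shrink: the standard error is s/\sqrt{n}, and the margin of error is a critical value times the standard error. The sample standard deviation below is a hypothetical value chosen only for illustration.

```python
import math
from statistics import NormalDist

s = 10.0                              # hypothetical sample standard deviation
z_star = NormalDist().inv_cdf(0.975)  # critical value for 95% confidence

for n in (100, 300):
    se = s / math.sqrt(n)
    margin = z_star * se
    # columns: n, standard error, margin of error, interval width
    print(n, round(se, 3), round(margin, 3), round(2 * margin, 3))

# Factor by which SE, margin of error, and width all shrink when n triples.
ratio = math.sqrt(100 / 300)
print(round(ratio, 3))  # 0.577
```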

Q7. Researchers investigating characteristics of gifted children collected data from schools in a large city on a random sample of thirty-six children who were identified as gifted children soon after they reached the age of four. The following histogram shows the distribution of the ages (in months) at which these children first counted to 10 successfully. Also provided are some sample statistics.

Calculate a 90% confidence interval for the average age at which gifted children first count to 10 successfully. Choose the closest answer.

[expand title=View Answer] (30.12, 31.26) [/expand]

Quiz 3: Week 1 Lab Answers

Q1. Which of the following is false?

[expand title=View Answer] The distribution of areas of houses in Ames is unimodal and right-skewed. [/expand]

Q2. Suppose we took two more samples, one of size 100 and one of size 1000. Which would you think would provide a more accurate estimate of the population mean?

[expand title=View Answer] Sample size of 1000 [/expand]

Q3. How many elements are there in this object called sample_means_small?

[expand title=View Answer] 5000 [/expand]

Q4. Which of the following is true about the elements in the sampling distributions you created?

[expand title=View Answer] Each element represents a mean square footage from a simple random sample of 10 houses. [/expand]

Q5. It makes intuitive sense that as the sample size increases, the center of the sampling distribution becomes a more reliable estimate for the true population mean. Also as the sample size increases, the variability of the sampling distribution _.

[expand title=View Answer] decreases[/expand]

Q6. Which of the following is false?

[expand title=View Answer] The variability of the sampling distribution with the smaller sample size (sample_means50) is smaller than the variability of the sampling distribution with the larger sample size (sample_means150). [/expand]

Inferential Statistics Week 03 Quiz Answers

Quiz 1: Practice Quiz Answers

Q1. Read the following scenario and then, from the choices that follow, choose the correct set of hypotheses for the scenario:

Since 2008, chain restaurants in California have been required to display calorie counts of each menu item. Prior to menus displaying calorie counts, the average calorie intake of diners at a restaurant was 1100 calories. After calorie counts started to be displayed on menus, a nutritionist collected data on the number of calories consumed at this restaurant from a random sample of diners. Do these data provide convincing evidence of a difference in the average calorie intake of a diners at this restaurant?

[expand title=View Answer] H0: μ = 1100; HA: μ ≠ 1100 [/expand]

Q2. Which of the following is the correct definition of the p-value?

[expand title=View Answer] P(observed or more extreme sample statistic | H0 true) [/expand]

Q3. One-sided alternative hypotheses are phrased in terms of:

[expand title=View Answer]< or > [/expand]

Q4. A Type 2 error occurs when the null hypothesis is

[expand title=View Answer] not rejected when it is false [/expand]

Q5. True / False: Decreasing the significance level (α) will increase the probability of making a Type 1 error.

[expand title=View Answer] False[/expand]

Quiz 2: Week 2 Quiz Answers

Q1. A study suggests that the average college student spends 2 hours per week communicating with others online. You believe that this is an underestimate and decide to collect your own sample for a hypothesis test. You randomly sample 60 students from your dorm and find that on average they spent 3.5 hours a week communicating with others online. Which of the following is the correct set of hypotheses for this scenario?

[expand title=View Answer] H0: μ = 2; HA: μ > 2 [/expand]

Q2. Which of the following is the correct definition of the p-value?

[expand title=View Answer] P(observed or more extreme sample statistic | H0 true) [/expand]

Q3. Two-sided alternative hypotheses are phrased in terms of:

[expand title=View Answer] ≠ [/expand]

Q4. A Type 1 error occurs when the null hypothesis is

[expand title=View Answer] rejected when it is true [/expand]

Q5. A statistician is studying blood pressure levels of Italians in the age range 75-80. The following is some information about her study:

(I) The data were collected by responses to a survey conducted by email, and no measures were taken to get information from those who did not respond to the initial survey email.

(II) The sample observations only make up about 4% of the population.

(III) The sample size is 2,047.

(IV) The distribution of sample observations is skewed – the skew is easy to see, although not very extreme.

The researcher is ready to use the Central Limit Theorem (CLT) in the main part of her analysis. Which aspect of her study is most likely to prevent her from using the CLT?

[expand title=View Answer] (I), because the email survey made no attempt to reach nonrespondents, so the sample may not be random and the observations may not be independent. [/expand]

Q6. SAT scores are distributed with a mean of 1,500 and a standard deviation of 300. You are interested in estimating the average SAT score of first year students at your college. If you would like to limit the margin of error of your 95% confidence interval to 25 points, at least how many students should you sample?

[expand title=View Answer] 554[/expand]
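The figure comes from solving z* · σ/\sqrt{n} ≤ 25 for n and rounding up. A minimal sketch of that calculation, using the standard normal critical value for 95% confidence:

```python
import math
from statistics import NormalDist

sigma = 300                           # population standard deviation of SAT scores
max_margin = 25                       # desired margin of error
z_star = NormalDist().inv_cdf(0.975)  # about 1.96 for 95% confidence

# n >= (z* * sigma / margin)^2, rounded up to the next whole student
n_required = math.ceil((z_star * sigma / max_margin) ** 2)
print(n_required)  # 554
```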

Q7. The significance level in hypothesis testing is the probability of

[expand title=View Answer] rejecting a true null hypothesis[/expand]

Q8. The nutrition label on a bag of potato chips says that a one-ounce (28-gram) serving of potato chips has 130 calories and contains ten grams of fat, with three grams of saturated fat. A random sample of 35 bags yielded a sample mean of 134 calories with a standard deviation of 17 calories. We are evaluating whether these data provide convincing evidence that the nutrition label does not provide an accurate measure of calories in the bags of potato chips at the 10% significance level. Which of the following is correct?

[expand title=View Answer] The p-value is approximately 16%, which means we should fail to reject the null hypothesis and determine that these data do not provide convincing evidence the nutrition label does not provide an accurate measure of calories in the bags of potato chips. [/expand]
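The test statistic and tail areas can be checked with a normal approximation. This is a sketch under the assumption that a Z statistic is acceptable for n = 35; a t distribution with 34 degrees of freedom would give slightly larger tails. The question asks whether the label is inaccurate in either direction, so the two-sided value is the one compared with the 10% level.

```python
import math
from statistics import NormalDist

null_mean, sample_mean, s, n = 130, 134, 17, 35

se = s / math.sqrt(n)                 # standard error of the sample mean
z = (sample_mean - null_mean) / se    # test statistic
upper_tail = 1 - NormalDist().cdf(z)  # one-sided area, roughly 0.08
two_sided_p = 2 * upper_tail          # two-sided p-value, roughly 0.16

print(round(z, 2), round(upper_tail, 3), round(two_sided_p, 3))
```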

Quiz 3: Week 2 Lab Answers

Q1. My distribution should be similar to others’ distributions who also collect random samples from this population, but it is likely not exactly the same since it’s a random sample.

[expand title=View Answer] True [/expand]

Q2. For the confidence interval to be valid, the sample mean must be normally distributed and have standard error s/\sqrt{n}. Which of the following is not a condition needed for this to be true?

[expand title=View Answer] The sample size, 60, is less than 10% of all houses. [/expand]

Q3. What does “95% confidence” mean?

[expand title=View Answer] 95% of random samples of size 60 will yield confidence intervals that contain the true population mean. [/expand]

Q4. What proportion of 95% confidence intervals would you expect to capture the true population mean?

[expand title=View Answer] 95% [/expand]
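The long-run meaning of the confidence level can be sketched with a simulation. The population below (normal, with a made-up mean and standard deviation) is purely hypothetical and is not the lab's housing data; only the sample size of 60 follows the lab.

```python
import math
import random
import statistics

rng = random.Random(1)                     # fixed seed for repeatability
true_mean, true_sd, n = 1500.0, 500.0, 60  # hypothetical population, lab's sample size
z_star = statistics.NormalDist().inv_cdf(0.975)

trials = 2000
hits = 0
for _ in range(trials):
    sample = [rng.gauss(true_mean, true_sd) for _ in range(n)]
    xbar = statistics.fmean(sample)
    se = statistics.stdev(sample) / math.sqrt(n)
    # Count the interval as a hit when it captures the true mean.
    if xbar - z_star * se <= true_mean <= xbar + z_star * se:
        hits += 1

coverage = hits / trials
print(coverage)  # typically close to 0.95
```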

Q5. What is the appropriate critical value for a 99% confidence level?

[expand title=View Answer] 2.58 [/expand]
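For a two-sided interval, the critical value is the (1 + confidence)/2 quantile of the standard normal distribution, because the leftover probability is split between both tails. A small sketch (the helper name is illustrative):

```python
from statistics import NormalDist

def z_critical(confidence):
    """Two-sided critical value: the (1 + confidence) / 2 quantile of N(0, 1)."""
    return NormalDist().inv_cdf((1 + confidence) / 2)

print(round(z_critical(0.99), 2))  # 2.58
print(round(z_critical(0.98), 2))  # 2.33
print(round(z_critical(0.95), 2))  # 1.96
```

Note that 2.33 is the 0.99 quantile itself, which corresponds to a 98% two-sided interval.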

Q6. We would expect 99% of the intervals to contain the true population mean.

[expand title=View Answer] True [/expand]

Inferential Statistics Week 04 Quiz Answers

Quiz 1: Week 3 Practice Quiz Answers

Q1. Consider the width of two bootstrap confidence intervals constructed based on the same sample. One of the intervals is constructed at a 90% confidence level and the other is constructed at a 95% confidence level. Which of the following is true?

[expand title=View Answer] The 95% interval is wider.[/expand]

Q2. Which of the following is not a situation where the paired test is preferred?

[expand title=View Answer] Assess the gender-related salary gap by comparing the salaries of randomly sampled men and women. [/expand]

Q3. You’ve just read a study that investigated the difference in brain sizes between EU and US citizens, based on data from random samples from both populations. At the 5% significance level the study failed to reject the null hypothesis that EU and US citizens have (on average) brains of equal size. Which of the following is true regarding a 99% confidence interval for the difference in brain sizes?

[expand title=View Answer] Without more information, it is impossible to know whether the interval contains 0. [/expand]

Q4. The figure below shows three unimodal and symmetric curves, which assignment is most plausible?

[expand title=View Answer]solid: t_{df = 1} , dashed: t_{df = 5}, dotted: Normal [/expand]

Q5. We are testing the following hypotheses:

H0: μ = 3

HA: μ > 3

The sample size is 18. The test statistic is calculated as T = 0.5. What is the p-value?

[expand title=View Answer] greater than 0.1 [/expand]

Q6. What does ANOVA mean?

[expand title=View Answer] Analysis of variance [/expand]

Q7. Which of the following is not a condition required for comparing means across multiple groups using ANOVA?

[expand title=View Answer] The means of each group should be roughly equal. [/expand]

Quiz 2: Week 3 Quiz Answers

Q1. People of different ages were asked to stand on a “force platform” and maintain a stable upright position. The “wiggle” of the board in the forward-backward direction is recorded; more wiggle corresponds to less balance. The participants are divided into two age groups: young and elderly. The average wiggle among elderly people was 26.33 mm, and the average among young people was 18.125 mm. The bootstrap distribution for the difference in means is shown below, based on 100 bootstrap samples. Of the following choices, which is the most accurate 90% bootstrap confidence interval for the true difference in means?

[expand title=View Answer] (3 mm, 17 mm) [/expand]

Q2. Which of the following is false regarding paired data?

[expand title=View Answer] Two data sets of different sizes cannot be analyzed as paired data. [/expand]

Q3. Which of the following is false about bootstrap and sampling distributions?

[expand title=View Answer] Both distributions get narrower as the standard deviation decreases. [/expand]

Q4. Your friend, who took statistics a few years ago, recently read a study that examined whether there is any difference between the average birth weights of babies born to smoking mothers vs. non-smoking mothers. Your friend asked you to remind him what it means when the study says “a 95% confidence interval for the difference between the average birth weight from non-smoking mothers and smoking mothers (μ_non − μ_smoke) is 0.2 to 0.9 pounds.” Of the following possible responses to your friend’s question, which is true according to the study?

[expand title=View Answer] We are 95% confident that babies born to non-smoking mothers are on average 0.2 to 0.9 pounds heavier at birth than babies born to smoking mothers. [/expand]

Q5. An insurance company wants to estimate (using a confidence interval) its average claim amount using data from 20 randomly selected claims. Which of the following is false?

[expand title=View Answer] A confidence interval based on this sample is not accurate since the sample size is small. [/expand]

Q6. The figure below shows three t-distribution curves. Which curve has the highest degrees of freedom?

[expand title=View Answer] Solid [/expand]

Q7. Air quality measurements were collected in a random sample of 25 country capitals in 2013, and then again in the same cities in 2014. We would like to use these data to compare average air quality between the two years. Which of the following tests is the most appropriate?

[expand title=View Answer] paired t-test with two-sided alternative hypothesis [/expand]

Q8. We are testing the following hypotheses:

H0: μ = 0.5

HA: μ ≠ 0.5

The sample size is 26. The test statistic is calculated as T = 2.485. What is the p-value?

[expand title=View Answer] between 0.01 and 0.02 [/expand]
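The p-value can be approximated without a t table. The sketch below integrates the t density numerically with Simpson's rule, using only the standard library. The helper names are illustrative, and this is one possible way to get the tail area, not the method used in the course.

```python
import math

def t_pdf(x, df):
    """Density of Student's t distribution with df degrees of freedom."""
    log_c = (math.lgamma((df + 1) / 2) - math.lgamma(df / 2)
             - 0.5 * math.log(df * math.pi))
    return math.exp(log_c - (df + 1) / 2 * math.log1p(x * x / df))

def t_upper_tail(t, df, steps=2000):
    """P(T > t) for t >= 0: 0.5 minus the Simpson's-rule area on [0, t]."""
    if t < 0:
        raise ValueError("t must be non-negative")
    h = t / steps  # steps must be even for Simpson's rule
    total = t_pdf(0.0, df) + t_pdf(t, df)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * t_pdf(i * h, df)
    return 0.5 - total * h / 3

n, T = 26, 2.485
df = n - 1                                # 25 degrees of freedom
p_two_sided = 2 * t_upper_tail(T, df)     # HA is two-sided
print(round(p_two_sided, 3))
```

The result is approximately 0.02, at the upper edge of the stated range. The same helper with df = 17 and T = 0.5 gives a one-sided tail of roughly 0.31, consistent with "greater than 0.1" in the earlier practice question.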

Q9. When doing an ANOVA, you observe large differences in means between groups. Within the ANOVA framework this would most likely be interpreted as:

[expand title=View Answer] Evidence strongly favoring the alternative hypothesis. [/expand]

Q10. Which of the following is not a condition required for comparing means across multiple groups using ANOVA?

[expand title=View Answer] There should be at least 10 successes and 10 failures. [/expand]

Q11. A study compared five different methods for teaching descriptive statistics. The five methods were traditional lecture and discussion, programmed textbook instruction, programmed text with lectures, computer instruction, and computer instruction with lectures. 45 students were randomly assigned, 9 to each method. After completing the course, students took a 1-hour exam.

Which of the following is the correct degrees of freedom for an F-test for evaluating if the average test scores are different for the different teaching methods?

[expand title=View Answer] df_G = 4, df_E = 40 [/expand]
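The degrees of freedom follow directly from the number of groups k and the total sample size n: df_G = k − 1 and df_E = n − k. A minimal sketch:

```python
k = 5          # number of teaching methods (groups)
n_total = 45   # students in the study

df_group = k - 1         # between-group degrees of freedom
df_error = n_total - k   # within-group (error) degrees of freedom
df_total = n_total - 1   # the two components sum to this

print(df_group, df_error, df_total)  # 4 40 44
```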

Q12. A study compared five different methods for teaching descriptive statistics. The five methods were traditional lecture and discussion, programmed textbook instruction, programmed text with lectures, computer instruction, and computer instruction with lectures. 45 students were randomly assigned, 9 to each method. After completing the course, students took a 1-hour exam. We are interested in finding out if the average test scores are different for the different teaching methods. Which of the following is the appropriate set of hypotheses?

[expand title=View Answer] H0: μ1 = μ2 = μ3 = μ4 = μ5; HA: at least one μi is different [/expand]

Q13. Researchers studying people’s sense of smell devised a measure of smelling ability. A higher score on this scale means the subject can smell better. A random sample of 36 people (18 male and 18 female) were involved in the study. The average score for the males was 10 with a standard deviation of 3.4 and the average score for the females was 11 with a standard deviation of 2.7. Which of the following is the correct standard error for the test evaluating whether the males and females have differing smelling abilities, on average? Choose the closest answer.

[expand title=View Answer] 1.023 [/expand]
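For two independent samples, the standard error of the difference in means is \sqrt{s_1^2/n_1 + s_2^2/n_2}. The sketch below prints both the quantity under the square root and the standard error itself, since the two are easy to confuse here:

```python
import math

s_m, n_m = 3.4, 18   # males: standard deviation, sample size
s_f, n_f = 2.7, 18   # females: standard deviation, sample size

variance_of_difference = s_m**2 / n_m + s_f**2 / n_f
se_difference = math.sqrt(variance_of_difference)

print(round(variance_of_difference, 3))  # 1.047
print(round(se_difference, 3))           # 1.023
```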

Q14. A study compared five different methods for teaching descriptive statistics. The five methods were traditional lecture and discussion, programmed textbook instruction, programmed text with lectures, computer instruction, and computer instruction with lectures. 45 students were randomly assigned, 9 to each method. After completing the course, students took a 1-hour exam. We are interested in finding out if the average test scores are different for the different teaching methods.

How many pairwise tests would we need to do in order to compare all pairs of means to each other?

[expand title=View Answer] 10 [/expand]
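The count is the number of ways to choose 2 groups out of 5. The sketch below also shows a Bonferroni-adjusted significance level for those comparisons; the overall level of 0.05 is an assumption, since the question does not state one.

```python
import math

k = 5                             # number of teaching methods
pairwise_tests = math.comb(k, 2)  # k * (k - 1) / 2

alpha = 0.05                      # assumed overall significance level
bonferroni_alpha = alpha / pairwise_tests

print(pairwise_tests)             # 10
print(bonferroni_alpha)
```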

Quiz 3: Week 3 Lab Quiz Answers

Q1. There are 1,000 cases in this data set, what do the cases represent?

[expand title=View Answer] The births [/expand]

Q2. How many mothers are we missing weight gain data from?

[expand title=View Answer] 27 [/expand]

Q3. Make side-by-side boxplots of habit and weight. Which of the following is false about the relationship between habit and weight?

[expand title=View Answer] Both distributions are extremely right skewed. [/expand]

Q4. What are the hypotheses for testing if the average weights of babies born to smoking and non-smoking mothers are different?

[expand title=View Answer]
H0: μ_smoking = μ_non-smoking

HA: μ_smoking ≠ μ_non-smoking
[/expand]

Q5. Change the type argument to “ci” to construct and record a confidence interval for the difference between the weights of babies born to smoking and non-smoking mothers. Which of the following is the best interpretation of the interval?

[expand title=View Answer] We are 95% confident that babies born to nonsmoker mothers are on average 0.05 to 0.58 pounds heavier at birth than babies born to smoker mothers. [/expand]

Q6. Calculate a 99% confidence interval for the average length of pregnancies (weeks). Note that since you’re doing inference on a single population parameter, there is no explanatory variable, so you can omit the x variable from the function. Which of the following is a correct interval?

[expand title=View Answer] (38.0892 , 38.5661)[/expand]

Q7. Now, a non-inference task: Determine the age cutoff for younger and mature mothers. Use a method of your choice. What is the maximum age of a younger mom and the minimum age of a mature mom, according to the data?

[expand title=View Answer] The maximum age of younger moms is 34 and the minimum age of mature moms is 35. [/expand]

Inferential Statistics Week 05 Quiz Answers

Quiz 1: Week 4 Practice Quiz

Q1. Suppose you want to construct a confidence interval for a population proportion. Which of the following, if it were true, would prevent you from being able to assume that the distribution of the sample proportion is nearly normal?

[expand title=View Answer] n = 104. Out of these 104 there are only a few successes (15), but relatively many failures (89). [/expand]

Q2. In 2013, Edward Snowden leaked details of top-secret NSA spying activities to the media. A poll conducted by USA TODAY / Pew Research Center asked 1,504 people in U.S. whether Snowden’s leaks have helped or harmed the public interest. 53% of respondents answered “helped the public interest”. You want to test whether a majority of people in the U.S. believe he helped the public interest. Which of the following is the correct set of hypotheses?

[expand title=View Answer] H0: p = 0.5; HA: p > 0.5 [/expand]

Q3. In response to complaints from residents about too many (about 15%) of the cars passing by the local school speeding, the police started closely monitoring traffic. You want to check if the police’s efforts had an effect on the prevalence of speeding in this area. One day you observe 560 different cars pass by the school, and find that 70 of them were speeding. You calculate a p-value of 0.0976. Assuming the cars are representative of all cars that drive by the school, which of the following is true?

[expand title=View Answer] If in fact the police’s efforts didn’t have an effect, the probability of getting a random sample of 560 cars where 70 or less cars are speeding is 0.0976. [/expand]

Q4. When do we use the pooled proportion in calculation of the standard error of the difference of two proportions (SE_{(\hat{p}_1 − \hat{p}_2)})?

[expand title=View Answer] when conducting a hypothesis test where the null hypothesis is H0: p1 = p2 [/expand]

Q5. Rock-paper-scissors is a hand game played by two or more people where players choose to sign either ‘rock’, ‘paper’, or ‘scissors’ with their hands. We would like to test if players choose between these three options randomly, or if certain options are favored above others. What hypothesis test should we conduct to answer this research question?

[expand title=View Answer] Chi-square test of goodness of fit [/expand]

Q6. When doing a hypothesis test on a single proportion (i.e. for one categorical variable), we have studied how to calculate the p-value for the hypothesis test, beginning with generating simulated samples. Which of the following is the best description for how you should generate the simulated samples, and why?

[expand title=View Answer]Generate simulated samples based on the null hypothesis because we need to see how extreme our observed data looks if the null hypothesis is really true. [/expand]

Q7. True or false: In the calculation of the required sample size for a given margin of error of the confidence interval for a population proportion, we should use p= 0.5 if we don’t have any knowledge about the characteristics of the population.

[expand title=View Answer] True [/expand]
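The required sample size for a proportion is n ≥ z*² · p(1 − p) / ME², and p(1 − p) is largest at p = 0.5, so that value gives the most conservative (largest) n. A sketch, with a hypothetical target margin of error of 0.03:

```python
import math
from statistics import NormalDist

def required_n(p, margin, confidence=0.95):
    """Smallest n whose margin of error is at most `margin` for proportion p."""
    z_star = NormalDist().inv_cdf((1 + confidence) / 2)
    return math.ceil(z_star**2 * p * (1 - p) / margin**2)

margin = 0.03   # hypothetical target margin of error
for p in (0.1, 0.3, 0.5, 0.7, 0.9):
    print(p, required_n(p, margin))   # the largest n appears at p = 0.5
```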

Q8. Suppose in a population 20% of people wear contact lenses. What is the expected shape of the sampling distribution of the proportion of contact lens wearers in random samples of 1000 people from this population?

[expand title=View Answer] nearly normal [/expand]

Q9. True/False: When the success-failure condition is not met, we should use a T-test to compare two proportions.

[expand title=View Answer] False[/expand]

Quiz 2: Week 4 Quiz Answers

Q1. Which of the following is not required for the distribution of the sample proportion to be nearly normal?

[expand title=View Answer] Sample size should be at least 30 and the population distribution should not be extremely skewed. [/expand]

Q2. When checking conditions for calculating a confidence interval for a proportion, you should use which number of successes and failures?

[expand title=View Answer] Observed [/expand]

Q3. In May 2011, Gallup asked 1,721 students in grades five through twelve if their school teaches them about money and banking. Researchers are interested in finding out if a majority of students receive such education. Which of the following is the correct set of hypotheses?

[expand title=View Answer] H0: p = 0.5; HA: p > 0.5 [/expand]

Q4. The campaign manager for a congressional candidate claims that the candidate has more than 50% support from the district’s electorate. A newspaper collects a simple random sample of 500 likely voters in this district and estimates the support for this candidate to be 52%. The p-value for the hypothesis test evaluating the campaign manager’s claim is 0.19. Which of the below is correct?

[expand title=View Answer]If in fact 50% of likely voters support this candidate, the probability of obtaining a random sample of 500 likely voters where 52% or more support the candidate is 0.19. [/expand]
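The quoted p-value can be reproduced with a one-proportion Z test. This sketch uses the normal approximation and computes the standard error from the null value, as is usual for hypothesis tests on proportions:

```python
import math
from statistics import NormalDist

p0, p_hat, n = 0.50, 0.52, 500

se_null = math.sqrt(p0 * (1 - p0) / n)  # SE computed from the null value
z = (p_hat - p0) / se_null
p_value = 1 - NormalDist().cdf(z)       # upper tail, since HA: p > 0.5

print(round(p_value, 2))  # 0.19
```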

Q5. Gallup conducts an annual poll of U.S. residents. Approximately 1,000 residents across all 50 states and Washington D.C. are asked “Do you believe the use of marijuana should be made legal?” The distribution of responses by date of the survey is shown in the table below. Imagine a hypothesis test evaluating whether there is a difference from 2012 to 2013 between the proportions of “yes” responses. Using the information in the table below, calculate the standard error for this hypothesis test. Choose the closest answer.

[expand title=View Answer] 0.4754 [/expand]

Q6. “In statistical inference for proportions, standard error (SE) is calculated differently for hypothesis tests and confidence intervals.” Which of the following is the best justification for this statement?

[expand title=View Answer] Because in hypothesis testing, we assume the null hypothesis is true, we calculate SE using the null value of the parameter. In confidence intervals, there is no null value, hence we use the sample proportion(s). [/expand]

Q7. At the beginning of the semester, an anonymous survey was conducted on students in a statistics class. Two of the questions on the survey were about gender and whether or not students have equal, more, or less energy in the afternoon compared to the morning. Below are the results.

What test should we perform to see if gender and energy level are associated?

[expand title=View Answer] Chi-square test of independence [/expand]

Q8. A variety of studies suggest that 10% of the world’s population is left-handed. It is also claimed that artists are more likely to be left-handed. In order to test this claim we take a random sample of 40 art students at a college and find that 6 of them (15%) are left-handed. Which of the following is the correct setup for calculating the p-value for this test?

[expand title=View Answer] Randomly sample 40 non-art students, and record the number of left-handed students in the sample. Repeat this many times and calculate the proportion of samples where at least 15% of the students are left-handed. [/expand]
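A simulation-based p-value draws samples from a population in which the null value holds, here a 10% left-handedness rate. The sketch below simulates that directly with random numbers rather than with real students, and compares the result with the exact binomial tail; the seed and number of simulations are arbitrary choices.

```python
import math
import random

rng = random.Random(42)            # fixed seed for repeatability
n, p_null, observed = 40, 0.10, 6  # sample size, null proportion, observed count

def simulate_count():
    """Number of left-handed people in one simulated sample under H0."""
    return sum(rng.random() < p_null for _ in range(n))

sims = 10_000
simulated_p = sum(simulate_count() >= observed for _ in range(sims)) / sims

# Exact binomial probability of 6 or more left-handed students out of 40.
exact_p = sum(
    math.comb(n, k) * p_null**k * (1 - p_null) ** (n - k)
    for k in range(observed, n + 1)
)

print(round(exact_p, 3), simulated_p)
```

Both values come out near 0.2, so a sample with 15% left-handed students is not unusual when the true rate is 10%.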

Q9. True or false: The χ² statistic is always non-negative.

[expand title=View Answer] True[/expand]

Q10. 80% of Americans start the day with a cereal breakfast. Based on this information, determine if the following statement is true or false.

“The sampling distribution of the proportions of Americans who start the day with a cereal breakfast in random samples of size 40 is right skewed.”

[expand title=View Answer] False [/expand]
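
One way to check this is the success-failure condition together with the sign of the binomial skewness. The sketch below is a minimal illustration; the function name and the threshold of 10 expected successes and failures are the usual rule of thumb, stated here as assumptions.

```python
import math

def sampling_distribution_shape(p, n, threshold=10):
    """Describe the shape of the sampling distribution of a sample proportion."""
    expected_successes = n * p
    expected_failures = n * (1 - p)
    # Skewness of a binomial count (and of the proportion): (1 - 2p) / sqrt(n p (1 - p))
    skewness = (1 - 2 * p) / math.sqrt(n * p * (1 - p))
    if expected_successes >= threshold and expected_failures >= threshold:
        shape = "nearly normal"
    elif skewness < 0:
        shape = "left skewed"
    elif skewness > 0:
        shape = "right skewed"
    else:
        shape = "symmetric"
    return expected_successes, expected_failures, shape

successes, failures, shape = sampling_distribution_shape(p=0.80, n=40)
print(f"Expected successes: {successes:.0f}, expected failures: {failures:.0f}")
print(f"Shape: {shape}")
```

With p = 0.80 and n = 40 there are only about 8 expected failures, and the proportion cannot exceed 1, so the distribution has its longer tail on the left rather than the right.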

Q11. At a stop sign, some drivers come to a full stop, some come to a ‘rolling stop’ (not a full stop, but slow down), and some do not stop at all. We would like to test if there is an association between gender and type of stop (full, rolling, or no stop). We collect data by standing a few feet from a stop sign and taking note of type of stop and the gender of the driver. What are the hypotheses for testing for an association between gender and type of stop?

[expand title=View Answer]
H0: Gender and type of stop are independent.

HA: Gender and type of stop are associated.
[/expand]
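
To make the test behind these hypotheses concrete, the sketch below computes the chi-square statistic for a two-way table in plain Python. The counts are hypothetical (the quiz gives no data for this question), and the function name is illustrative.

```python
def chi_square_statistic(table):
    """Chi-square statistic and degrees of freedom for a two-way table of counts."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    grand_total = sum(row_totals)
    statistic = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            # Expected count if the row and column variables are independent (H0).
            expected = row_totals[i] * col_totals[j] / grand_total
            # Each term is a squared difference over a positive count,
            # so the statistic can never be negative.
            statistic += (observed - expected) ** 2 / expected
    df = (len(table) - 1) * (len(table[0]) - 1)
    return statistic, df

# Hypothetical counts: rows are the two genders,
# columns are full stop, rolling stop, no stop.
observed_counts = [
    [30, 50, 20],
    [45, 40, 15],
]
statistic, df = chi_square_statistic(observed_counts)
print(f"chi-square = {statistic:.4f}, df = {df}")
```

A large statistic relative to the chi-square distribution with (rows - 1) × (columns - 1) degrees of freedom would count as evidence against independence.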

Q12. Does Weight Watchers work? Researchers randomly divided 500 people into two equal-sized groups. One group spent 6 months on the Weight Watchers program. The other group received a pamphlet about controlling portion sizes. At the end of the study 35% of the subjects in the pamphlet group and 55% of the subjects in the Weight Watchers group had lost at least 10 pounds. To test whether Weight Watchers is more effective for weight loss than pamphlets, a statistician used an index card to represent each subject in the study and wrote whether or not the subject lost at least 10 pounds on the index card. He then shuffled these cards together, and dealt them into two equal-sized groups. Which of the following best describes the expected result?

[expand title=View Answer] The difference between the proportions of cards indicating whether or not the subject lost at least 10 pounds will be about 0.[/expand]
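
The card-shuffling procedure can be mimicked in a short simulation. The sketch below assumes 225 of the 500 cards are marked as a success (0.35 × 250 + 0.55 × 250); the function name, seed, and number of shuffles are illustrative.

```python
import random

def shuffled_differences(successes=225, total=500, n_shuffles=2_000, seed=1):
    """Shuffle the cards repeatedly and record the difference in proportions."""
    rng = random.Random(seed)
    # 1 = lost at least 10 pounds, 0 = did not.
    cards = [1] * successes + [0] * (total - successes)
    half = total // 2
    diffs = []
    for _ in range(n_shuffles):
        rng.shuffle(cards)
        # Deal the shuffled cards into two equal-sized groups.
        p_first = sum(cards[:half]) / half
        p_second = sum(cards[half:]) / half
        diffs.append(p_first - p_second)
    return diffs

diffs = shuffled_differences()
mean_diff = sum(diffs) / len(diffs)
print(f"Mean difference across shuffles: {mean_diff:.4f}")
```

Because shuffling breaks any link between group and outcome, the simulated differences scatter around 0, which is the value expected under the null hypothesis of no difference between the programs.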

Quiz 3: Week 4 Lab Quiz Answers

Q1. How many people were interviewed for this survey?

[expand title=View Answer] 51,927 people. The poll, conducted by WIN-Gallup International, covered 57 countries. [/expand]

Q2. Which of the following methods were used to gather information?

[expand title=View Answer]
Face to face

Telephone

Internet
[/expand]

Q3. In the first paragraph, several key findings are reported. These percentages appear to be sample statistics.

[expand title=View Answer] False [/expand]

Q4. The title of the report is “Global Index of Religiosity and Atheism”. To generalize the report’s findings to the global human population, we must assume that the sample was a random sample from the entire population. This does seem to be a reasonable assumption.

[expand title=View Answer] True[/expand]

Q5. What does each row of Table 6 correspond to?

[expand title=View Answer] Religions[/expand]

Q6. What does each row of atheism correspond to?

[expand title=View Answer] Countries [/expand]

Q7. Using the command below, create a new dataframe called us12 that contains only the rows in atheism associated with respondents to the 2012 survey from the United States. Next, calculate the proportion of atheist responses. [TRUE / FALSE] This percentage agrees with the percentage in Table 6.

[expand title=View Answer] True [/expand]

Q8. Based on the R output, what is the margin of error for the estimate of the proportion of atheists in the US in 2012?

[expand title=View Answer] The margin of error for the estimate of the proportion of atheists in the US in 2012 is 0.0135. [/expand]

Q9. Which of the following is false about the relationship between p and ME?

[expand title=View Answer] The most conservative estimate for calculating a confidence interval occurs when p is set to 1 [/expand]
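
The relationship between p and the margin of error can be checked numerically. The sketch below assumes a 95% confidence level (z* = 1.96) and a hypothetical sample size of 1,000; the function name is illustrative.

```python
import math

def margin_of_error(p, n, z_star=1.96):
    """Margin of error for a proportion: z* times sqrt(p(1 - p) / n)."""
    return z_star * math.sqrt(p * (1 - p) / n)

n = 1000  # hypothetical sample size, for illustration only
grid = [i / 100 for i in range(0, 101)]          # p = 0.00, 0.01, ..., 1.00
margins = [margin_of_error(p, n) for p in grid]

# Find the value of p that gives the largest margin of error.
worst_case_p = grid[margins.index(max(margins))]
print(f"ME is largest at p = {worst_case_p}")
print(f"ME at p = 1: {margin_of_error(1.0, n)}")
```

The margin of error peaks at p = 0.5 and shrinks to 0 at p = 0 and p = 1, so the conservative choice is p = 0.5, not p = 1.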

Q10. There is convincing evidence that Spain has seen a change in its atheism index between 2005 and 2012.

[expand title=View Answer] True [/expand]

Q11. There is convincing evidence that the United States has seen a change in its atheism index between 2005 and 2012.

[expand title=View Answer] False [/expand]

Q12. If in fact there has been no change in the atheism index in the countries listed in Table 4, in how many of those countries would you expect to detect a change (at a significance level of 0.05) simply by chance? Hint: Type 1 error.

[expand title=View Answer] 1 [/expand]

Q13. Suppose you’re hired by the local government to estimate the proportion of residents that attend a religious service on a weekly basis. According to the guidelines, the estimate must have a margin of error no greater than 1% with 95% confidence. You have no idea what to expect for p. How many people would you have to sample to ensure that you are within the guidelines? Hint: Refer to your plot of the relationship between p and margin of error. Do not use the data set to answer this question.

[expand title=View Answer] At least 9604 people [/expand]
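
The figure of 9604 follows from solving ME = z* × sqrt(p(1 - p) / n) for n with the conservative choice p = 0.5. The sketch below is one way to reproduce it, assuming z* = 1.96 for 95% confidence; the function name is illustrative, and exact fractions are used so that floating-point rounding cannot push the result up by one.

```python
import math
from fractions import Fraction

def required_sample_size(margin_of_error, z_star="1.96", p="0.5"):
    """Smallest n with z* * sqrt(p(1 - p) / n) <= margin_of_error."""
    me = Fraction(margin_of_error)
    z = Fraction(z_star)
    p = Fraction(p)
    # Rearranged formula: n >= (z* / ME)^2 * p * (1 - p)
    n = (z / me) ** 2 * p * (1 - p)
    return math.ceil(n)

n_required = required_sample_size("0.01")
print(n_required)  # 9604
```

Using p = 0.5 gives the largest possible value of p(1 - p), so the resulting sample size is sufficient whatever the true proportion turns out to be.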