**Introduction to Statistics Coursera Quiz**

Welcome to the world of statistics! Whether you’re a beginner or already have some knowledge of the subject, taking a statistics course on Coursera can be an excellent way to enhance your understanding and boost your data analysis skills. And what better way to test your knowledge than with the Statistics Coursera Quiz? In this blog post, we’ll introduce you to the quiz, share some tips and insights, and help you prepare for success.

**Understanding the Format**

The Statistics Coursera Quiz is designed to assess your understanding of various statistical concepts covered in the course. The quiz can include multiple-choice questions, fill-in-the-blank questions, and even open-ended questions where you’ll need to explain your reasoning.

**What to Expect**

The quiz covers a wide range of topics, including probability, hypothesis testing, regression analysis, and data visualization. Each topic is broken down into smaller sections, allowing you to focus on specific areas of statistical analysis.

**Preparation Tips**

To excel in the Statistics Coursera Quiz, it’s important to follow these tips:

**1. Review Course Materials**: Make sure you have a solid understanding of the course materials. Go through the lectures, readings, and any supplementary materials provided by the instructor.

**2. Practice, Practice, Practice**: Take advantage of any practice quizzes or assignments that are available in the course. These will help you build your skills and boost your confidence.

**3. Form Study Groups**: Collaborate with other learners to discuss concepts, solve problems, and share insights. Learning from others and explaining ideas can significantly enhance your understanding.

**4. Utilize External Resources**: Don’t limit yourself to the course materials. Explore additional resources such as textbooks, online tutorials, and interactive websites to reinforce your knowledge.

**Strategies for Success**

Once you’re well-prepared, it’s time to tackle the Statistics Coursera Quiz. Here are some strategies to help you succeed:

**1. Read the Questions Carefully**: Take the time to understand what each question is asking. Pay attention to keywords and any specific details mentioned.

**2. Manage Your Time**: The quiz may have a time limit, so be mindful of how you allocate your time for each question. Don’t spend too long on a single question if you’re unsure.

**3. Show Your Work**: For open-ended questions, show your thought process and work through the problem step-by-step. Even if your final answer isn’t correct, you can still earn partial credit for your approach.

**4. Double-Check Your Answers**: Before submitting your quiz, review all your answers. Look for any errors or inconsistencies to ensure the best possible outcome.

**All Modules: Introduction to Statistics Coursera Quiz Answer**

**Quick Quiz About the Requirements**

**Q1. When is the deadline for the first Module?**

- The course is self-paced, so there is no set deadline. I just need to make sure that I complete all the assignments before the access closes.

**Q2. What is the passing threshold for an assignment in a specific module?**

- 80%

**Q3. What is your goal for this course? How will you achieve it? This space is for you. Be as specific as you want.**

**Hello, Coursera community,**I’m thrilled to share my insights on the goals and strategies for this course. My aim here is to provide you with a clear understanding of what this course is all about and how I plan to help you achieve your learning objectives. Let’s dive in:

**Goal for this Course:**My primary goal for this course in statistics is to empower you with comprehensive knowledge and practical skills in digital marketing. By the end of this course, you should be able to confidently navigate the digital landscape, strategize effective marketing campaigns, and optimize your online presence for maximum impact. Specifically, I aim to help you:

**Master the Fundamentals:**Build a strong foundation by grasping the core concepts of digital marketing, including SEO, social media, content creation, email marketing, and more.

**Develop Strategic Thinking:**Understand how to analyze market trends, consumer behavior, and competitors to devise well-informed digital marketing strategies.

**Hands-on Experience:**Gain practical experience through real-world projects, case studies, and simulations that mirror industry scenarios.

**Measure and Optimize:**Learn to track key performance indicators (KPIs), interpret data analytics, and refine your strategies for continuous improvement.

Stay Updated: Develop the skills to keep Statistics up-to-date with the ever-evolving digital landscape and adapt to new technologies and trends.

Achieving the Goal: To achieve these goals, I’ve designed a structured approach that combines theory, application, and interaction.

**Comprehensive Modules:**The course is divided into modules, each focusing on a specific aspect of digital marketing. This ensures a step-by-step understanding of the subject matter.

**Engaging Content:**I’ve curated engaging video lectures, interactive quizzes, and reading materials that cater to various learning styles.

**Practical Assignments:**You’ll work on Statistics assignments that mirror real-world challenges, such as creating a social media campaign, optimizing a website, or drafting effective email newsletters.

**Live Q&A Sessions:**Regular live sessions will allow you to interact with me directly, ask questions, and seek clarifications on complex topics.

**Peer Collaboration:**Collaborative projects and discussions with fellow learners will encourage the exchange of ideas and insights.

**Feedback and Improvement:**Constructive feedback on your assignments and projects will guide your growth, and you’ll have the chance to refine your work based on suggestions.

**Resource Library:**Access to a curated collection of industry resources, case studies, and tools will enhance your learning journey.

**Example:**For instance, in the module on SEO, you’ll learn about optimizing website content for search engines. You’ll apply this knowledge by conducting keyword research, implementing on-page SEO strategies, and analyzing website traffic to measure your success.

In

**conclusion**, my goal for this course is to equip you with the knowledge and skills needed to excel in the dynamic world of digital marketing. Through a blend of theory, hands-on experience, and interactive learning, you’ll gain the confidence to craft effective strategies and achieve remarkable results in the digital realm. I’m excited to embark on this learning journey with each of you!

**Best regards**,

[Your Name]

**Introduction and Descriptive Statistics for Exploring Data**

**Q1. What is an appropriate way to visualize a list of the eye colors of 120 people? Select all that apply.**

- pie chart
- dot plot

**Q2. According to**** the histogram of travel times to work from the US 2000 census (Page 6 of “Journey to Work: 2000”)****, roughly what percentage of commuters travel more than 45 minutes?**

- %Percentage = frac450500%Percentage = 500450

Divide 450 by 500

%Percentage = 0.9%Percentage = 0.9

Express as percentage

%Percentage = 90%Percentage = 90%

Hence, the percentage of commuters that travel more than 45 minutes is 90%

**Q3. According to**** the histogram of travel times to work from the US 2000 census (Page 6 of “Journey to Work: 2000”)****, approximately what is the median travel time, in minutes (i.e., 50% of commuters have at most that travel time, 50% have at least that travel time)?**

- %Percentage = frac450500%Percentage = 500450

Divide 450 by 500

%Percentage = 0.9%Percentage = 0.9

Express as percentage

%Percentage = 90% %Percentage = 90%

Hence, the percentage of commuters that travel more than 45 minutes is 90%

**Q4. You want to investigate whether households in California tend to have a higher income than households in Massachusetts. Which summary measure would you use to compare the two states?**

- median household income

**Q5. Suppose all household incomes in California increase by 5%. How does that change the mean household income?**

- the mean household income goes up by 5%

**Q6. Suppose all household incomes in California increase by 5%. How does that change the median household income?**

- median household income goes up by 5%

**Q7. Suppose all household incomes in California increase by 5%. How does that change the standard deviation of household incomes?**

- the standard deviation of household incomes goes up by 5%

**Q8. Suppose all household incomes in California increase by 5%. How does that change the interquartile range of household incomes?**

- the interquartile range of household incomes goes up by 5%

**Q9. Suppose all household incomes in California increase by $5,000. How does that change the mean household income?**

- the mean household income goes up by $5,000

**Q10. Suppose all household incomes in California increase by $5,000. How does that change the median household income?**

- the median household income goes up by $5,000

**Q11. Suppose all household incomes in California increase by $5,000. How does that change the standard deviation of household incomes?**

- the standard deviation of the household incomes doesn’t change

**Q12. Suppose all household incomes in California increase by $5,000. How does that change the interquartile range of household incomes?**

- the interquartile range of household incomes doesn’t change

**Q13. The median sales price for houses in a certain county during the last year was $342,000. What can we say about the percentage of sales represented by the houses that sold for more than $342,000?**

- the houses that sold for more than $342,000 represent exactly 50% of all sales

**Producing Data and Sampling**

**Q1. A new company located next to Times Square in New York wants to get a sense of how people feel about a proposed law on immigration. A reporter steps out of the building Statistics and randomly selects 100 people walking there and asks them about the proposed law. What can we say about this sampling plan? Single correct answer.**

- it leads to selection bias

**Q2. A car company wants to get a sense of how satisfied the owners of its new car model are with the quality of that car. It randomly selects 250 numbers from all the vehicle Statistics registration numbers that have been issued for this model and contacts the owners of that model. What can we say about this sampling plan?**

- it represents a simple random sampling

**Q3. An airline wants to do a customer survey in order to improve its service. For one month, it sends an email to a random sample of customers who booked with the airline on the previous day (no customer will be contacted more than once). The email states that the airline would like the customer to fill out a 10-minute survey in order to help the airline improve its service. What can we say about this sampling plan? Single correct answer.**

- it leads to non-response bias

**Q4. As in the previous question, an airline wants to do a customer survey in order to improve its service. For one month, it sends an email to a random sample of customers who flew with the airline on the previous day (no customer will be contacted more than once). Again, the email states that the airline would like the customer to fill out a 10-minute survey in order to help the airline improve its service, but this time it states in addition that every respondent will receive a gift card worth $100. What can we say about this sampling plan?**

- it leads to non-response bias

**Q5. Some years ago, there were many news reports about the “Paleo diet”. It was claimed that the Paleo Diet would result in weight loss as well as the prevention and control of many “diseases of civilization”.**

**A news channel decides to check this out. It recruits people who have followed the diet for the past year and selects 100 at random. It also recruits people who have not followed the diet and selects 100 at random. It finds that there is more weight loss in the diet group and that this result is ‘statistically significant’.**

- It is possible that the difference in weight loss is due to the placebo effect.

**Q6. A number of competitive female cross-country runners suffer from bone loss due to low estrogen levels. Some medical experts conjecture that this can be Statistics prevented by taking oral contraceptives, as those contain estrogen. This conjecture is to be tested with an experiment. The goal of the experiment is to find out whether taking an oral Statistics contraceptive prevents bone loss in female cross-country runners. Which of the following subjects should be recruited in order to do a good experiment? (Pick one of the three.)**

- A group of female runners who are not taking oral contraceptives, but who are willing to take them if asked by the organizers of the experiment to do so.

**The Normal Approximation for Data and the Binomial Distribution**

**Q1. Scores on a certain test follow the normal curve with an average of 1350 and a standard deviation of 120.**

**What percentage of test takers score below 1230? (Use the empirical rule.)**

- 16%

**Q2. As in the previous question, scores on a certain test follow the normal curve with an average of 1350 and a standard deviation of 120. **

**In order to qualify for a certain job, a Statistics candidate needs to score in the top 2.5%. What score does she need?**

- 1590

**Q3. Recall that the main object in a boxplot is a box that is bounded by the first and the third quartiles. So the length of the box is the difference between the third and the first quartile, which is called the interquartile range. This is a measure of the spread of the data; it is sometimes used as an alternative to the Statistics standard deviation. **

**If the data follow the normal curve, then the interquartile range equals how many standard deviations? (You may use the fact that the z-value of the third quartile is 0.7.)**

- 1.4

**Q4. A multiple-choice exam has 5 questions. Each question has 4 possible answers, of which one is correct. If a student Statistics guesses the answers to all five questions, what are the chances that he gets 2 correct?**

- 5!/2!3!â€‹(1/4â€‹)
^{2}(3/4â€‹)^{3}

**Q5. A fair coin is tossed 6 times. What are the chances of getting 2 tails in each of the first 3 and the last 3 tosses?**

- (3!/2!1!â€‹(1/2â€‹)
^{3})(3!/2!1!â€‹(1/2â€‹)^{3})=(3/8â€‹)^{2}

**Q6. A fair coin is tossed 400 times. Approximately what are the chances of getting more than 210 tails? (Use the empirical rule and the normal approximation to the binomial distribution.)**

- 16%

**Sampling Distributions and the Central Limit Theorem**

**Q1. A town has 10,000 registered voters, of whom 6,000 are voting for the Democratic Party. A survey organization is taking a sample of 100 registered voters (assuming sampling with replacement). The percentage of Democratic Statistics voters in the sample will be around ***_, give or take***. (You may use the fact that the standard deviation of 6,000 1s and 4,000 0s is about 0.5.)**

- 60%, give or take 5%

**Q2. You solicit 100 pledges for a charitable organization. Each pledge is equally likely to be $10, $50, or $100. You may use the fact that the standard deviation of the three Statistics amounts to $10, $50, and $100 is $37. What is the expected value of the sum of the 100 pledges?**

- $5333

**Q3. You solicit 100 pledges for a charitable organization. Each pledge is equally likely to be $10, $50, or $100. You may use the fact that the standard deviation of the three Statistics amounts to $10, $50, and $100 is $37. **

**What are the chances that the 100 pledges total more than $5,700?**

- 16%

**Q4. There are two candidates running for governor in CA and they are said to have roughly equal support from the voters. To get a better idea of who is ahead, a company polls 400 of the 20 million registered voters in California. Likewise, there are two candidates running for mayor in Palo Statistics Alto who are said to have roughly equal support, and the company polls 400 out of the 20,000 registered voters in Palo Alto. Will the first poll be more accurate, equally accurate, or less accurate than the second poll?**

- equally accurate

**Q5. The average taxable income reported on tax returns for the year 2016 is $ 45,000, and the standard deviation of the taxable income is $ 23,000. **

**Which of the following two statements is true? Both?**

- The chances that the sum of 100 randomly selected taxable incomes exceeds $ 4 million can be computed from the above information using the normal approximation.

**Q6. Questions (a)-(d) below relate to the following situation: Someone tosses a fair coin 100 times.**

**(a): How many tails can she expect to get?**

- 50

**Q7. (b): What is the “give and take” number for the result from Question (a)?**

- 5

**Q8. (b): What are the chances that she gets between 40 and 60 tails?**

**95%**

**Q9. A large group of people gets together, and everyone tosses a coin 100 times.**

**(d): About what percentage of people will get between 40 and 60 tails?**

- 95%

**Regression**

**Q1. Some people believe that musical activity (e.g. playing an instrument) enhances mathematical ability. 100 high school students were selected at random. For each student, musical activity was recorded in hours per week and mathematical ability was assessed by a test. The correlation coefficient was found to be 0.85. **

**Does the large correlation coefficient prove that musical activity enhances mathematical ability?**

- no

**Q2. What would your answer to the previous question be if you learned that all students in the study came from the same grade?**

- no

**Q3. For a group of commuters commuting to work on a given day, the correlation coefficient between a) time spent waiting at traffic signals, and b) total commuting time, was found to be 0.4. Which of the following statements about the correlation coefficient are true?**

- The more time a commuter spends waiting at traffic signals, the longer the total commute time, on average.
- The more time a commuter spends commuting to work, the more time he spends waiting at traffic signals, on average.

**Q4. A study followed 1,000 children over time. The scatter plot of heights at age 1 vs. heights at age 2 looks football-shaped with a correlation coefficient of r=0.8. Alice’s height at age 1 is in the 80th percentile.**

**Would you predict her height at age 2 to be below, at, or above the 80th percentile?**

- below

**Q5. In the previous question we learned that in a study of children’s height, the correlation coefficient between height at age 1 vs. height at age 2 is r=0.8. **

**Predict the z-score of Alice’s height at age 2. (You may use the fact that the z-score of the 80th percentile is z = 0.85.)**

- (0.8)(0.85) = 0.68

**Q6. Questions (a)-(d) below relate to the following situation: In a biology class, both the midterm scores and the final exam scores have an average of 50 and a standard deviation of 10. The scatterplot looks football-shaped and the correlation coefficient is 0.6. **

**Claudia would like to know what score her friend Emily got in the final. **

**Question (a): If you have no information on how Emily did on the midterm, what is your prediction for her score on the final?**

- 50

**Q7. Question (b): What is the “give or take” number for your prediction from Question (a)?**

**10**

**Q8. Now you learn that Emily got exactly the mean score of 50 on the midterm. **

**Question (c): Given this information, what is your prediction for Emily’s score in the final?**

- 50

**Q9. Question (d): What is the “give or take” number for your prediction from Question (c)?**

- 10(sqrt)1-(0.6)
^{2}=8

**Q10. A tutoring center advertises its services by stating that students who sign up improve their GPA on tests by 0.5 points on average. **

**Is this indeed evidence that the tutoring helps, or could this be due to the regression effect?**

- The improvement could be due to the regression effect.

**Q11. True or false: If an observation with large leverage has a small residual, then it is not influential.**

- False

**Confidence Intervals**

**Q1. A random sample of 500 sales prices of recently purchased homes in a county is taken. From that sample, a 90% confidence interval for the average sales price of all homes in the county is computed to be $215,000 +/- $35,000. **

**Is the following statement true or false? **

**“About 90% of all home sales in the county have a sales price in the range $215,000 +/- $35,000.”**

- false

**Q2. A random sample of 500 sales prices of recently purchased homes in a county is taken. From that sample, a 90% confidence interval for the average sales price of all homes in the county is computed to be $215,000 +/- $35,000. **

**Is the following statement: true or false? **

**“There is a 90% chance that the average sales price of all homes in the county is in the range of $215,000 +/- $35,000.**

- false

**Q3. A poll of 400 eligible voters in a city finds that 313 plan to vote in the next election. Find a 95% confidence interval for the percentage of all eligible voters in the city who plan to vote.**

- 100[313/400 Â±2â€‹ sqrt(313/400) (1-313/400)]/400

**Q4. Questions (a) and (b) below relate to the following: Based on a sample of 500 salaries in a large city, we want to find a confidence interval for the average salary in that city. **

**Question (a): Is it possible to do this using the formula “average +/- z SE”? (Keep in mind that the histogram of salaries is not normal but quite skewed.)**

- yes

**Q5. The margin of error for the confidence interval from Question (a), which was based on 500 salaries, turns out to be $5,400. How many salaries do we need to sample in order to shrink the margin of error to about $2,000?**

**Q6. You are interested in what the current starting salary for jobs in data science is. You solicit feedback on an online forum about data science and you get 230 replies with salary numbers. Can you use the formula “average +/- z SE” to find a confidence interval for the average starting salary?**

- no

**Tests of Significance**

**Q1. Which of the following statements is true? (Select all that apply.)**

- The p-value depends on the data.
- If the null hypothesis is true, then there is less than a 5% chance to get a p-value that is smaller than 5%.
- If a data scientist does many tests, even if all the null hypotheses are true, a certain proportion will be rejected in error.

**Q2. Read the first five paragraphs of the article “Online daters do better in the marriage stakes” by Regina Nuzzo in Nature News, 2013. [You can find it on the internet or here ]. The main claim of the article is that there is a statistically significant difference in marital outcomes between couples that meet online and couples that meet in other ways. Is this finding of practical relevance?**

- no

** Q3. A fair coin is tossed 100-100 times.****Which of the following statements is true? (Select all that apply.)**

- The standard error for the percentage of heads among the 100-100 tosses is 5%
- The standard error for the percentage of tails among the 100-100 tosses is
**5%**

**Q4. Is there a relationship between age and insomnia? A random sample of 184 people ages 18-29 was taken, and it was found that 26.1% suffer from insomnia and 73.9% do not. A separate random sample of 811 people ages 30 and over was taken, and it was found that 39.2% suffer from insomnia and 60.8% do not.**

**Which of the following four test statistics is appropriate for testing whether the prevalence of insomnia is different between the two age groups? (Select all that are.)**

- z = 0.261-0.392/sqrt {0.261(1-0.261)/184 + 0.392 (1-0.392)/811}

**Q5. You want to test whether plain M&Ms really contain 24% blue M&Ms as claimed on the manufacturer’s website. You sample 500 plain M&Ms at random and count the fraction of blue M&Ms.****Which of the following tests is appropriate to address this question?**

*z*-test

**Q6. A high school principal wants to find out whether the average SAT score of this year’s graduating class is higher than last year’s. She samples 13 students from this year’s graduating class at random and wants to compare their average SAT score to the average SAT score from last year’s graduating class.**

*t*-test

**Q7. To investigate whether there is a difference in scholastic abilities between first-borns and second-born siblings, 600 families that have at least two children were randomly selected. The scholastic abilities of the first-born and the second-born siblings were assessed with a test and are to be compared.**

- sign test

**Resampling**

**Q1. We want to use the Monte Carlo method to estimate the probability of getting exactly one ace (one spot) in three rolls of the die.****Which of the following is the correct description for doing this?**

- To simulate three rolls of a die, we draw three times a number at random (with replacement) from 1, 2, 3, 4, 5, and 6. If we get the number `1′ exactly once, then we label this trial to be a success.

**Q2. We want to use the Monte Carlo Method to approximate the standard error of our estimate from Question 1.**Which of the following is the correct description for doing this?

- We repeat the whole Monte Carlo simulation done in Question 1 many times (e.g. 2000 times).

**Q3. We want to use the bootstrap to estimate the bias of ^ Î¸ ^ :****E( Î¸ ^ )âˆ’Î¸**** where Î¸ is some function of our population of interest: = ( population ) Î¸=t(population) and ^ = ( sample ) Î¸ ^ =t(sample). As usual, we only have access to data from a sample of this population. Which of the following is a correct description for doing this?**

- The bootstrap plug-in principle suggests estimating the bias
*E*(*Î¸*^)âˆ’*t*(population)

by*E*(*Î¸*^âˆ—)âˆ’*t*(sample).*E*(*Î¸*^âˆ—) can be approximated by Monte Carlo, resulting in the bootstrap estimate of bias

**Q4. We want to compute a 90% bootstrap percentile interval for the correlation coefficient based on 32 pairs ( 1, 1 ), â€¦, ( 32, 32 ) (X 1 â€‹ , Y 1 â€‹ ),â€¦,(X 32 â€‹ , Y 32 â€‹ ). Which of the following is a correct description for doing this?**

- Resample 32 pairs (that is, don’t break any pairs apart) and compute the correlation coefficient

âˆ—*r*âˆ— of these 32 pairs.

Repeat B=1000 times to get B bootstrap versions*r*1âˆ—â€‹,â€¦,*rB*âˆ—â€‹.

The 90% bootstrap percentile interval is:

**Analysis of Categorical Data**

**Q1. Questions (a)-(d) below relate to the following: Some people suspect that child births may not be equally distributed over the seven days of the week because hospital staff (who can influence the time of delivery in some cases) may prefer to work on certain days of the week. Question (a): Which of the following is the null hypothesis?**

- child births occur equally likely on the seven days of the week

**Q2. To investigate, you note the day of the week of 300 births that were randomly selected from all births that occurred in New York City last year. **

**Question (b): What test should you use to test the null hypothesis?**

- chi-square test for goodness-of-fit

**Q4. Question (d): What would be the answer to Question (b) if you wanted to investigate a simpler question, namely whether the percentage of births on weekends is lower than expected?**

*z*-test

**Q5. This question and the next one are related to the following context: A food delivery start-up decides to advertise its service by placing ads on web pages. They wonder whether the percentage of viewers who click on the ad changes depending on how often the viewers were shown the ad. They randomly select 100 viewers from among those who were shown the ad** once, 135 from among those who were shown the ad** twice, and 150 from among those who were shown the ad three times.**

**Which is the null hypothesis?**

- the chances that the user clicks on the ad are the same for all three groups

**Q6. In the previous question, which test is appropriate to test the null hypothesis?**

- chi-square test of homogeneity

**Q7. A county wants to check whether the racial composition of the teachers in the county corresponds to that of the population in the county. It samples 500 teachers at random and wants to compare that sample with the census numbers about the racial groups in that county. **

**Which test would be appropriate?**

- chi-square test for goodness-of-fit

**Q8. An airline wants to find out whether there is a connection between the customer’s status in its frequent flyer program and the class of ticket that the customer buys. It samples 1,000 ticket records at random and for each ticket notes the status level (‘none’, ‘silver’, ‘gold’) and the ticket class (‘economy’, ‘business’, ‘first’).**

** Which test would be appropriate?**

- chi-square test of independence

**Q9. The airline wants to find out whether there is a connection between the customer’s status in its frequent flyer program and the amount that the customer spends on tickets in the following year. It samples 1,000 ticket records at random and for each ticket notes the status level (‘none’, ‘silver’, ‘gold’) and the amount spent on tickets in the following year. **

**Which test would be appropriate?**

- none of these

**One-Way Analysis of Variance**

**Q1. An online retailer strongly suspects that customers purchase more in the following month if they are shown a company ad more often. To confirm that hunch they randomly select 50 customers who are then sent one ad, 45 customers who are sent two ads, and 52 customers who are sent three ads. **

**Which is the null hypothesis?**

- the spending means for the three groups are the same.

**Q2. Based on the description of the experiment in the previous question and the boxplots below, do you think that the assumptions of ANOVA are met?**

- yes

**Q3. Based on the ANOVA table below and the boxplots, what is the conclusion of the analysis?**

- There is sufficient evidence to conclude that the spending means are not equal, but based on this analysis alone we cannot conclude that the spending means increase with the number of ads.

**Q4. Does eye color affect the type of vision correction that patients choose? From a large dataset of patients having vision correction, 70 patients were chosen randomly from those having brown eyes, 70 from those having green eyes, and 70 from those having blue eyes. For each patient, the type of vision correction was coded as follows: glasses=1, contact lenses=2, corrective surgery=3. Those numbers were used for an ANOVA, which resulted in a p-value of 0.5%. **

**Does the p-value of 0.5% mean that there is strong evidence that eye color has an effect on the type of vision correction that patients choose?**

- no

**Q5. A clinical trial aims to discern whether twelve interventions against high blood pressure have different effects. The study randomizes 10,000 subjects into twelve groups. Each group is administered one of the twelve interventions. After a month the change in blood pressure is measured for each subject. The ANOVA table gives a p-value of 17%. The investigators also perform pairwise two-sample t-tests for all pairs of treatments and find that two pairs show a statistically significant difference.**

** Which of the following options describes a valid conclusion?**

- There is not enough evidence to conclude that the twelve treatment means are different.

**Multiple Comparisons**

**Q1. Recall that a “discovery” occurs when a test rejects the null hypothesis. In the medical literature, a discovery is called a “positive result”. So a “false positive” is a “false discovery”.**

** What is the false discovery proportion (FDP) of the procedure that yielded the following results:**

- 9/9+36

**Q2. A medical study examines whether there is a significant correlation between any of the 12 lifestyle choices and high blood pressure. It doesn’t find any significant correlation, but upon further examination, the researchers find a highly significant ( p-value <0.5%) correlation between two of the lifestyle choices. This correlation seems not to have been noticed before.**

** Which of the following three statements is an appropriate summary of these findings? Select all that apply.**

- The seemingly significant correlation was found as a consequence of data snooping and therefore the
*p*-value is not valid. The researchers shouldn’t report anything. - The seemingly significant correlation was found as a consequence of data snooping and therefore the
*p*-value is not valid. However, this could potentially be a significant new finding. The researchers can report it as such, pointing out that they cannot attach a valid*p*-value to this finding. It can serve as a hypothesis for a future study with new data, which would then allow for statistically valid conclusions.

**Q3. 1,000 tests were evaluated with the Bonferroni correction. 31 tests had corrected p-values smaller than 5%. **

**Which of the following three statements is an appropriate conclusion?**

- This is sufficient evidence to reject all of these 31 null hypotheses because there is only a 5% chance that any of these 31
*p*-values would be this small if the null hypotheses were true.

**Q4. 1,000 tests were evaluated with the FDR at the 5% level, which resulted in 31 discoveries. **

**Which of the following three statements is an appropriate conclusion?**

- If we reject these 31 null hypotheses then we can expect that about 5% of them are rejected in error.

**Keep reading the article**
Introduction to Data Science: A Beginnerâ€™s Guide to Success

**Conclusion**

The Statistics Coursera Quiz is an excellent opportunity to test your knowledge and gauge your understanding of statistics. By following our tips and strategies, you’ll be well-prepared to tackle the quiz with confidence. Remember, the quiz is not just an assessment but also a chance to reinforce your learning and identify areas where you may need further practice. So, embrace the challenge, learn from the experience, and enjoy your statistics journey on Coursera!