- IB
- Question Type 6: Performing the chi-squared test for data categorization such that the expected frequency is at least 5
The student is expected to perform a goodness-of-fit test to determine if a six-sided die is fair based on observed frequencies.
A six-sided die is rolled times with the following outcomes:
| Face | 1 | 2 | 3 | 4 | 5 | 6 |
|---|---|---|---|---|---|---|
| Frequency | 34 | 31 | 28 | 30 | 28 | 29 |
Test at the significance level whether the die is fair.
[6]A survey of 120 high school students was conducted to determine whether their favorite subject (Math, Science, English, History) is independent of their gender. The results are shown in the following table.
| Math | Science | English | History | Total | |
|---|---|---|---|---|---|
| Male | 20 | 10 | 25 | 15 | 70 |
| Female | 10 | 20 | 15 | 5 | 50 |
| Total | 30 | 30 | 40 | 20 | 120 |
A test for independence is performed at the 5% significance level.
(a) State the null hypothesis, , for this test.
[1](b) Calculate the expected frequency for Male students who prefer English.
[2](c) Show that the number of degrees of freedom for this test is 3.
[1](d) Calculate the test statistic for this survey.
[3](e) The critical value for this test at the 5% significance level is 7.815. State the conclusion for this test, giving a reason for your answer.
[2]Chi-squared Test for Independence
A psychologist classifies 140 subjects by coping style (Problem-focused, Emotion-focused, Avoidant) and stress level (Low, Medium, High). The observed frequencies are shown in the following table:
| Low Stress | Medium Stress | High Stress | Total | |
|---|---|---|---|---|
| Problem-focused | 20 | 30 | 10 | 60 |
| Emotion-focused | 10 | 30 | 20 | 60 |
| Avoidant | 5 | 10 | 5 | 20 |
| Total | 35 | 70 | 35 | 140 |
Perform a test of independence at the level of significance to determine whether coping style is independent of stress level.
[8]In a genetics experiment, 120 plants are classified by flower color: purple, pink, or white. The expected Mendelian ratio is . Observed counts are 54 purple, 20 pink, and 46 white.
Test at the significance level whether the data fit this ratio.
[7]A random sample of 160 drivers is classified by vehicle type and cellphone use. The results are shown in the following table:
| Cellphone Use (Yes) | Cellphone Use (No) | Total | |
|---|---|---|---|
| Car | 30 | 50 | 80 |
| SUV | 20 | 40 | 60 |
| Truck | 10 | 10 | 20 |
| Total | 60 | 100 | 160 |
A test for independence is conducted at the significance level.
State the null and alternative hypotheses for this test.
[2]Calculate the expected frequency of SUV drivers who use cellphones.
[2]For this test, find: (i) the test statistic; (ii) the number of degrees of freedom.
[4]State the conclusion of the test, giving a reason for your answer.
[4]A marketing survey of 180 respondents was conducted to determine if there is an association between age group and preference for online shopping. The results are shown in the following table:
| Age group | Online Shopping (Yes) | Online Shopping (No) | Total |
|---|---|---|---|
| 18–29 | 50 | 10 | 60 |
| 30–49 | 40 | 20 | 60 |
| 50+ | 30 | 30 | 60 |
| Total | 120 | 60 | 180 |
A test of independence is carried out at the significance level.
State the null and alternative hypotheses for this test.
[2]Calculate the expected frequency for a respondent in the '50+' age group who prefers online shopping ('Yes').
[2]Show that the value of the statistic for this data is 15.
[3]State the number of degrees of freedom for this test.
[1]The critical value for this test at the significance level is .
Determine whether there is an association between age group and online shopping preference. Justify your answer.
[2]A group of 120 students are asked to choose one of four teaching methods. The number of students who chose each method is shown in the table below.
Perform a goodness of fit test, at the 5% significance level, to determine whether students have a preference for any of the teaching methods.
[6]A manufacturer claims that of its light bulbs last at least hours, last between and hours, and last less than hours. In a test of bulbs, lasted h, lasted h, and lasted h.
Test the claim at the level.
[7]In a study of 200 patients, blood type (A, B, AB, O) and the presence of a certain antibody (positive or negative) were recorded. The results are shown in the following table.
| Positive | Negative | Total | |
|---|---|---|---|
| Type A | 40 | 60 | 100 |
| Type B | 20 | 30 | 50 |
| Type AB | 10 | 10 | 20 |
| Type O | 30 | 0 | 30 |
| Total | 100 | 100 | 200 |
A test for independence is performed at the significance level.
(a) State the null hypothesis, , for this test.
(b) Show that the expected frequency for a patient having blood type O and testing positive for the antibody is 15.
(c) Calculate the test statistic for this data.
(d) State the number of degrees of freedom for this test.
(e) The critical value for this test is 7.815. State the conclusion of the test, giving a reason for your answer.
[8]A retailer claims that 20% of customers choose brand A, 30% brand B, 25% brand C, and 25% brand D. In a random sample of 200 purchases, the counts are 38, 64, 50, and 48 respectively. Test the claim at the 5% significance level.
[7]Chi-square goodness-of-fit test
A snack company claims that customers choose its three chip flavors—salted, barbecue, and sour cream & onion—with equal probability. In a survey of customers, the observed counts are salted, barbecue, and sour cream & onion. Test at the significance level whether the flavor choices are equally likely.
[7]