- IB
- Question Type 5: Categorizing data to ensure a specific expected frequency
The question requires a comparative analysis of two different test groupings, evaluating the trade-offs between granularity (sensitivity) and statistical power.
Compare the two (chi-square) tests in questions 3 and 4 in terms of sensitivity and category selection. Which grouping would you prefer and why?
[5]The following table shows the frequencies of responses for two age categories, and .
Conduct a chi-square test of independence at the level of significance. State the null and alternative hypotheses, the test statistic, the degrees of freedom, and the conclusion of the test.
The critical value for this test is .
[7]In a goodness-of-fit test for a distribution with 4 categories where 2 parameters have been estimated from the data, what is the appropriate degrees of freedom?
[2]The question requires performing a goodness-of-fit test for a Poisson distribution. It involves estimating the mean from the data, calculating expected frequencies, pooling categories with low expected frequencies, determining degrees of freedom, and drawing a conclusion based on a critical value at the 1% significance level.
Data on the number of defects per unit yields the following observed frequencies:
| Number of defects () | 0 | 1 | 2 | 3 | 4 |
|---|---|---|---|---|---|
| Observed frequency () | 50 | 30 | 15 | 4 | 1 |
Assuming a Poisson distribution with the mean estimated from the data, perform a goodness-of-fit test at the 1% significance level. Calculate the expected frequencies, the test statistic, the degrees of freedom, and state your conclusion.
[9]Determine the degrees of freedom for a test of independence between a categorical variable with levels and another with levels.
[3]Statistics and probability: Chi-squared test for independence
Outline the procedure to select appropriate age categories for a chi-square test of independence when given raw age data and a binary preference variable, ensuring that all expected frequencies exceed 5.
[4]A contingency table yields the observed frequencies below:
Carry out the chi-square test of independence at the significance level. Compute the statistic, the degrees of freedom, and state your conclusion.
[6]A set of continuous data is grouped into 5 intervals with observed frequencies . The sample mean and variance are estimated as and . A chi-square goodness-of-fit test is to be conducted at a significance level to determine if the data follows a normal distribution.
Outline the steps required to compute the expected frequencies from the normal model, state the degrees of freedom for this test, and describe the calculation of the test statistic.
[8]The following observed frequencies for a categorical variable with 4 equally likely outcomes were recorded: . Carry out a goodness-of-fit test at the significance level. Compute the test statistic, the number of degrees of freedom, and the conclusion of the test. [6 marks]
[6]Given a detailed contingency table with age in single years and preference counts, describe how you would combine adjacent age classes to achieve the expected frequency condition . What criteria guide your decisions?
[6]When performing a goodness-of-fit test to a Poisson model, you estimated the mean from the data and used 6 observed categories (). State the degrees of freedom for the test.
[2]In a trial of a die rolled 120 times, the observed face counts for the six faces are . Test whether the die is fair at the 1% significance level.
[7]