- IB
- AHL 4.12—Data collection, reliability and validity tests
Practice AHL 4.12—Data collection, reliability and validity tests with authentic IB Mathematics Applications & Interpretation (AI) exam questions for both SL and HL students. This question bank mirrors Paper 1, 2, 3 structure, covering key topics like core principles, advanced applications, and practical problem-solving. Get instant solutions, detailed explanations, and build exam confidence with questions in the style of IB examiners.
A marine biologist, Raj, investigates whether water temperature ( ) affects the swimming speed (m/s) of a fish species. He observes 10 fish in controlled tanks and records:
| Fish | Temperature (X) | Speed (Y) |
|---|---|---|
| 1 | 15 | 0.8 |
| 2 | 16 | 0.9 |
| 3 | 18 | 1.0 |
| 4 | 20 | 1.2 |
| 5 | 22 | 1.3 |
| 6 | 24 | 1.5 |
| 7 | 26 | 1.6 |
| 8 | 28 | 1.7 |
| 9 | 30 | 1.8 |
| 10 | 32 | 1.9 |
Raj assumes a linear model and tests if the sample mean temperature aligns with an ocean average of . He also checks if speeds are normally distributed.
Name a test to verify normality of swimming speeds.
Calculate the correlation coefficient, .
Conduct a one-tailed test at the significance level for positive correlation. State hypotheses and conclusion.
(i) Find the linear regression equation. (ii) Predict the speed at .
Test if the sample mean temperature differs from at the significance level. State hypotheses and conclusion.
Suggest one way to improve the validity of Raj's study.

A nutritionist, Clara, investigates whether daily fiber intake (grams) influences energy levels in adults. She surveys 15 adults, recording their fiber intake and energy scores (out of 100) based on a standardized questionnaire. The data is:
| Adult | Fiber Intake (X) | Energy Score (Y) |
|---|---|---|
| 1 | 10 | 60 |
| 2 | 12 | 62 |
| 3 | 15 | 65 |
| 4 | 18 | 68 |
| 5 | 20 | 70 |
| 6 | 22 | 72 |
| 7 | 25 | 75 |
| 8 | 28 | 78 |
| 9 | 30 | 80 |
| 10 | 32 | 82 |
| 11 | 35 | 85 |
| 12 | 38 | 88 |
| 13 | 40 | 90 |
| 14 | 42 | 92 |
| 15 | 45 | 95 |
Clara assumes a linear model and tests if the sample mean fiber intake aligns with a population mean of 25 grams. She also checks for normality of energy scores.
Name a test to verify if energy scores are normally distributed.
Calculate Pearson's correlation coefficient, .
Conduct a one-tailed test at the significance level to determine if fiber intake positively correlates with energy scores. State hypotheses and conclusion.
(i) Find the linear regression coefficients and . (ii) Predict the energy score for a fiber intake of 27 grams.
Test if the sample mean fiber intake differs from 25 grams at the significance level. State the test, hypotheses, and conclusion.
Suggest one method to improve the reliability of Clara's study.
A farmer claims defective apples in crates of 10 follow . She inspects 100 crates:
| Defective Apples | Crates |
|---|---|
| 0 | 20 |
| 1 | 35 |
| 2 | 25 |
| 3 | 15 |
| 4 | 5 |
Estimate .
Test at significance if the data follows . State hypotheses, expected frequencies, and conclusion.
A six-sided dice is rolled 60 times, with the following results.
| Outcome | Frequency |
|---|---|
| 1 | 12 |
| 2 | 15 |
| 3 | 13 |
| 4 | 7 |
| 5 | 6 |
| 6 | 7 |
Is there evidence, at the 5% significance level, that the dice is not fair?
State the null and alternative hypotheses for the chi-squared test.
Calculate the expected frequency for each outcome if the dice is fair.
Calculate the chi-squared statistic and p-value for the observed frequencies.
State the conclusion for the test at the 5% significance level. Give a reason for your answer.
A six-sided die is rolled 39 times, with the following results.
| Outcome | 1 | 2 | 3 | 4 | 5 | 6 |
|---|---|---|---|---|---|---|
| Frequency | 8 | 3 | 7 | 9 | 5 | 7 |
Is there evidence, at the 10% significance level, that the die is not fair?
A factory, producing plastic gifts for a fast food restaurant’s Jolly meals, claims that just $1\%$ of the toys produced are faulty.
A restaurant manager wants to test this claim. A box of $200$ toys is delivered to the restaurant. The manager checks all the toys in this box and four toys are found to be faulty.
The restaurant manager performs a one-tailed hypothesis test, at the $10\%$ significance level, to determine whether the factory’s claim is reasonable. It is known that faults in the toys occur independently.
Identify the type of sampling used by the restaurant manager.
Write down the null and alternative hypotheses.
Find the $p$-value for the test.
State the conclusion of the test. Give a reason for your answer.
A researcher, Omar, studies whether the number of hours spent on meditation per week affects anxiety levels in employees. He collects data from 10 employees:
| Employee | Meditation Hours () | Anxiety Score () |
|---|---|---|
| 1 | 1 | 80 |
| 2 | 2 | 75 |
| 3 | 3 | 70 |
| 4 | 4 | 65 |
| 5 | 5 | 60 |
| 6 | 6 | 55 |
| 7 | 7 | 50 |
| 8 | 8 | 45 |
| 9 | 9 | 40 |
| 10 | 10 | 35 |
Omar assumes a linear model and tests if the sample mean meditation hours align with a company average of 5 hours.
Define what is meant by a criterion-related validity test in this context.
Calculate the Pearson's product-moment correlation coefficient, .
Justify whether meditation hours are a valid predictor of anxiety levels.
(i) Find the linear regression equation of on .
(ii) Interpret the value of in the context of the study.
Test if the sample mean meditation hours differ from the company average of 5 hours at the significance level. State the null and alternative hypotheses, the -value, and your conclusion.
| Diet Type | 11–13 | 14–15 | 16–17 | 18–19 |
|---|---|---|---|---|
| Vegetarian | 8 | 15 | 8 | 7 |
| Vegan | 20 | 10 | 8 | 6 |
| Meat | 14 | 20 | 6 | 3 |
State suitable hypotheses for a test for independence.
Explain why the last two columns of the table need to be combined.
Conduct a test for independence, using a 5% significance level. State your conclusion in context.
A dairy farmer, Sofia, believes the number of defective milk bottles in a batch of 10 follows a binomial distribution . She inspects 80 batches and records the results in the following table:
| Number of Defective Bottles | Number of Batches |
|---|---|
| 0 | 25 |
| 1 | 30 |
| 2 | 20 |
| 3 or more | 5 |
Calculate the mean number of defective bottles per batch and estimate the value of .
Conduct a chi-squared goodness-of-fit test at the significance level to determine if the binomial distribution is an appropriate model. State the hypotheses, expected frequencies, and your conclusion.
Explain why combining categories may be necessary when performing a chi-squared goodness-of-fit test.
Without performing further calculations, describe the effect on the degrees of freedom if the value of had been specified by the farmer beforehand rather than estimated from the data.
The standard deviation, , of a sample size 15 is 13.8. Find .