- IB
- SL 4.4—Pearsons, scatter diagrams, eqn of y on x
Practice SL 4.4—Pearsons, scatter diagrams, eqn of y on x with authentic IB Mathematics Analysis and Approaches (AA) exam questions for both SL and HL students. This question bank mirrors Paper 1, 2, 3 structure, covering key topics like functions and equations, calculus, complex numbers, sequences and series, and probability and statistics. Get instant solutions, detailed explanations, and build exam confidence with questions in the style of IB examiners.
A librarian records the number of books borrowed, , and the number of library visits, , by eight members over a month. The data are shown below.
| Books borrowed | Library visits |
|---|---|
| 2 | 1 |
| 4 | 2 |
| 6 | 3 |
| 8 | 4 |
| 10 | 5 |
| 12 | 6 |
| 14 | 7 |
| 16 | 8 |
Find Pearson's product-moment correlation coefficient, , and interpret its value in context.
Find the equation of the regression line on .
Estimate the number of library visits for a member who borrows 9 books.
Draw a scatter diagram of the data with the regression line.
A researcher studies the relationship between the number of hours, , spent studying per week and the average test score, , out of 100, for eight randomly selected students. The data are shown in the following table.
| Hours studying | Test score |
|---|---|
| 2 | 55 |
| 4 | 60 |
| 6 | 65 |
| 8 | 70 |
| 10 | 75 |
| 12 | 80 |
| 14 | 85 |
| 16 | 90 |
The relationship is modeled by the regression equation .
Write down the value of and of .
Use the regression equation to estimate the test score for a student who studies for 9 hours per week.
Draw a scatter diagram of the data, including the regression line.
A dataset records study-hours and test scores for eight students:
(a) Using technology, find (i) ; (ii) the regression line ; (iii) the regression line . [5]
Using technology, find (i) ; (ii) the regression line ; (iii) the regression line .
Predict at and find the residual for .
Using on , estimate for ; then invert to estimate when . Explain the difference.
Compute and interpret; comment on extrapolating to .
A biologist studies the relationship between the length of a fish, (in cm), and its weight, (in grams), for a sample of 10 fish. The data are shown below.
| Length | Weight |
|---|---|
| 20 | 150 |
| 25 | 200 |
| 30 | 270 |
| 35 | 350 |
| 40 | 450 |
| 45 | 570 |
| 50 | 700 |
| 55 | 850 |
| 60 | 1000 |
| 65 | 1200 |
It is assumed that follow a bivariate normal distribution with product moment correlation coefficient . The relationship can be modeled by the regression line .
(i) State suitable hypotheses and to test for a positive linear association between length and weight, using a one-tailed test.
(ii) Calculate Pearson's product-moment correlation coefficient, , and the corresponding -value at the significance level. Interpret the result in context.
Find the equation of the regression line on .
The regression line of on is given by . Find the coordinates of the point where the two regression lines intersect, and interpret this point in context.
Estimate the weight of a fish with a length of 70 cm , and explain why this estimate may be unreliable.
Draw a scatter diagram of the data with both regression lines and their intersection point.
A gardener records the amount of water, , in liters, given to a plant each week and the height of the plant, , in centimeters, after 10 weeks. The data for seven plants are shown below.
| Water | Height |
|---|---|
| 1 | 20 |
| 2 | 25 |
| 3 | 28 |
| 4 | 30 |
| 5 | 35 |
| 6 | 38 |
| 7 | 40 |
It is assumed that ( ) follow a bivariate normal distribution with product moment correlation coefficient .
(i) State suitable hypotheses and to test whether there is a correlation between the amount of water and plant height, using a two-tailed test.
(ii) Calculate Pearson's product-moment correlation coefficient, , for these data.
(iii) Using a significance level, state your conclusion in the context of the gardener's study.
(b) Comment on whether the regression line of on should be used to predict the height of a plant receiving 8 liters of water.
A nutrition researcher investigates the relationship between the amount of protein (in grams) in a breakfast meal and the time (in minutes) after which a person begins to feel hungry again. The following data were obtained from eight participants.
| Protein (g) | 10 | 14 | 18 | 21 | 25 | 28 | 32 | 36 |
|---|---|---|---|---|---|---|---|---|
| Hunger time (min) | 60 | 75 | 82 | 90 | 110 | 115 | 124 | 130 |
Using technology, find: (i) the mean and standard deviation of and ; (ii) the value of the Pearson product–moment correlation coefficient .
Find the equation of the regression line of on in the form .
A new breakfast bar contains 30 g of protein. Estimate, using your regression model, how long it will take for an average person to feel hungry again.
The researcher claims that hunger time increases by about 2.5 minutes for each additional gram of protein. Test this claim against your model and comment on whether it is supported.
Calculate the coefficient of determination, and interpret its meaning in context.
A teacher records the number of pages read, , and the time taken, , in minutes, for six students completing a reading task. The data are shown below.
| Pages read | Time taken |
|---|---|
| 10 | 15 |
| 15 | 22 |
| 20 | 28 |
| 25 | 34 |
| 30 | 40 |
| 35 | 45 |
Calculate Pearson's product-moment correlation coefficient, , and interpret its value in context.
Find the equation of the regression line on .
Estimate the time taken to read 18 pages.
Interpret the slope of the regression line in context.
A farmer records the number of seeds planted, (in thousands), and the crop yield, (in kg), for 10 fields. The data for and are shown below.
| 2.30 | 4.61 |
| 2.71 | 5.01 |
| 3.00 | 5.30 |
| 3.22 | 5.52 |
| 3.40 | 5.70 |
| 3.50 | 5.80 |
| 3.69 | 5.99 |
| 3.91 | 6.21 |
| 4.09 | 6.39 |
| 4.20 | 6.50 |
The relationship between and can be modeled by the regression equation . The relationship between and can be modeled as .
Find the equation of the regression line on .
Use the regression equation to estimate the crop yield when 15,000 seeds are planted.
Find the values of and in the model .
Calculate Pearson's product-moment correlation coefficient for and , and interpret its value in context.
If the farmer increases the number of seeds by in a field with 20,000 seeds, estimate the expected percentage increase in crop yield.
A store manager records the daily advertising budget, , in dollars, and the number of customers, , visiting the store over seven days. The data are shown below.
| Advertising budget | Customers |
|---|---|
| 50 | 20 |
| 100 | 25 |
| 150 | 30 |
| 200 | 35 |
| 250 | 40 |
| 300 | 45 |
| 350 | 50 |
Find the equation of the regression line on .
Write down the mean values and .
Draw a scatter diagram of the data, including the regression line and the point .
Estimate the number of customers if the advertising budget is 400 dollars, and explain why this estimate may not be reliable.
A scientist studies the effect of temperature, , in degrees Celsius, on the reaction time, , in seconds, of a chemical process. The data for six experiments are shown below.
| Temperature | Reaction time |
|---|---|
| 10 | 8.0 |
| 15 | 7.5 |
| 20 | 6.8 |
| 25 | 6.2 |
| 30 | 5.5 |
| 35 | 5.0 |
Calculate Pearson's product-moment correlation coefficient, .
Find the equation of the regression line on .
Estimate the reaction time at .
State one reason why the regression line may not be suitable for predicting the reaction time at .
Practice SL 4.4—Pearsons, scatter diagrams, eqn of y on x with authentic IB Mathematics Analysis and Approaches (AA) exam questions for both SL and HL students. This question bank mirrors Paper 1, 2, 3 structure, covering key topics like functions and equations, calculus, complex numbers, sequences and series, and probability and statistics. Get instant solutions, detailed explanations, and build exam confidence with questions in the style of IB examiners.
A librarian records the number of books borrowed, , and the number of library visits, , by eight members over a month. The data are shown below.
| Books borrowed | Library visits |
|---|---|
| 2 | 1 |
| 4 | 2 |
| 6 | 3 |
| 8 | 4 |
| 10 | 5 |
| 12 | 6 |
| 14 | 7 |
| 16 | 8 |
Find Pearson's product-moment correlation coefficient, , and interpret its value in context.
Find the equation of the regression line on .
Estimate the number of library visits for a member who borrows 9 books.
Draw a scatter diagram of the data with the regression line.
A researcher studies the relationship between the number of hours, , spent studying per week and the average test score, , out of 100, for eight randomly selected students. The data are shown in the following table.
| Hours studying | Test score |
|---|---|
| 2 | 55 |
| 4 | 60 |
| 6 | 65 |
| 8 | 70 |
| 10 | 75 |
| 12 | 80 |
| 14 | 85 |
| 16 | 90 |
The relationship is modeled by the regression equation .
Write down the value of and of .
Use the regression equation to estimate the test score for a student who studies for 9 hours per week.
Draw a scatter diagram of the data, including the regression line.
A dataset records study-hours and test scores for eight students:
(a) Using technology, find (i) ; (ii) the regression line ; (iii) the regression line . [5]
Using technology, find (i) ; (ii) the regression line ; (iii) the regression line .
Predict at and find the residual for .
Using on , estimate for ; then invert to estimate when . Explain the difference.
Compute and interpret; comment on extrapolating to .
A biologist studies the relationship between the length of a fish, (in cm), and its weight, (in grams), for a sample of 10 fish. The data are shown below.
| Length | Weight |
|---|---|
| 20 | 150 |
| 25 | 200 |
| 30 | 270 |
| 35 | 350 |
| 40 | 450 |
| 45 | 570 |
| 50 | 700 |
| 55 | 850 |
| 60 | 1000 |
| 65 | 1200 |
It is assumed that follow a bivariate normal distribution with product moment correlation coefficient . The relationship can be modeled by the regression line .
(i) State suitable hypotheses and to test for a positive linear association between length and weight, using a one-tailed test.
(ii) Calculate Pearson's product-moment correlation coefficient, , and the corresponding -value at the significance level. Interpret the result in context.
Find the equation of the regression line on .
The regression line of on is given by . Find the coordinates of the point where the two regression lines intersect, and interpret this point in context.
Estimate the weight of a fish with a length of 70 cm , and explain why this estimate may be unreliable.
Draw a scatter diagram of the data with both regression lines and their intersection point.
A gardener records the amount of water, , in liters, given to a plant each week and the height of the plant, , in centimeters, after 10 weeks. The data for seven plants are shown below.
| Water | Height |
|---|---|
| 1 | 20 |
| 2 | 25 |
| 3 | 28 |
| 4 | 30 |
| 5 | 35 |
| 6 | 38 |
| 7 | 40 |
It is assumed that ( ) follow a bivariate normal distribution with product moment correlation coefficient .
(i) State suitable hypotheses and to test whether there is a correlation between the amount of water and plant height, using a two-tailed test.
(ii) Calculate Pearson's product-moment correlation coefficient, , for these data.
(iii) Using a significance level, state your conclusion in the context of the gardener's study.
(b) Comment on whether the regression line of on should be used to predict the height of a plant receiving 8 liters of water.
A nutrition researcher investigates the relationship between the amount of protein (in grams) in a breakfast meal and the time (in minutes) after which a person begins to feel hungry again. The following data were obtained from eight participants.
| Protein (g) | 10 | 14 | 18 | 21 | 25 | 28 | 32 | 36 |
|---|---|---|---|---|---|---|---|---|
| Hunger time (min) | 60 | 75 | 82 | 90 | 110 | 115 | 124 | 130 |
Using technology, find: (i) the mean and standard deviation of and ; (ii) the value of the Pearson product–moment correlation coefficient .
Find the equation of the regression line of on in the form .
A new breakfast bar contains 30 g of protein. Estimate, using your regression model, how long it will take for an average person to feel hungry again.
The researcher claims that hunger time increases by about 2.5 minutes for each additional gram of protein. Test this claim against your model and comment on whether it is supported.
Calculate the coefficient of determination, and interpret its meaning in context.
A teacher records the number of pages read, , and the time taken, , in minutes, for six students completing a reading task. The data are shown below.
| Pages read | Time taken |
|---|---|
| 10 | 15 |
| 15 | 22 |
| 20 | 28 |
| 25 | 34 |
| 30 | 40 |
| 35 | 45 |
Calculate Pearson's product-moment correlation coefficient, , and interpret its value in context.
Find the equation of the regression line on .
Estimate the time taken to read 18 pages.
Interpret the slope of the regression line in context.
A farmer records the number of seeds planted, (in thousands), and the crop yield, (in kg), for 10 fields. The data for and are shown below.
| 2.30 | 4.61 |
| 2.71 | 5.01 |
| 3.00 | 5.30 |
| 3.22 | 5.52 |
| 3.40 | 5.70 |
| 3.50 | 5.80 |
| 3.69 | 5.99 |
| 3.91 | 6.21 |
| 4.09 | 6.39 |
| 4.20 | 6.50 |
The relationship between and can be modeled by the regression equation . The relationship between and can be modeled as .
Find the equation of the regression line on .
Use the regression equation to estimate the crop yield when 15,000 seeds are planted.
Find the values of and in the model .
Calculate Pearson's product-moment correlation coefficient for and , and interpret its value in context.
If the farmer increases the number of seeds by in a field with 20,000 seeds, estimate the expected percentage increase in crop yield.
A store manager records the daily advertising budget, , in dollars, and the number of customers, , visiting the store over seven days. The data are shown below.
| Advertising budget | Customers |
|---|---|
| 50 | 20 |
| 100 | 25 |
| 150 | 30 |
| 200 | 35 |
| 250 | 40 |
| 300 | 45 |
| 350 | 50 |
Find the equation of the regression line on .
Write down the mean values and .
Draw a scatter diagram of the data, including the regression line and the point .
Estimate the number of customers if the advertising budget is 400 dollars, and explain why this estimate may not be reliable.
A scientist studies the effect of temperature, , in degrees Celsius, on the reaction time, , in seconds, of a chemical process. The data for six experiments are shown below.
| Temperature | Reaction time |
|---|---|
| 10 | 8.0 |
| 15 | 7.5 |
| 20 | 6.8 |
| 25 | 6.2 |
| 30 | 5.5 |
| 35 | 5.0 |
Calculate Pearson's product-moment correlation coefficient, .
Find the equation of the regression line on .
Estimate the reaction time at .
State one reason why the regression line may not be suitable for predicting the reaction time at .