- IB
- SL 4.4—Pearsons, scatter diagrams, eqn of y on x
Practice SL 4.4—Pearsons, scatter diagrams, eqn of y on x with authentic IB Mathematics Analysis and Approaches (AA) exam questions for both SL and HL students. This question bank mirrors Paper 1, 2, 3 structure, covering key topics like functions and equations, calculus, complex numbers, sequences and series, and probability and statistics. Get instant solutions, detailed explanations, and build exam confidence with questions in the style of IB examiners.
A librarian records the number of books borrowed, , and the number of library visits, , by eight members over a month. The data are shown below.
| Books borrowed | Library visits |
|---|---|
| 2 | 1 |
| 4 | 2 |
| 6 | 3 |
| 8 | 4 |
| 10 | 5 |
| 12 | 6 |
| 14 | 7 |
| 16 | 8 |
Find Pearson's product-moment correlation coefficient, , and interpret its value in context.
Find the equation of the regression line on .
Estimate the number of library visits for a member who borrows 9 books.
Draw a scatter diagram of the data with the regression line.
A researcher studies the relationship between the number of hours, , spent studying per week and the average test score, , out of 100, for eight randomly selected students. The data are shown in the following table.
| Hours studying | Test score |
|---|---|
| 2 | 55 |
| 4 | 60 |
| 6 | 65 |
| 8 | 70 |
| 10 | 75 |
| 12 | 80 |
| 14 | 85 |
| 16 | 90 |
The relationship is modeled by the regression equation .
Write down the value of and .
Use the regression equation to estimate the test score for a student who studies for 9 hours per week.
Draw a scatter diagram of the data, including the regression line.
A teacher investigates the relationship between the number of practice questions, , completed by students and their score, , out of 50 , on a quiz. The data for nine students are shown below.
| Practice questions | Quiz score |
|---|---|
| 5 | 20 |
| 10 | 25 |
| 15 | 30 |
| 20 | 35 |
| 25 | 38 |
| 30 | 40 |
| 35 | 42 |
| 40 | 45 |
| 45 | 48 |
Calculate Pearson's product-moment correlation coefficient, r.
Find the equation of the regression line on .
Estimate the quiz score for a student who completes 22 practice questions.
State one limitation of using this regression line to predict the quiz score for a student who completes 60 practice questions.
A teacher records the number of pages read, , and the time taken, , in minutes, for six students completing a reading task. The data are shown below.
| Pages read | Time taken |
|---|---|
| 10 | 15 |
| 15 | 22 |
| 20 | 28 |
| 25 | 34 |
| 30 | 40 |
| 35 | 45 |
Calculate Pearson's product-moment correlation coefficient, , and interpret its value in context.
Find the equation of the regression line on .
Estimate the time taken to read 18 pages.
Interpret the slope of the regression line in context.
A farmer records the number of seeds planted, (in thousands), and the crop yield, (in kg ), for 10 fields. The data for and are shown below.
| 2.30 | 4.61 |
| 2.71 | 5.01 |
| 3.00 | 5.30 |
| 3.22 | 5.52 |
| 3.40 | 5.70 |
| 3.50 | 5.80 |
| 3.69 | 5.99 |
| 3.91 | 6.21 |
| 4.09 | 6.39 |
| 4.20 | 6.50 |
The relationship between and can be modeled by the regression equation . The relationship between and can be modeled as .
Find the equation of the regression line on .
Use the regression equation to estimate the crop yield when 15,000 seeds are planted.
Find the values of and in the model .
Calculate Pearson's product-moment correlation coefficient for and , and interpret its value in context.
If the farmer increases the number of seeds by in a field with 20,000 seeds, estimate the expected percentage increase in crop yield.
A store manager records the daily advertising budget, , in dollars, and the number of customers, , visiting the store over seven days. The data are shown below.
| Advertising budget | Customers |
|---|---|
| 50 | 20 |
| 100 | 25 |
| 150 | 30 |
| 200 | 35 |
| 250 | 40 |
| 300 | 45 |
| 350 | 50 |
Find the equation of the regression line on .
Write down the mean values and .
Draw a scatter diagram of the data, including the regression line and the point
Estimate the number of customers if the advertising budget is 400 dollars, and explain why this estimate may not be reliable.
A scientist studies the effect of temperature, , in degrees Celsius, on the reaction time, , in seconds, of a chemical process. The data for six experiments are shown below.
| Temperature | Reaction time |
|---|---|
| 10 | 8.0 |
| 15 | 7.5 |
| 20 | 6.8 |
| 25 | 6.2 |
| 30 | 5.5 |
| 35 | 5.0 |
Calculate Pearson's product-moment correlation coefficient, r.
Find the equation of the regression line on .
Estimate the reaction time at .
State one reason why the regression line may not be suitable for predicting the reaction time at .
A fitness coach records the number of hours, , spent training per week and the maximum heart rate, , in beats per minute (bpm), during a workout for seven athletes. The data are shown below.
| Training hours | Heart rate |
|---|---|
| 3 | 140 |
| 5 | 145 |
| 7 | 150 |
| 9 | 152 |
| 11 | 155 |
| 13 | 158 |
| 15 | 160 |
It is assumed that follow a bivariate normal distribution with product moment correlation coefficient .
(i) State suitable hypotheses and to test whether there is a correlation between training hours and heart rate, using a two-tailed test.
(ii) Calculate Pearson's product-moment correlation coefficient, , and interpret its value in context.
(iii) Using a significance level, state your conclusion in the context of the coach's study.
The regression line of on is given by . Estimate the heart rate for an athlete training 10 hours per week.
A researcher measures the daily sunlight hours, , and the growth rate of a plant, , in millimeters per day, over eight days. The data are shown below.
| Sunlight hours | Growth rate |
|---|---|
| 4 | 2.0 |
| 5 | 2.5 |
| 6 | 3.0 |
| 7 | 3.2 |
| 8 | 3.5 |
| 9 | 3.8 |
| 10 | 4.0 |
| 11 | 4.2 |
The relationship is modeled by the regression line .
Find the equation of the regression line on .
Calculate Pearson's product-moment correlation coefficient, r.
Draw a scatter diagram of the data with the regression line.
Estimate the growth rate for a day with 12 hours of sunlight, and comment on the reliability of this estimate.
A coffee shop owner records the daily temperature, , in degrees Celsius, and the number of iced coffees sold, , over six days. The data are shown below.
| Temperature | Iced coffees sold |
|---|---|
| 20 | 10 |
| 22 | 12 |
| 24 | 15 |
| 26 | 17 |
| 28 | 20 |
| 30 | 22 |
The relationship is modeled by the regression line .
Find the equation of the regression line on .
Estimate the number of iced coffees sold when the temperature is .
Draw a scatter diagram of the data with the regression line.