- IB
- Question Type 3: Identifying the appropriate variables for a given contextual information
Person A studies the effect of weekly junk food consumption on average resting heart rate. The dataset includes variables:
= number of candy bars eaten per week = hours of TV watched per week = cholesterol level (mg/dL) = average resting heart rate (bpm)
Identify the explanatory and response variables and justify your choice.
[4]A study is conducted to investigate the factors affecting the resting heart rate of a group of athletes. One of the explanatory variables, , is found to have a correlation coefficient of with the resting heart rate.
For the chosen variable with correlation coefficient , calculate the coefficient of determination and interpret its meaning in this context.
[3]A set of daily sugar intake data (in grams) over 10 days is given below:
Use the -score criterion () to determine whether the value should be considered an outlier. Show all your calculations.
[7]Simple linear regression and correlation coefficients.
Two explanatory variables and have correlation coefficients and with the response variable .
State which variable should be chosen for a simple linear regression model and justify your choice.
[3]Statistical modeling and control of confounding variables using multiple linear regression.
In a study investigating the relationship between weekly junk food servings and resting heart rate, daily exercise minutes are also recorded for each participant.
Identify the potential confounding variable and propose a mathematical method to control for it when modeling resting heart rate.
[5]Two candidate explanatory variables are weekly junk food servings and daily sugar intake (g) . The corresponding average resting heart rates (bpm) for five subjects are given by:
Subject A: ,
Subject B: ,
Subject C: ,
Subject D: ,
Subject E: ,
Compute the Pearson correlation coefficient for each candidate ( for and for ) and decide which variable shows the stronger linear association with .
[6]A study investigated the relationship between sugar intake, (in grams), and heart rate, (in beats per minute). The following summary statistics were obtained from the data:
The Pearson correlation coefficient between sugar intake and heart rate is .
Using the summary statistics provided, find the least-squares regression line predicting heart rate from sugar intake . Give the coefficients and to three significant figures.
[4]A dataset lists ages and corresponding resting heart rates . By excluding individuals with ages outside the range, calculate the mean resting heart rate of the remaining group.
[3]Weekly servings were recorded for week numbers with values . The study focuses only on weeks 10–15. Remove data outside this range and compute the average servings for the relevant weeks.
[3]The weekly junk food servings recorded over six weeks are . Identify the outlier by eye and calculate the mean weekly servings after removing this outlier.
[3]A dataset for a study records the following variables: weekly fast-food meals, daily water intake (), weekly junk food servings, subject age, and resting heart rate (bpm).
To analyze the effect of junk food consumption on resting heart rate, select the two relevant variables and justify why the others should be excluded.
[4]The distribution of weekly junk food servings is positively skewed. Propose a mathematical transformation to approximate normality before modeling and justify your choice.
[4]