Number and Algebra
Functions
Geometry and Trigonometry
Statistics and Probability
Calculus
Calculate the Spearman rank correlation coefficient rsr_srs for the dataset including the outlier:
(1,3),(3,5),(4,7),(6,10),(7,12),(9,200).(1,3), (3,5), (4,7), (6,10), (7,12), (9,200).(1,3),(3,5),(4,7),(6,10),(7,12),(9,200).
Calculate the Spearman rank correlation coefficient rsr_srs for the dataset after removing the outlier:
(1,3),(3,5),(4,7),(6,10),(7,12).(1,3), (3,5), (4,7), (6,10), (7,12).(1,3),(3,5),(4,7),(6,10),(7,12).
Calculate the Pearson correlation coefficient rrr for the dataset after removing the outlier, i.e.
For the dataset without the outlier (1,3),(3,5),(4,7),(6,10),(7,12)(1,3), (3,5), (4,7), (6,10), (7,12)(1,3),(3,5),(4,7),(6,10),(7,12) calculate both the Pearson correlation rrr and the Spearman correlation rsr_srs. Compare and comment on the two values.
Calculate the Pearson correlation coefficient rrr for the dataset including the extreme outlier:
Calculate the percentage change in the Pearson correlation coefficient when the outlier (9,200)(9,200)(9,200) is removed, using rwith≈0.706r_{\text{with}}\approx0.706rwith≈0.706 and rwithout≈0.994r_{\text{without}}\approx0.994rwithout≈0.994.
Calculate and compare the Spearman correlation coefficient before and after removing the outlier. Quantify the change and discuss the impact of the outlier on rank-based association.
Calculate and compare the Pearson correlation coefficient before and after removing the outlier. Quantify the change and discuss the impact of the outlier on linear association.
For the dataset including the outlier (1,3),(3,5),(4,7),(6,10),(7,12),(9,200)(1,3), (3,5), (4,7), (6,10), (7,12), (9,200)(1,3),(3,5),(4,7),(6,10),(7,12),(9,200) calculate both the Pearson correlation rrr and the Spearman correlation rsr_srs. Compare and comment on the two values.
Explain why the Spearman rank correlation coefficient is less sensitive to the extreme outlier than the Pearson correlation coefficient, using mathematical reasoning based on ranks vs. aw values.
Compare the Pearson correlation coefficients for the three datasets:
(a) with outlier (9,200)(9,200)(9,200), (b) with reduced outlier (9,20)(9,20)(9,20), (c) without any outlier.
Comment on how each change in the outlier affects rrr.
For the modified dataset where the outlier’s yyy-value is reduced to 20, i.e.
(1,3),(3,5),(4,7),(6,10),(7,12),(9,20),(1,3), (3,5), (4,7), (6,10), (7,12), (9,20),(1,3),(3,5),(4,7),(6,10),(7,12),(9,20),
calculate the Pearson correlation coefficient rrr.
Previous
Question Type 3: Using parameter or algebraic forms of data when calculating with rank tables
Next
No next topic