MAT 243 Project Three Summary Report

[Full Name]

[SNHU Email]

Southern New Hampshire University

Notes:

â€¢ Replace the bracketed text on page one (the cover page) with your personal information.

â€¢ You will use your selected team for all three projects

1. Introduction

Discuss the statement of the problem in terms of the statistical analyses that are being performed. Be

sure to address the following:

â€¢

â€¢

â€¢

What is the data set that you are exploring?

How will your results be used?

What type of analyses will you be running in this project?

Answer the questions in a paragraph response. Remove all questions and this note before

submitting! Do not include Python code in your report.

2. Data Preparation

There are some important variables that are used in this project. Identify and explain these variables.

See the introductory section and Step 1 of the Python script to address the following:

â€¢

â€¢

What does the variable avg_pts_differential represent? How would you explain it to someone

who does not understand the data?

What does the variable avg_elo_n represent? How would you explain it to someone who does

not understand the data?

Answer the questions in a paragraph response. Remove all questions and this note before

submitting! Do not include Python code in your report.

3. Scatterplot and Correlation for the Total Number of Wins and Average Relative Skill

You constructed a scatterplot of the total number of wins and the average relative skill to study their

correlation. You also calculated the Pearson correlation coefficient along with its P-value.

See Step 2 in the Python script to address the following items:

â€¢

â€¢

â€¢

â€¢

In general, how are data visualization techniques used to study relationship trends between two

variables?

How is the correlation coefficient used to get the strength and direction of the association

between two variables?

In this activity, you generated a scatterplot of the total number of wins and the average relative

skill. Include a screenshot of this plot in your report.

What do the scatterplot and the Pearson correlation coefficient tell you about the association

between total number of wins and average relative skill?

â€¢

Is the correlation coefficient statistically significant based on the P-value? Use a 1% level of

significance.

Answer the questions in a paragraph response. Remove all questions and this note before

submitting! Do not include Python code in your report.

4. Simple Linear Regression: Predicting the Total Number of Wins using Average Relative Skill

You created a simple linear regression model for the total number of wins in a regular season using the

average relative skill as the predictor variable.

See Step 3 in the Python script to address the following items:

â€¢

â€¢

â€¢

In general, how is a simple linear regression model used to predict the response variable using

the predictor variable?

What is the equation for your model?

What are the results of the overall F-test? Summarize all important steps of this hypothesis test.

This includes:

a. Null Hypothesis (statistical notation and its description in words)

b. Alternative Hypothesis (statistical notation and its description in words)

c. Level of Significance

d. Report the test statistic and the P-value in a formatted table as shown below:

Table 1: Hypothesis Test for the Overall F-Test

Statistic

Test Statistic

P-value

â€¢

â€¢

â€¢

Value

X.XX

*Round off to 2 decimal places.

X.XXXX

*Round off to 4 decimal places.

e. Conclusion of the hypothesis test and its interpretation based on the P-value

Based on the results of the overall F-test, can average relative skill predict the total number of

wins in the regular season?

What is the predicted total number of wins in a regular season for a team that has an average

relative skill of 1550? Round your answer down to the nearest integer.

What is the predicted number of wins in a regular season for a team that has an average relative

skill of 1450? Round your answer down to the nearest integer.

Answer the questions in a paragraph response. Remove all questions and this note (but not the

table) before submitting! Do not include Python code in your report.

5. Scatterplot and Correlation for the Total Number of Wins and Average Points Scored

You constructed a scatterplot of total number of wins and average points scored. You also calculated the

Pearson correlation coefficient along with its P-value.

See Step 4 in the Python script to answer the following questions:

â€¢

â€¢

â€¢

In this activity, you generated a scatterplot of the total number of wins and average points

scored. Include a screenshot of this plot in your report.

What do the scatterplot and the Pearson correlation coefficient tell you about the association

between total number of wins and average points scored?

Is the correlation coefficient statistically significant based on the P-value? Use a 1% level of

significance.

Answer the questions in a paragraph response. Remove all questions and this note before

submitting! Do not include Python code in your report.

6. Multiple Regression: Predicting the Total Number of Wins using Average Points Scored and Average

Relative Skill

You created a multiple regression model with the total number of wins as the response variable, with

average points scored and average relative skill as predictor variables.

See Step 5 in the Python script to answer the following questions:

â€¢

â€¢

â€¢

In general, how is a multiple linear regression model used to predict the response variable using

predictor variables?

What is the equation for your model?

What are the results of the overall F-test? Summarize all important steps of this hypothesis test.

This includes:

a. Null Hypothesis (statistical notation and its description in words)

b. Alternative Hypothesis (statistical notation and its description in words)

c. Level of Significance

d. Report the test statistic and the P-value in a formatted table as shown below:

Table 2: Hypothesis Test for the Overall F-Test

Statistic

Test Statistic

P-value

â€¢

â€¢

â€¢

Value

X.XX

*Round off to 2 decimal places.

X.XXXX

*Round off to 4 decimal places.

e. Conclusion of the hypothesis test and its interpretation based on the P-value

Based on the results of the overall F-test, is at least one of the predictors statistically significant

in predicting the total number of wins in the season?

What are the results of individual t-tests for the parameters of each predictor variable? Is each

of the predictor variables statistically significant based on its P-value? Use a 1% level of

significance.

Report and interpret the coefficient of determination.

â€¢

â€¢

What is the predicted total number of wins in a regular season for a team that is averaging 75

points per game with a relative skill level of 1350?

What is the predicted total number of wins in a regular season for a team that is averaging 100

points per game with an average relative skill level of 1600?

Answer the questions in a paragraph response. Remove all questions and this note (but not the

table) before submitting! Do not include Python code in your report.

7. Multiple Regression: Predicting the Total Number of Wins using Average Points Scored, Average

Relative Skill, Average Points Differential, and Average Relative Skill Differential

You created a multiple regression model with the total number of wins as the response variable, with

average points scored, average relative skill, average points differential, and average relative skill

differential as predictor variables.

See Step 6 in the Python script to answer the following questions:

â€¢

â€¢

â€¢

In general, how is a multiple linear regression model used to predict the response variable using

predictor variables?

What is the equation for your model?

What are the results of the overall F-test? Summarize all important steps of this hypothesis test.

This includes:

a. Null Hypothesis (statistical notation and its description in words)

b. Alternative Hypothesis (statistical notation and its description in words)

c. Level of Significance

d. Report the test statistic and the P-value in a formatted table as shown below:

Table 3: Hypothesis Test for Overall F-Test

Statistic

Test Statistic

P-value

â€¢

â€¢

â€¢

â€¢

Value

X.XX

*Round off to 2 decimal places.

X.XXXX

*Round off to 4 decimal places.

e. Conclusion of the hypothesis test and its interpretation based on the P-value

Based on the results of the overall F-test, is at least one of the predictors statistically significant

in predicting the number of wins in the season?

What are the results of individual t-tests for the parameters of each predictor variable? Is each

of the predictor variables statistically significant based on its P-value? Use a 1% level of

significance.

Report and interpret the coefficient of determination.

What is the predicted total number of wins in a regular season for a team that is averaging 75

points per game with a relative skill level of 1350, average point differential of -5 and average

relative skill differential of -30?

â€¢

What is the predicted total number of wins in a regular season for a team that is averaging 100

points per game with a relative skill level of 1600, average point differential of +5 and average

relative skill differential of +95?

Answer the questions in a paragraph response. Remove all questions and this note (but not the

table) before submitting! Do not include Python code in your report.

8. Conclusion

Describe the results of the statistical analyses clearly, using proper descriptions of statistical terms and

concepts. Fully describe what these results mean for your scenario.

â€¢

â€¢

Briefly summarize your findings in plain language.

What is the practical importance of the analyses that were performed?

9. Citations

You were not required to use external resources for this report. If you did not use any resources, you

should remove this entire section. However, if you did use any resources to help you with your

interpretation, you must cite them. Use proper APA format for citations.

Insert references here in the following format:

Author’s Last Name, First Initial. Middle Initial. (Year of Publication). Title of book: Subtitle of book,

edition. Place of Publication: Publisher.

Purchase answer to see full

attachment