# statistic case


(2) Regression Analysis
1. The graphs we built show that the distributions of Opening and Budget are not normal. The Opening distribution is right-skewed and has two outliers, and the Budget distribution is right-skewed as well. These two variables might have contributed to the outliers and the right skew in the response variable, US Revenue. Yes, this should worry us, because outliers pull the regression line toward themselves and distort the regression results.
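
As a rough numerical check of the skew and outliers we read off the graphs, a minimal Python sketch could look like the following. The numbers are made up for illustration and are not the case data, and the function names `sample_skewness` and `iqr_outliers` are our own. The 1.5 × IQR rule is one common convention for flagging outliers, not necessarily the one the plotting software used.

```python
import statistics


def sample_skewness(values):
    """Moment-based skewness m3 / m2**1.5; a positive value means right skew."""
    n = len(values)
    mean = statistics.fmean(values)
    m2 = sum((v - mean) ** 2 for v in values) / n
    m3 = sum((v - mean) ** 3 for v in values) / n
    return m3 / m2 ** 1.5


def iqr_outliers(values, k=1.5):
    """Return the values that fall outside [Q1 - k*IQR, Q3 + k*IQR]."""
    q1, _, q3 = statistics.quantiles(values, n=4)
    iqr = q3 - q1
    low, high = q1 - k * iqr, q3 + k * iqr
    return [v for v in values if v < low or v > high]


# Hypothetical opening-weekend figures (assumption, NOT the case data).
opening = [5, 8, 9, 11, 12, 14, 15, 18, 21, 95, 120]

print("skewness:", sample_skewness(opening))
print("outliers:", iqr_outliers(opening))
```

On these made-up numbers the skewness is positive and the rule flags the two largest values, which is the same pattern we saw for Opening.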

2.
Correlation matrix:
According to the pair plot and correlation matrix we built for the variables, US Revenue appears to depend linearly on Budget and Opening, and much less on Opinion. However, a transformation might be needed for the relationship between US Revenue and Theaters. Nothing else unusual stands out.
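
To show how a correlation matrix like the one we used can be computed, here is a small Python sketch based on the Pearson correlation coefficient. The column values are invented for illustration (they are not the case data), and `pearson` and `correlation_matrix` are hypothetical helper names.

```python
import math


def pearson(x, y):
    """Pearson correlation coefficient between two equal-length lists."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def correlation_matrix(columns):
    """Map each pair of column names to its Pearson correlation."""
    names = list(columns)
    return {a: {b: pearson(columns[a], columns[b]) for b in names} for a in names}


# Invented values (assumption, NOT the case data).
data = {
    "US Revenue": [20, 35, 50, 80, 150, 210],
    "Budget": [10, 20, 30, 45, 90, 120],
    "Opinion": [6.1, 7.4, 5.2, 6.8, 5.9, 7.0],
}

matrix = correlation_matrix(data)
for name, row in matrix.items():
    print(name, {other: round(r, 3) for other, r in row.items()})
```

A value near 1 or -1 suggests a strong linear relationship, while a value near 0 suggests little linear dependence. It does not rule out a nonlinear one, which is why we also looked at the pair plot.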

3.
Multiple linear regression results:
Dependent Variable: US Revenue
Independent Variable(s): Budget, Opening, Theaters, Opinion
Parameter estimates:

Analysis of variance table for multiple regression model:

Summary of fit:
Root MSE: 15.69288
R-squared: 0.9807
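
For reference, the two summary-of-fit numbers are usually defined as Root MSE = sqrt(SSE / (n - k - 1)) and R-squared = 1 - SSE / SST, where k is the number of predictors. The Python sketch below computes both from observed and fitted values. We are assuming these standard definitions match what the software reported. The observed and fitted values are invented for illustration and do not reproduce the figures above, and `summary_of_fit` is a hypothetical helper name.

```python
import math


def summary_of_fit(observed, fitted, n_predictors):
    """Return Root MSE and R-squared for a fitted regression model."""
    n = len(observed)
    mean_y = sum(observed) / n
    # Error sum of squares: variation left unexplained by the model.
    sse = sum((y - f) ** 2 for y, f in zip(observed, fitted))
    # Total sum of squares: variation of the response around its mean.
    sst = sum((y - mean_y) ** 2 for y in observed)
    df_error = n - n_predictors - 1
    return {
        "root_mse": math.sqrt(sse / df_error),
        "r_squared": 1 - sse / sst,
    }


# Invented observed and fitted values (assumption, NOT the case data).
observed = [10, 20, 30, 40, 50, 60]
fitted = [12, 18, 31, 39, 52, 58]

fit = summary_of_fit(observed, fitted, n_predictors=1)
print(fit)
```

Root MSE is in the units of the response, so it can be read as a typical size of the prediction error, while R-squared is the share of the variation in the response that the model explains.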