We're using cookies, but you can turn them off in your browser settings. Otherwise, you are agreeing to our use of cookies. Learn more in our Privacy Policy

Evaluating Regression Model Fit and Interpreting Model Results

2024 Curriculum CFA Program Level II Quantitative Methods

Learning Outcomes

The member should be able to:
  • evaluate how well a multiple regression model explains the dependent variable by analyzing ANOVA table results and measures of goodness of fit
  • formulate hypotheses on the significance of two or more coefficients in a multiple regression model and interpret the results of the joint hypothesis tests
  • calculate and interpret a predicted value for the dependent variable, given the estimated regression model and assumed values for the independent variable

Summary

  • In multiple regression, adjusted R2 is used as a measure of model goodness of fit since it does not automatically increase as independent variables are added to the model. Rather, it adjusts for the degrees of freedom by incorporating the number of independent variables. 
  • Adjusted R2 will increase (decrease) if a variable is added to the model that has a coefficient with an absolute value of its t-statistic greater (less) than 1.0. 
  • Akaike’s information criterion (AIC) and Schwarz’s Bayesian information criteria (BIC) are also used to evaluate model fit and select the “best” model among a group with the same dependent variable. AIC is preferred if the purpose is prediction, BIC is preferred if goodness of fit is the goal, and lower values of both measures are better. 
  • Hypothesis tests of a single coefficient in a multiple regression, using t-tests, are identical to those in simple regression.
  • The joint F-test is used to jointly test a subset of variables in a multiple regression, where the “restricted” model is based on a narrower set of independent variables nested in the broader “unrestricted” model. The null hypothesis is that the slope coefficients of all independent variables outside the restricted model are zero. 
  • The general linear F-test is an extension of the joint F-test, where the null hypothesis is that the slope coefficients on all independent variables in the unrestricted model are equal to zero. 
  • Predicting the value of the dependent variable using an estimated multiple regression model is similar to that in simple regression. First, sum, for each independent variable, the estimated slope coefficient multiplied by the assumed value of that variable, and then add the estimated intercept coefficient. 
  • In multiple regression, the confidence interval around the forecasted value of the dependent variable reflects both model error and sampling error (from forecasting the independent variables); the larger the sampling error, the larger is the standard error of the forecast of Y and the wider is the confidence interval.
 

0.75 PL Credit

If you are a CFA Institute member don’t forget to record Professional Learning (PL) credit from reading this article.