A Study on Influence of Predictor Multicollinearity on Performance of the Stepwise Regression Prediction Equation

PDF

  • The prediction accuracy of the traditional stepwise regression prediction equation (SRPE) is affected by the multicollinearity among its predictors. This paper introduces the condition number analysis into the prediction modeling to minimize the multicollinearity in the SRPE. In the condition number prediction modeling, the condition number is used to select the combination of predictors with the lowest multicollinearity from the possible combinations of a number of candidate predictors (variables), and the selected combina- tion is then used to construct the condition number regression prediction equation (CNRPE). This novel prediction modeling is performed in typhoon track prediction, which is a difficult task among meteorological disaster predictions. Six pairs of typhoon track latitude/longitude SRPEs and CNRPEs for July, August, and September are built by employing the traditional and the novel prediction modeling approaches, respectively, and by using a large number of identical modeling samples. The comparative analysis indicates that under the condition of the same candidate predictors (variables) and predictands (dependent variables),although the fitting accuracy of the novel prediction models used for the historical samples of South China Sea (SCS) typhoon tracks is slightly lower than that of the traditional prediction models, the prediction accuracy for the independent samples is obviously improved, with the averaged prediction error of the novel models for July, August, and September being 153.9 km, which is 75.3 km smaller than that of the traditional models (a reduction of 33%). This is because the novel prediction modeling effectively minimizes the multicollinearity by computation and analysis of the condition number. It is shown further that when F =1.0, 2.0, and 3.0, the average prediction errors of the traditional SRPEs are obviously larger than those of the CNRPEs. Moreover, extremely large and unreasonable prediction errors occur at some individual points of the typhoon track predicted by the SRPEs due to the multicollinearity existing in the combination of predictors.
  • loading

Catalog

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return