Software Mining (ML, Testing) Notes Unit 2, 3
Software Mining (ML, Testing) Notes Unit 2, 3
//////////////////////////////////////////////////////////////////////////////////////////////////////////
NOT DONE BY MAM
/////////////////////////////////////////////////////////////////////////////////////////////////////////
MACHINE LEARNING ALGORITHM
5. Root Mean Squared Error
Root of Mean Squared Error (MSE) or root of the mean squared distances between
actual and predicted values.
Higher the RMSE the larger the deviation in actual and predicted value. Lower the
RMSE value the better the model is with its predictions.
Advantages of RMSE:
i) The value of MSE is same as output unit, which makes the interpretation of loss
easy.
Disadvantages of RMSE:
The logarithmic was introduced to interpret the relative difference between actual
and predicted value. To avoid natural log of possible 0 values, add 1 on both actual
and predicted values before taking logarithmic.
Advantages of MSLE:
i) Treats small differences between small actual and predicted values same as big
differences between large actual and predicted values.
Disadvantages of MSLE:
Linear Regression:
Linear Regression is a supervised machine learning algorithm which performs Regression by
plotting a straight line which best fits the data points.
Assumptions:
Assumes a linear relationship between the independent variable 'x', and the dependent variable 'y'
Assumes no correlation between the independent variables 'x' (Multicollinearity)
Assumes residuals have constant variance at every level of x (Homoscedasticity)
Assumes residuals of the model are normally distributed (Normality)
Assumes no pattern is formed when residuals are plotted
Advantages Disadvantages
Chi-square method
Kolmogorov-Smirnov
Shapirov-Wilk
Histogram
Quantile-Quantile Plot
Miu = mean
Assumtion