Skip to main content

Bias and variance:

 Lets understand the concept of bias and variance. how we can define bias and variance is a major concern. lets understand with an example we have one datasets with x(independent variable),y(dependent) and y'(predicted) variable.

the difference between y and y' is called bias, so the actual value is 5 and we get 6 in predicted called bias.

bias is the error between the average model and the ground truth model.

the bias is the estimated function of the that tell the capacity of the model to predict the value.

high bias means:

the model is underfitting i.e-high error on both training and test data and model is simple.

variance is the average variability in the model prediction for the given datasets.

high variance means:

model is complex and over fitting model i.e less error on training data and high error on test data. so in this case need to work on noisy input.


upper both picture share the bais and lower is variance. left side has low and right side has high. 
this picture describes the underfitting and overfitting issue.



now increasing bias reduce variance but not always.

error-bias**2+variance and irreducible error.

best model has low variance and bias and need to compromise with bias and variance.

example with data is pending with video.



Comments

Popular posts from this blog

Feature engineering

 we all know machine learning model do not understand text data, so we need to transform the text data to number. i.e we need to transform the raw data to well prepared data to implement the machine learning model for prediction. the process to convert the raw data to make well prepared data is called feature engineering.

Evaluation metrics for Regression

 Welcome to this post, you fall in very good place and interesting part of machine learning implementation. Evaluation- how we feel with this word, definitely we are going to examine something, i .e out machine learning model. feels very responsible state of mind- going to check and correcting our model. when we are discussing about the machine learning, it divided into 3 parts supervised, unsupervised and reinforcement. we are not going to discuss in detail about this you can use other Resorces to learn or our other post. supervised machine learning is of two types regression and classification. we will learn to evaluate these two separately in this post. Evaluation metrics for Regression: when we have made the regression model where our target value is continuous, we use following method to evaluate the model: MAE: (mean absolute error) MSE: (mean square error) RMSE: (root mean square error) R squared error Adjusted R squared. before we discussed about these techniques for evalua...

Decision Tree

\we are discussion about decision tree which falls under supervised learning algorithm and used for classification and regression both. I Will try to cover to each and every topic regarding this algorithm and write python code to implement the same. before falling to discussion, we need understand this popular technique of machine learning. now the definition is: this tree like structure and a kind of predictive modeling approach. it has tree like structure upside down use to represent decision for decision making. this can handle high dimensional data with high accuracy. this tree also used to predict house price, car value and categorical data as well. the decisoion tree represent root node,terminal node,decision node and branches. sandhyakrishnana decion tree medium