Skip to main content

Cross_Validation in machine learning.

 we all are aware about the machine learning steps that we follows. one very important step is to check the performance of the machine learning model in unseen data(not used while training).

This is called cross validation techniques.


to understand this in detail please cover till model selection. so now we are in cross validation step. what we are going to do. we are all aware that we do train test split. what is this, i am not going to teach you in detail here but yes giving you the rough idea about this. we divide the complete datasets into 2 parts. before model selection or u can say model fitment. these two parts are training datasets and testing datasets. in training datasets, we do fit the machine learning model and same we do testing in test datasets to check either machine learning model working fine with datasets or not.

it is not possible to complete the cross validation without video so i have created one video on cross validation that cover all types and parts in cross validation.

here i am discussing about process that we follow in machine learning so that we come to know where to use cross validation and why to use.

1-data gathering.

2-joinig, merging, cleaning etc.

3-EDA and feature engineering.

4-separate data in dependent and independent variables. i.e x and y

5-spliting datasets into training and testing.

6-seleting of machine learning and fit into training datasets.

7-model evaluation according to regression and classification datasets.

8-check accuracy score.( here we use cross validation and check score)

so, what actually we do here, as name suggest cross- means randomly and validation means verify.

lets, understand with examples-we have 100 record datasets for evaluation and got accuracy score is 95.7. 






Comments

Popular posts from this blog

Feature engineering

 we all know machine learning model do not understand text data, so we need to transform the text data to number. i.e we need to transform the raw data to well prepared data to implement the machine learning model for prediction. the process to convert the raw data to make well prepared data is called feature engineering.

Evaluation metrics for Regression

 Welcome to this post, you fall in very good place and interesting part of machine learning implementation. Evaluation- how we feel with this word, definitely we are going to examine something, i .e out machine learning model. feels very responsible state of mind- going to check and correcting our model. when we are discussing about the machine learning, it divided into 3 parts supervised, unsupervised and reinforcement. we are not going to discuss in detail about this you can use other Resorces to learn or our other post. supervised machine learning is of two types regression and classification. we will learn to evaluate these two separately in this post. Evaluation metrics for Regression: when we have made the regression model where our target value is continuous, we use following method to evaluate the model: MAE: (mean absolute error) MSE: (mean square error) RMSE: (root mean square error) R squared error Adjusted R squared. before we discussed about these techniques for evalua...

Decision Tree

\we are discussion about decision tree which falls under supervised learning algorithm and used for classification and regression both. I Will try to cover to each and every topic regarding this algorithm and write python code to implement the same. before falling to discussion, we need understand this popular technique of machine learning. now the definition is: this tree like structure and a kind of predictive modeling approach. it has tree like structure upside down use to represent decision for decision making. this can handle high dimensional data with high accuracy. this tree also used to predict house price, car value and categorical data as well. the decisoion tree represent root node,terminal node,decision node and branches. sandhyakrishnana decion tree medium