Have you ever wondered how machine learning models are constructed? ‘Explainability of machine learning models’ and ‘machine learning models being a black-box’ is one of the most debated topics in model transparency. Today we will explore this and learn some quick techniques on how to find out which variables are influencing the model results and by how much.
I have generated a synthetic dataset with some variables that depict the statistics of football matches like number of goal scored, number of passes, ball possession %, Number of red or yellow cards etc. Using the dataset we will explore the following models:
- Decision Tree model
- Random Forest model
This will be the agenda for today:
- Train the decision tree model
- Train the random forest model
- Explore the influential variables in the models
- Find the threshold of the influential variables
So without further ado let’s get started..