Shapley Values
This is another blog post in the series on model explainability. Here I will provide a brief description of Shapley values in the context of explaining outpu...
This is another blog post in the series on model explainability. Here I will provide a brief description of Shapley values in the context of explaining outpu...
According to Google Colab website,
Estimating permutation feature importances and plotting relationships between explanatory variables and model outputs by means of partial dependence plots ar...
Investigating feature importances for a developed model is a very important step in achieving the goal of interpretable machine learning. This not only allow...
The topic of model interpretability has gained a lot of attention recently with the rapid development of highly complex machine learning algorithms for deali...
In this blog post we are going to discuss a mock quantitative interview question by Jane Street. Suppose you are offered to play the following game: at the s...
Consider the following image.
Wikipedia defines Monte Carlo methods as “…a broad class of computational algorithms that rely on repeated random sampling to obtain numerical results.” The ...
I have recently finished watching and working through a series of lectures by David Silver on Reinforcement Learning that I found immensely useful. Throughou...
In this post, we cover the theory behind a discrete-time Kalman filter. Kalman filter is an algorithm that allows us to get a more precise information about ...
According to Wikipedia, gradient descent (ascent) is a first-order iterative optimization algorithm for finding a local minimum (maximum) of a differentiable...
Recently a model vetter has pointed out to a mistake that I committed when developing one of the models. The mistake was not of methodological type, rather i...
According to different sources, it is advisable that the data that is used to build a model be split into 3 datasets: training, validation and test. This is ...
This short blog post relates to addressing a problem of imbalanced datasets. An imbalanced dataset is a dataset where the classes are not approximately equal...
This is an introductory post about genetic algorithms (GAs) , which are a suite of methods of solving optimization problems. GAs form a subset of more genera...
This is the second part for the project about constructing a predictive model for the abalone dataset. In this post, we are going to fit 3 different regressi...
This is the first in a series of posts about constructing a predictive model, based on physical measurements, of age of abalone, where abalone referes to a g...