Introduction to Classification & Regression Trees (CART)
Decision Trees are commonly used in data mining with the objective of creating a model that predicts the value of a target (or dependent variable) based on the values of several input (or independent variables). In today's post, we discuss the CART decision tree methodology. The CART or Classification & Regression Trees methodology was introduced in 1984 by Leo Breiman , Jerome Friedman , Richard Olshen and Charles Stone as an umbrella term to refer to the following types of decision trees: Classification Trees : where the target variable is categorical and the tree is used to identify the "class" within which a target variable would likely fall into. Regression Trees : where the target variable is continuous and tree is used to predict it's value. The CART algorithm is structured as a sequence of questions, the answers to which determine what the next question, if any should be. The result of these questions is a tree like structure where the