Dataset with less yes (6%) and more No (94%). All classifier algorithms (ANN, C4, CART) in SPSS predicts all values as No in test set. What do i do?
The data has around 2500 rows and 85 columns.
Look into oversampling techniques as for example implemented in R by the
SMOTE function in the
The SMOTE function oversamples your rare event by using bootstrapping and k-nearest neighbor to synthetically create additional observations of that event. The definition of rare event is usually attributed to any outcome/dependent/target/response variable that happens less than 15% of the time.