user6903745 user6903745 - 23 days ago 8
Python Question

Retrieve list of training features names from classifier

Is there a way to retrieve the list of feature names used for training of a classifier, once it has been trained with the

fit
method? I would like to get this information before applying to unseen data.
The data used for training is a pandas
DataFrame
and in my case, the classifier is a
RandomForestClassifier
.

Answer

Based on the documentation and previous experience, there is no way to get a list of the features considered at least at one of the splitting.

Is your concern that you do not want to use all your features for prediction, just the ones actually used for training? In this case I suggest to list the feature_importances_ after fitting and eliminate the features that does not seem relevant. Then train a new model with only the relevant features and use those features for prediction as well.