Performance metrics when using balance_classes #15518
Unanswered
hasithjp
asked this question in
Technical Notes
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Motivation
When balance_classes is used in H2O modeling, the performance metrics will not match the performance metrics constructed during training.
An example is shown below:
Solution
When balance_classes is enabled in H2O modeling, the model is built on a balanced version of the training data frame. The performance metrics constructed during training are based on this balanced version of the training data frame. Therefore, when performance is calculated on the unbalanced training data frame, the metrics will be different.
To determine the performance metrics on the unbalanced training data frame use the following:
h2o.performance(gbm_model, newdata = training_frame)
JIRA Issue Migration Info
Jira Issue: TN-9
Assignee: Megan Kurka
Reporter: Megan Kurka
State: Open
Beta Was this translation helpful? Give feedback.
All reactions