Categorical value out of bounds error when calling model using Python

Description

Have tried h2o xgboost with the latest two versions (including 3.22.1.5), both give error in predictions “Categorical value out of bounds” (see attached figures for details) mainly because the categorical value when scoring is not seen during training. I have been using 3.20.0.2, which does not have this issue.

I saw a similar issue using AutoML in R language, from here. https://0xdata.atlassian.net/browse/PUBDEV-6266

Hope this can be fixed soon. Thanks!

Assignee

Michal Kurka

Fix versions

Reporter

Yan Gao

Support ticket URL

None

Labels

Affected Spark version

None

Customer Request Type

None

Task progress

None

CustomerVisible

Yes

Components

Affects versions

Priority

Major
Configure