Scoring on Test/Validation dataset requires offset column

Description

Motivation

For H2O models trained with an offset column it is necessary to provide offset column for scoring as well otherwise H2O throws an error. Even though in most cases omission of the offset column is likely a mistake there are use cases when user doesn't have the information at scoring time and still wants to make a prediction and post-process the prediction later.

Solution

If offset column is not available at scoring time, user needs to explicitly create a zero offset column. This will make the algorithms behave as if the offset was not specified.

Example:

References

Assignee

Michal Kurka

Reporter

Michal Kurka

Labels

None

Affected Spark version

None

AffectedContact

None

AffectedCustomers

None

AffectedPilots

None

AffectedOpenSource

None

Support Assessment

None

Customer Request Type

None

Support ticket URL

None

End date

None

Baseline start date

None

Baseline end date

None

Task progress

None

Task mode

None

Priority

Major
Configure