GLM doesn't correctly impute missing values on sparse datasets

Description

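GLM scoring can produce incorrect predictions on sparse data: when a column is stored in NA-sparse compressed chunks, missing values are not imputed correctly. Reported on Stack Overflow, where predictions changed depending on the number of rows scored: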

https://stackoverflow.com/questions/47404817/glm-model-h2o-predict-gives-very-different-results-depending-on-number-of-rows
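
To illustrate the expected behavior, here is a minimal sketch in plain Python, not H2O's actual code. It assumes an NA-sparse column can be modeled as a mapping from row index to stored value, with every absent row being an implicit NA, and that GLM scoring uses mean imputation. The function names (`score_dense`, `score_na_sparse`) and the data layout are hypothetical. The property to check is that scoring from the NA-sparse layout matches scoring from the equivalent dense layout.

```python
import math

def score_dense(rows, betas, intercept, means):
    """Linear predictor per row for dense input; NaN marks a missing value."""
    eta = []
    for row in rows:
        total = intercept
        for j, x in enumerate(row):
            if math.isnan(x):
                x = means[j]  # mean imputation (assumed strategy)
            total += betas[j] * x
        eta.append(total)
    return eta

def score_na_sparse(columns, n_rows, betas, intercept, means):
    """Linear predictor per row for NA-sparse input.

    columns[j] maps row index -> stored value (hypothetical layout).
    Rows absent from columns[j] are implicit NAs and must be imputed too.
    """
    eta = [intercept] * n_rows
    for j, col in enumerate(columns):
        for i in range(n_rows):
            x = col.get(i, math.nan)  # absent entry means NA, not zero
            if math.isnan(x):
                x = means[j]
            eta[i] += betas[j] * x
    return eta

nan = math.nan
dense_rows = [[2.0, nan], [nan, 4.0], [nan, 6.0]]
sparse_cols = [{0: 2.0}, {1: 4.0, 2: 6.0}]
betas, intercept, means = [0.5, 0.25], 1.0, [2.0, 5.0]

dense_eta = score_dense(dense_rows, betas, intercept, means)
sparse_eta = score_na_sparse(sparse_cols, 3, betas, intercept, means)
print(dense_eta == sparse_eta)
```

A regression test along these lines would compare predictions for the same rows under both storage layouts, and for different batch sizes, since the linked report describes results varying with the number of rows.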

Assignee

Michal Kurka

Fix versions

Reporter

Michal Kurka

Support ticket URL

None

Labels

None

Affected Spark version

None

Customer Request Type

None

Task progress

None

CustomerVisible

No

Components

Priority

Blocker