Support parallel training (e.g. spark_apply in rsparkling, or Python/R)

Description

Feature request: fit a separate model per group in H2O using some type of distributed apply function.

Here’s an example of what it would look like using spark_apply:

The current workaround is to loop through the categories in Spark, pull back the Spark DataFrame for each category, and then fit a model on it.
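
To make the difference between the workaround and the requested behaviour concrete, here is a minimal sketch in plain Python. It does not use the H2O, Spark, or rsparkling APIs: `fit_mean_model` is a hypothetical stand-in for a real model fit, and `split_by_group`, `fit_sequential`, and `fit_parallel` are illustrative names, not existing functions. A thread pool stands in for a distributed apply.

```python
from collections import defaultdict
from concurrent.futures import ThreadPoolExecutor


def fit_mean_model(rows):
    """Hypothetical stand-in for a model fit: returns the mean of y."""
    ys = [y for _, y in rows]
    return sum(ys) / len(ys)


def split_by_group(records):
    """Split (category, x, y) records into {category: [(x, y), ...]}."""
    groups = defaultdict(list)
    for category, x, y in records:
        groups[category].append((x, y))
    return dict(groups)


def fit_sequential(records, fit=fit_mean_model):
    """Workaround pattern: loop over categories and fit one model at a time."""
    return {cat: fit(rows) for cat, rows in split_by_group(records).items()}


def fit_parallel(records, fit=fit_mean_model, max_workers=4):
    """Requested pattern: hand each group to a worker and fit concurrently."""
    groups = split_by_group(records)
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        futures = {cat: pool.submit(fit, rows) for cat, rows in groups.items()}
        return {cat: fut.result() for cat, fut in futures.items()}


# Made-up sample data: (category, x, y)
records = [("a", 1.0, 2.0), ("a", 2.0, 4.0), ("b", 1.0, 10.0)]
models_sequential = fit_sequential(records)
models_parallel = fit_parallel(records)
```

Both functions return one fitted model per category; the only difference is whether the groups are processed one after another or concurrently, which is the behaviour this ticket asks for at cluster scale.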

Assignee

Michal Kurka

Fix versions

Reporter

Joseph Granados

Support ticket URL

Labels

None

Affected Spark version

None

Customer Request Type

None

Task progress

None

CustomerVisible

Yes

Priority

Major