GLM build model: takes forever on a 100 GB file

Description

Steps to reproduce:

Build a GLM model with the following params:

Binomial
Variable importance enabled
Lambda search enabled
Strong rules enabled
Standardize enabled
Intercept enabled
Max iterations: 100

All other params are left at their defaults.
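
For anyone trying to reproduce this from a script, the settings above could be expressed roughly as follows. This is only a sketch: it assumes the h2o-3 Python client, which may not be the version or interface the report was filed against. `GLM_PARAMS`, `build_glm`, `train_path` and `response` are hypothetical names, not taken from the ticket. "Variable importance" and "Strong rules" are left out because the sketch does not assume matching argument names for them.

```python
# Sketch only: maps the ticket's GLM options onto h2o-3 Python argument names.
# "Variable importance" and "Strong rules" are intentionally omitted (no
# matching constructor argument is assumed here).
GLM_PARAMS = {
    "family": "binomial",      # Binomial
    "lambda_search": True,     # Lambda search enabled
    "standardize": True,       # Standardize enabled
    "intercept": True,         # Intercept enabled
    "max_iterations": 100,     # Max iterations: 100
}


def build_glm(train_path, response):
    """Train a GLM with the ticket's settings.

    train_path and response are placeholders; the ticket does not name
    the file or the response column.
    """
    # Imported lazily so GLM_PARAMS can be inspected without H2O installed.
    import h2o
    from h2o.estimators.glm import H2OGeneralizedLinearEstimator

    h2o.init()
    frame = h2o.import_file(train_path)
    # A binomial family needs a categorical (two-level) response column.
    frame[response] = frame[response].asfactor()
    predictors = [c for c in frame.columns if c != response]

    model = H2OGeneralizedLinearEstimator(**GLM_PARAMS)
    model.train(x=predictors, y=response, training_frame=frame)
    return model
```

Calling `build_glm("<path to the 100 GB file>", "<response column>")` and timing it would give a scripted baseline to compare against the behaviour described here.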

Data set: 100 GB file, 15K rows, 2200 cols

The model build gets slower and slower as it runs.

Assignee

New H2O Bugs

Reporter

Neeraja Madabhushi

Labels

None

CustomerVisible

No

testcase 1

None

testcase 2

None

testcase 3

None

h2ostream link

None

Affected Spark version

None

AffectedContact

None

AffectedCustomers

None

AffectedPilots

None

AffectedOpenSource

None

Support Assessment

None

Customer Request Type

None

Support ticket URL

None

End date

None

Baseline start date

None

Baseline end date

None

Task progress

None

Task mode

None

Components

Priority

Major