java.lang.ArrayIndexOutOfBoundsException while training an H2OKMeansEstimator

Description

kmeans Model Build progress: |█████████████████████████████████████ (failed) 85%
---------------------------------------------------------------------------
EnvironmentError Traceback (most recent call last)
<ipython-input-48-42e9d52776a3> in <module>()
1 mjuly16 = H2OKMeansEstimator(model_id='model_niche_july16v7_v2_estimate_k', estimate_k=True, k=8, standardize=True, init="PlusPlus",nfolds=4,max_iterations=20,ignored_columns=ignored_columns)
----> 2 mjuly16.train(x=x, training_frame=data_h2o_july16)
3 predjuly16 = mjuly16.predict(data_h2o_july16)
4 mjuly16

/opt/sparkling-water/2.1.2/py/build/dist/h2o_pysparkling_2.1-2.1.2-py2.7.egg/h2o/estimators/estimator_base.pyc in train(self, x, y, training_frame, offset_column, fold_column, weights_column, validation_frame, max_runtime_secs, ignored_columns, model_id)
202 return
203
--> 204 model.poll()
205 model_json = h2o.api("GET /%d/Models/%s" % (rest_ver, model.dest_key))["models"][0]
206 self._resolve_model(model.dest_key, model_json)

/opt/sparkling-water/2.1.2/py/build/dist/h2o_pysparkling_2.1-2.1.2-py2.7.egg/h2o/job.pyc in poll(self)
71 if (isinstance(self.job, dict)) and ("stacktrace" in list(self.job)):
72 raise EnvironmentError("Job with key {} failed with an exception: {}\nstacktrace: "
---> 73 "\n{}".format(self.job_key, self.exception, self.job["stacktrace"]))
74 else:
75 raise EnvironmentError("Job with key %s failed with an exception: %s" % (self.job_key, self.exception))

EnvironmentError: Job with key $0300ffffffff$_b00ff256ac377dd34bf61730b02e4470 failed with an exception: java.lang.ArrayIndexOutOfBoundsException: 6
stacktrace:
java.lang.ArrayIndexOutOfBoundsException: 6
at water.util.ArrayUtils.add(ArrayUtils.java:153)
at hex.ModelMetricsClustering$MetricBuilderClustering.reduce(ModelMetricsClustering.java:131)
at hex.ModelMetricsClustering$MetricBuilderClustering.reduce(ModelMetricsClustering.java:80)
at hex.ModelBuilder.cv_mainModelScores(ModelBuilder.java:491)
at hex.ModelBuilder.computeCrossValidation(ModelBuilder.java:288)
at hex.ModelBuilder$1.compute2(ModelBuilder.java:203)
at water.H2O$H2OCountedCompleter.compute(H2O.java:1192)
at jsr166y.CountedCompleter.exec(CountedCompleter.java:468)
at jsr166y.ForkJoinTask.doExec(ForkJoinTask.java:263)
at jsr166y.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:974)
at jsr166y.ForkJoinPool.runWorker(ForkJoinPool.java:1477)
at jsr166y.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:104)

Assignee

New H2O Bugs

Fix versions

None

Reporter

Avkash Chauhan

Support ticket URL

None

Labels

None

Affected Spark version

None

Customer Request Type

Support Incident

Task progress

None

ReleaseNotesHidden

None

CustomerVisible

No

Support Assessment

Data Science Issue

AffectedCustomers

Priority

Major
Configure