Duplicate rows added in conversion to H2OFrame for large data sets

Description

Duplicate rows are sometimes added when converting large pandas dataframe with large values to h2o frame. This was reproducable with the below code on windows 7 PC (Python: 3.6.7 final, h2o: 3.26.0.9) and windows 10 laptop (Python: 3.6.8 final, h2o: 3.18.0.2) but not on Docker container running on Azure cloud VM (Docker: jupyter/scipy-notebook, Python: 3.7.3 final, h2o: 3.28.0.1). Problem persisted after upgrading h2o to 3.28.0.3.

Assignee

New H2O Bugs

Fix versions

None

Reporter

Michael McCartney

Support ticket URL

None

Labels

Affected Spark version

None

Customer Request Type

None

Task progress

None

CustomerVisible

No

Components

Affects versions

Priority

Major
Configure