Records from a pandas data frame got duplicated when importing into H2Oframe

Description

For an unknown reason in some cases records from a pandas data frame got dupplicated when the data frame is loaded into H2O with "h2o.H2OFrame(pandas_df)" from the Python API.
The bug seems to have some inherent randomness as the dupplication does not always happen.
Attached is a Jupyter workbook that gives a reproducible example together with the necessary system, python and package information as well as log files.

Assignee

New H2O Bugs

Fix versions

None

Reporter

Roberto Rösler

Support ticket URL

None

Labels

Affected Spark version

None

Customer Request Type

None

Task progress

None

CustomerVisible

Yes

Components

Affects versions

Priority

Blocker
Configure