Open issues

Integration Testing for different variation of Spark
SW-179
Add support for Spark Dynamic Allocation
SW-556
Test Sparkling Water on Standalone Cluster
SW-106
Sparkling shell: newly created classes are not visible by remote nodes
SW-36
Use downloadLogs method from H2O and remove relevant methods on Sparkling Water side
SW-1430
Add Target Encoding to Sparkling Water Python API
SW-1425
Upgrade to H2O 3.26.0.2
SW-1424
Add H2O-3 KMeans to Sparkling Water
SW-1423
Add automated tests to test h2o native hive in sparkling water environment
SW-1416
Introduce back Jenkinsfile-internal-hadoop-smoke and Spark nightly tests
SW-1410
Distinguish between classification and regression mode for all H2O algorithms.
SW-1407
Switch to scala formatter
SW-1391
In release pipeline, allow to specify output directory for extended H2O jars
SW-1377
Add conda-forge to unblock release of 2.1 to conda
SW-1376
MOJO depploymet package
SW-1368
Be able to annotate jira with flag indicating the change should not be mentioned in release notes
SW-1366
Remove h2o wrapper around SVM from sparkling water
SW-1354
Expose option to pick a given certificate from a keystore to SW users
SW-1324
Remove jetty relocation
SW-1319
Add example to use word2vec in Scala
SW-1315
expose Tokenize function in Scala
SW-1314
Benchmarks: Compare Performance of Datasets Loaded With Apache Spark vs. H2O-3
SW-1297
Benchmarks: Compare Local vs. YARN deployment
SW-1296
Benchmarks: Compare Internal vs. External Backend
SW-1295
Benchmarks: Testing Applications
SW-1293
Benchmarks
SW-1291
Consider Apache Arrow serialization for h2o frames
SW-1272
Convert spark df to H2Oframe failed: java.lang.ArrayIndexOutOfBoundsException: 65535
SW-1266
XGBoost: RemoteDisconnected('Remote end closed connection without response',) in Azure Databricks
SW-1265
Explore exposing Spark DBscan into sparkling water
SW-1260
Start running hadoop smoke tests
SW-1253
Reconsider the role and naming conventions of the 'columnsToCategorical' and 'allStringColumnsToCategorical' properties
SW-1231
Add Target Encoding to Sparkling Water Scala API
SW-1207
Have Parity between GBM Parameters in Sparkling Water Scala API and R/Python API
SW-1206
Fix formating in pysparkling package
SW-1205
Generate Sparkling Water Pipelines wrappers automatically
SW-1201
Better handling of a Spark SQL logic leading to execution of broadcast joins
SW-1192
Use H2OMojoModel to load mojo & mojo pipelines as well, slowly deprecating publicly visible H2OMOJOPipelineModel
SW-1187
Think about num of partitions during Spark Frame -> H2O Frame
SW-1185
Delete h2o model after fitting Spark pipeline/stage
SW-1157
Loophole in H2O authentication with Sparkling water
SW-1151
Better sparse Spark Row to H2O Row conversion for MOJO predict
SW-1142
Enable mojo named columns for predictions on normal mojos to be consistent with DAI mojos
SW-1137
Look into testing SW on Kubeflow & Kubernetes
SW-1132
Long term stabilization ideas
SW-1128
Deploying client configuration fails h2o external cluster
SW-1126
Create Sparkling Water distribution package also for other hadoop versions supported by Spark
SW-1074
Sparkling Water for Spark 2.3 with h2o version 3.16.0.2 inside it
SW-1053
Document predicting using UDF
SW-1050
Test Sparkling Water to EMR(from edge node) deployment using terraform as part of jenkins pipeline
SW-1010
issue 1 of 135

Integration Testing for different variation of Spark

Description

We need to test for:

  • HDP (different version)

  • CDH5.7, CDH5.8

  • EMR

The tests have to

  • be automatic

  • be defined at a single space - see (Jenkins Pipelines)

Environment

None

Status

Assignee

Michal Malohlava

Reporter

Michal Malohlava

Labels

None

Release Priority

None

CustomerVisible

No

testcase 1

None

testcase 2

None

testcase 3

None

h2ostream link

None

Affected Spark version

None

AffectedContact

None

AffectedCustomers

None

AffectedPilots

None

AffectedOpenSource

None

Support Assessment

None

Customer Request Type

None

Support ticket URL

None

End date

None

Baseline start date

None

Baseline end date

None

Task progress

None

Task mode

None

Components

Priority

Blocker