[ML-3581] Add benchmarks to mllib-large.yaml for regression (#150)

Benchmark for regression is added to mllib-large.yaml.
DecisionTreeRegression, GLMRegression, LinearRegression, and RandomForestRegression are added.

GBT, AFTSurvivalRegression, and IsotonicRegression are missing in spark-sql-perf.
This commit is contained in:
ludatabricks 2018-06-12 10:32:02 -07:00 committed by Joseph Bradley
parent 9ab2a8bb14
commit 6a45dc8a2d

View File

@ -67,4 +67,26 @@ benchmarks:
numItems: 6000000
regParam: 0.01
rank: 10
maxIter: 10
maxIter: 10
- name: regression.DecisionTreeRegression
params:
depth: [5, 10]
- name: regression.GLMRegression
params:
numExamples: 500000
numTestExamples: 500000
numFeatures: 1000
link: log
family: gaussian
tol: 0.0
maxIter: 10
regParam: 0.1
- name: regression.LinearRegression
params:
regParam: 0.01
tol: 0.0
maxIter: 20
- name: regression.RandomForestRegression
params:
depth: 10
maxIter: 4