101 Commits

Author SHA1 Message Date
Michael Armbrust
43f7457d03 Add required developer info to pom 2016-01-19 13:03:31 -08:00
Michael Armbrust
24d1b3f6e3 Setting version to 0.3.1-SNAPSHOT 2016-01-19 12:54:00 -08:00
Michael Armbrust
ed246c945f Setting version to 0.3.0 2016-01-19 12:53:06 -08:00
Michael Armbrust
9afabf249a remove sql dependency 2016-01-19 12:52:03 -08:00
Michael Armbrust
d52b4c398c add results to git ignore 2016-01-19 12:39:21 -08:00
Michael Armbrust
663ca7560e Main Class for running Benchmarks from the command line
This PR adds the ability to run performance tests locally as a standalone program that reports the results to the console:

```
$ bin/run --help
spark-sql-perf 0.2.0
Usage: spark-sql-perf [options]

  -b <value> | --benchmark <value>
        the name of the benchmark to run
  -f <value> | --filter <value>
        a filter on the name of the queries to run
  -i <value> | --iterations <value>
        the number of iterations to run
  --help
        prints this usage text

$ bin/run --benchmark DatasetPerformance
```
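
The option handling in the usage text above can be sketched as follows. This is only an illustration in Python, not the project's actual main class: the option names come from the usage text, while the default of 3 iterations, the `required` flag on `--benchmark`, and the substring matching in the hypothetical `select_queries` helper are assumptions.

```python
import argparse
from typing import Iterable, List, Optional


def build_parser() -> argparse.ArgumentParser:
    """Build a parser whose options mirror the usage text (argparse adds --help itself)."""
    parser = argparse.ArgumentParser(prog="spark-sql-perf")
    # Assumption: a benchmark name must always be supplied.
    parser.add_argument("-b", "--benchmark", required=True,
                        help="the name of the benchmark to run")
    parser.add_argument("-f", "--filter", default=None,
                        help="a filter on the name of the queries to run")
    # Assumption: the default iteration count is 3; the source does not state one.
    parser.add_argument("-i", "--iterations", type=int, default=3,
                        help="the number of iterations to run")
    return parser


def select_queries(names: Iterable[str], name_filter: Optional[str]) -> List[str]:
    """Keep queries whose name contains the filter; no filter keeps everything."""
    if name_filter is None:
        return list(names)
    return [name for name in names if name_filter in name]


if __name__ == "__main__":
    args = build_parser().parse_args(["--benchmark", "DatasetPerformance", "-i", "2"])
    print(args.benchmark, args.iterations, args.filter)
```

Keeping the filter step in its own function makes it easy to check which queries a given `--filter` value would select without starting a benchmark run.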

Author: Michael Armbrust <michael@databricks.com>

Closes #47 from marmbrus/MainClass.
2016-01-19 12:37:51 -08:00
Michael Armbrust
5c93fff323 Upgrade to 1.6
Author: Michael Armbrust <michael@databricks.com>

Closes #48 from marmbrus/upgrade.
2016-01-18 09:11:35 -08:00
Davies Liu
cec648ac0f try to run all TPCDS queries in benchmark (even those that can't be parsed) 2016-01-08 15:03:44 -08:00
Davies Liu
3105219fb0 Merge commit '11d1f9dd7237ea2a09ecfa61f09d7623ad52fd47' 2016-01-08 11:29:07 -08:00
Davies Liu
11d1f9dd72 update some queries:
   replace " with `
   fill in some values
2016-01-08 11:27:50 -08:00
Michael Armbrust
9269f8f594 Capture BuildInfo when available
Author: Michael Armbrust <michael@databricks.com>

Closes #45 from marmbrus/buildInfo.
2015-12-23 11:03:06 -08:00
Michael Armbrust
4ba3802f95 Setting version to 0.2.4-SNAPSHOT 2015-12-23 00:11:53 -08:00
Michael Armbrust
61e6bd1897 Setting version to 0.2.3 2015-12-23 00:11:15 -08:00
Michael Armbrust
7825449eef Include publishing to BinTray in release process
After this you should be able to use the library in the shell as follows:

```
bin/spark-shell --packages com.databricks:spark-sql-perf:0.2.3
```

Author: Michael Armbrust <michael@databricks.com>

Closes #46 from marmbrus/publishToMaven.
2015-12-23 00:09:35 -08:00
Michael Armbrust
b2e4896efc Setting version to 0.2.3-SNAPSHOT 2015-12-08 16:07:15 -08:00
Michael Armbrust
c764be3e00 Setting version to 0.2.2 2015-12-08 16:06:53 -08:00
Michael Armbrust
f8aa93d968 Initial set of tests for Datasets
Author: Michael Armbrust <michael@databricks.com>

Closes #42 from marmbrus/dataset-tests.
2015-12-08 16:04:42 -08:00
Michael Armbrust
0aa2569a18 Write only one file per run
Author: Michael Armbrust <michael@databricks.com>

Closes #35 from marmbrus/oneResultFile.
2015-12-08 15:46:20 -08:00
Michael Armbrust
12b7537181 Update databricks plugin
Author: Michael Armbrust <michael@databricks.com>

Closes #43 from marmbrus/updatePlugin.
2015-12-08 15:29:47 -08:00
Yin Huai
3af656defa Make ExecutionMode.HashResults handle null value
In Spark 1.6, `getLong` throws an exception if the value is null; before 1.6, it returned 0. With this PR, we check whether the result is null and, if it is, return null instead of 0.
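
The null check described above can be illustrated with a small Python sketch. The `Row` class here is a hypothetical stand-in for a result row with nullable cells, and `read_hash` is an illustrative name; neither is taken from the project's code.

```python
from typing import Optional


class Row:
    """Hypothetical stand-in for a result row whose cells may be null (None)."""

    def __init__(self, *values: Optional[int]) -> None:
        self._values = values

    def is_null_at(self, index: int) -> bool:
        return self._values[index] is None

    def get_long(self, index: int) -> int:
        # Mimics the stricter behaviour: reading a null cell as a long is an error.
        value = self._values[index]
        if value is None:
            raise ValueError(f"value at index {index} is null")
        return int(value)


def read_hash(row: Row) -> Optional[int]:
    """Return the hash in column 0, or None when the cell is null (never 0)."""
    if row.is_null_at(0):
        return None
    return row.get_long(0)
```

Returning `None` rather than 0 keeps an empty or all-null result distinguishable from a result whose hash genuinely sums to 0.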

Author: Yin Huai <yhuai@databricks.com>

Closes #41 from yhuai/fixSumHash.
2015-12-08 15:28:48 -08:00
Nong Li
43c2f23bb9 Fixes for Q34 and Q73 to return results deterministically.
Author: Nong Li <nong@databricks.com>

Closes #38 from nongli/tpcds.
2015-11-25 15:03:33 -08:00
Nong
70e0dbe656 Add official TPCDS 1.4 queries.
Author: Nong <nong@cloudera.com>

Closes #36 from nongli/tpcds.
2015-11-24 13:12:46 -08:00
Nong Li
1aa5bfc838 Add remaining tpcds tables.
Author: Nong Li <nongli@gmail.com>

Closes #34 from nongli/tpcds.
2015-11-19 13:50:00 -08:00
Andrew Or
e2073129cf Setting version to 0.2.2-SNAPSHOT 2015-11-18 13:17:56 -08:00
Andrew Or
180003e4f9 Setting version to 0.2.1 2015-11-18 13:17:48 -08:00
Nong Li
8d9e8ce9a3 Add another fact table and updates to load a single table at a time.
Author: Nong Li <nongli@gmail.com>

Closes #31 from nongli/more_tables.
2015-11-18 11:12:01 -08:00
Andrew Or
426ae30a2e Increase integration surface area with Spark perf
The changes in this PR are centered around making `Benchmark#runExperiment` accept things other than `Query`s. In particular, in spark-perf we don't always have a DataFrame or an RDD to work with and may want to run arbitrary code (e.g. ALS.train). This PR makes it possible to use the same code in `Benchmark` to do this.
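
One way to picture the idea is a runner that only requires each item to be runnable, so that a query and an arbitrary block of code can share the same timing and reporting path. The Python sketch below is an assumption-laden illustration, not the `Benchmark` API: `Benchmarkable`, `run_experiment`, and the result field names are all hypothetical.

```python
import time
from typing import Callable, Dict, List


class Benchmarkable:
    """Hypothetical unit of work: anything with a name and a zero-argument body."""

    def __init__(self, name: str, body: Callable[[], object]) -> None:
        self.name = name
        self.body = body

    def run(self) -> Dict[str, object]:
        # Time the body with a monotonic clock and report milliseconds.
        start = time.perf_counter()
        self.body()
        elapsed_ms = (time.perf_counter() - start) * 1000.0
        return {"name": self.name, "executionTimeMs": elapsed_ms}


def run_experiment(items: List[Benchmarkable], iterations: int = 1) -> List[Dict[str, object]]:
    """Run every item once per iteration and collect one result record per run."""
    results: List[Dict[str, object]] = []
    for iteration in range(1, iterations + 1):
        for item in items:
            record = item.run()
            record["iteration"] = iteration
            results.append(record)
    return results


if __name__ == "__main__":
    work = [Benchmarkable("sum-range", lambda: sum(range(100_000)))]
    for record in run_experiment(work, iterations=2):
        print(record)
```

Because the runner never inspects the body, code that has no DataFrame or RDD behind it can be measured and reported in the same record format.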

I tested this on dogfood and it works well there.

Author: Andrew Or <andrew@databricks.com>

Closes #33 from andrewor14/spark-perf.
2015-11-18 10:50:46 -08:00
Michael Armbrust
e516e1e7b3 Use published preview release of 1.6
Author: Michael Armbrust <michael@databricks.com>

Closes #32 from marmbrus/spark16.
2015-11-16 22:46:36 -08:00
Andrew Or
172ae79f8d Introduce small integration point with Spark perf
This allows us to report Spark perf results in the same format as SQL benchmark results. marmbrus

Author: Andrew Or <andrew@databricks.com>

Closes #30 from andrewor14/spark-perf.
2015-11-16 17:46:53 -08:00
Michael Armbrust
ebcd5db414 Setting version to 0.2.1-SNAPSHOT 2015-11-13 12:41:59 -08:00
Michael Armbrust
e547490b7c Setting version to 0.2.0 2015-11-13 12:41:51 -08:00
Michael Armbrust
344b31ed69 Update to Spark 1.6
Some internal interfaces changed, so we need to bump the Spark version to run tests on Spark 1.6.

Author: Michael Armbrust <michael@databricks.com>

Closes #29 from marmbrus/spark16.
2015-11-13 12:40:00 -08:00
Cheng Lian
50808c436b Fixes typos in README.md
Author: Cheng Lian <lian@databricks.com>

Closes #25 from liancheng/readme-fix.
2015-11-11 12:05:44 -08:00
Nong Li
dc48f2e49b Support generating the data as "text".
This previously failed since text only supports a single column. Having the option of
text output is useful to quickly see what the generator is doing.
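
Since a text sink holds a single string column, a generated row has to be collapsed into one delimited string before it can be written. The Python sketch below shows that step only; the `|` delimiter, the empty string for nulls, and the name `to_text_line` are assumptions rather than details from this change.

```python
from typing import Optional, Sequence


def to_text_line(row: Sequence[Optional[object]], delimiter: str = "|") -> str:
    """Collapse a multi-column row into one delimited string; nulls become empty fields."""
    return delimiter.join("" if value is None else str(value) for value in row)


if __name__ == "__main__":
    print(to_text_line((1, "AAAAAAAABAAAAAAA", None, 2450815)))
```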

Author: Nong Li <nongli@gmail.com>

Closes #27 from nongli/text.
2015-11-11 12:05:14 -08:00
bit1129
f63d40ce9f Add 2 queries
Author: bit1129 <bit1129@gmail.com>

Closes #22 from bit1129/master.
2015-09-16 10:10:20 -07:00
Michael Armbrust
40d085f1c7 Add dashboard notebook
Author: Michael Armbrust <michael@databricks.com>

Closes #21 from marmbrus/master.
2015-09-11 17:46:07 -07:00
Michael Armbrust
a30dcd3cb7 Merge pull request #20 from marmbrus/releases
Add a plugin for making releases to Databricks
2015-09-11 12:25:16 -07:00
Michael Armbrust
8b441c1ee2 Update build.sbt 2015-09-11 12:16:55 -07:00
Michael Armbrust
482aac583a env in travis 2015-09-09 22:47:40 -07:00
Michael Armbrust
9d8ca399db env in travis 2015-09-09 22:45:31 -07:00
Michael Armbrust
241d45817c Setting version to 0.1.3-SNAPSHOT 2015-09-09 22:34:56 -07:00
Michael Armbrust
9c196d0be5 Setting version to 0.1.2 2015-09-09 22:33:46 -07:00
Michael Armbrust
a4458af0be version 2015-09-09 22:32:46 -07:00
Michael Armbrust
479e4081c2 Add a release process for pushing to DBC 2015-09-09 22:32:31 -07:00
Michael Armbrust
f03b3af719 Fail gracefully when invalid CPU logs are encountered
Author: Michael Armbrust <michael@databricks.com>

Closes #18 from marmbrus/parseCpuFail.
2015-09-09 22:02:23 -07:00
Michael Armbrust
e2dc749480 Add more tests for join performance
Author: Michael Armbrust <michael@databricks.com>

Closes #17 from marmbrus/joinPerf.
2015-09-09 21:56:47 -07:00
Michael Armbrust
08cb68ca20 Make it easier to write benchmarks in notebooks
Author: Michael Armbrust <michael@databricks.com>

Closes #19 from marmbrus/notebookTests.
2015-09-09 21:49:50 -07:00
Michael Armbrust
ddeead18ce Add compilation testing with travis
There are no tests yet... but this at least tests compilation.

Author: Michael Armbrust <michael@databricks.com>

Closes #15 from marmbrus/travis.
2015-09-09 21:36:26 -07:00
Michael Armbrust
a49938903d Add Merge Script
Shamelessly copied from Spark... still prompts for options that don't make sense (JIRA) but it's better than messy merge trees :)

Author: Michael Armbrust <michael@databricks.com>

Closes #16 from marmbrus/mergeScript.
2015-09-09 20:03:52 -07:00
Yin Huai
88fa2f5af2 Merge pull request #9 from yhuai/genData
Add data generation support for TPC-DS
2015-09-04 15:48:10 -07:00