Update README to specify that TPCDS kit needs to be installed on all nodes.

Juliusz Sompolski 2018-02-27 12:06:12 +01:00
parent 31f34beee5
commit 91604a3ab0

@@ -48,6 +48,9 @@ the TPCDS data requires dsdgen built and available on the machines. We have a fo
you will need. The fork includes changes to generate TPCDS data to stdout, so that this library can
pipe them directly to Spark, without intermediate files. Therefore, this library will not work with
the vanilla TPCDS kit.
TPCDS kit needs to be installed on all cluster executor nodes under the same path!
It can be found [here](https://github.com/databricks/tpcds-kit).
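
One way to confirm the "same path on every node" requirement before generating data is to run a small check on each node. The following is only a minimal sketch, not part of this library: the helper names `dsdgen_available` and `missing_nodes` are hypothetical, and it assumes the kit's build produces an executable named `dsdgen` directly inside the directory you pass in.

```python
import os


def dsdgen_available(tools_dir):
    """Return True if an executable file named dsdgen exists in tools_dir.

    Assumption: the binary is called "dsdgen" and sits directly in tools_dir.
    """
    candidate = os.path.join(tools_dir, "dsdgen")
    return os.path.isfile(candidate) and os.access(candidate, os.X_OK)


def missing_nodes(results):
    """Given a {hostname: check_passed} mapping, return the sorted hostnames
    on which the check failed."""
    return sorted(host for host, ok in results.items() if not ok)
```

How the per-node results are collected (for example over SSH, or from a job that runs one task per executor) is left open here; `missing_nodes` only summarizes them so that any node lacking the kit can be fixed before the run.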
```