From 91604a3ab02d92d38f444f65a8bc2d7f1b9789a0 Mon Sep 17 00:00:00 2001 From: Juliusz Sompolski Date: Tue, 27 Feb 2018 12:06:12 +0100 Subject: [PATCH] Update README to specify that TPCDS kit needs to be installed on all nodes. --- README.md | 3 +++ 1 file changed, 3 insertions(+) diff --git a/README.md b/README.md index 99792ce..7d241c9 100644 --- a/README.md +++ b/README.md @@ -48,6 +48,9 @@ the TPCDS data requires dsdgen built and available on the machines. We have a fo you will need. The fork includes changes to generate TPCDS data to stdout, so that this library can pipe them directly to Spark, without intermediate files. Therefore, this library will not work with the vanilla TPCDS kit. + +TPCDS kit needs to be installed on all cluster executor nodes under the same path! + It can be found [here](https://github.com/databricks/tpcds-kit). ```