diff --git a/README.md b/README.md index 96389f155d56c949e475e6f63f940ed5dffafc2e..d3aae774d3ae7710be4b82e39d7e1c3375a55412 100644 --- a/README.md +++ b/README.md @@ -58,14 +58,12 @@ To connect to the master and workers with a browser, you need a command of the f ```bash ssh -L 18080:localhost:18080 -L 8080:localhost:8080 kesselheim1@jwb0129i.juwels -J kesselheim1@juwels-booster.fz-juelich.de ``` -Then you can navigate to (http://localhost:8080) to the the output. +Then you can navigate to (http://localhost:8080) to the the output. This requires to enable your ssh key to be available not only from your network connection but also for connections within the supercomputer. Use the from-clause `from=10.0.0.0/8 ...` for that. -Open Questions +### Open Questions - In the Scala Example, is uses all worker instances as expected. The Python Example uses only 2. Why? -ToDos: -- Include a Python Virtual Environment -- Create a Notebook that illustrates how to run the Pi example in Juypter +### ToDos - The history server does not work yet. It crashed with this error message: ``` Exception in thread "main" java.io.FileNotFoundException: Log directory specified does not exist: file:/tmp/spark-events Did you configure the correct one through spark.history.fs.logDirectory?