Skip to content

MapReduce implementations using PySpark. Part of the course Massively Parallel Algorithms (2IMA35) at TU/e.

Notifications You must be signed in to change notification settings

jeroenvanriel/parallel-algos

Repository files navigation

Docker image

Based on pyspark-notebook from Jupyter Docker Stacks, also see options for PySpark.

Build and tag the image using docker build --rm -t jupyter/my-pyspark-notebook .

Then run it using docker run -p 8888:8888 -p 4040:4040 -v "$PWD":/home/jovyan/work jupyter/my-pyspark-notebook

The Web UI is running on port 4040 by default, but each new spark context that is created is put onto an incrementing port. Therefore, it may be helpful to map some more ports from the start by using -p 4040-4050:4040-4050 when starting the container.

Helpful resources

About

MapReduce implementations using PySpark. Part of the course Massively Parallel Algorithms (2IMA35) at TU/e.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published