GitHub - jeroenvanriel/parallel-algos: MapReduce implementations using PySpark. Part of the course Massively Parallel Algorithms (2IMA35) at TU/e.

Docker image

The image is based on pyspark-notebook from Jupyter Docker Stacks; see also the options for PySpark.

Build and tag the image using docker build --rm -t jupyter/my-pyspark-notebook .

Then run it using docker run -p 8888:8888 -p 4040:4040 -v "$PWD":/home/jovyan/work jupyter/my-pyspark-notebook

The Spark Web UI runs on port 4040 by default. Each additional Spark context created while that port is still in use binds to the next free port (4041, 4042, and so on). It may therefore be helpful to map a range of ports from the start by passing -p 4040-4050:4040-4050 instead of -p 4040:4040 when starting the container.
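As a rough illustration of the port behaviour described above, the sketch below builds the -p argument needed to cover a given number of concurrent Spark contexts, and looks up the URL the Web UI actually bound to. The helper names ui_port_range and current_ui_url are hypothetical and not part of this repository; the second helper assumes pyspark is available, as it is inside the container.

```python
def ui_port_range(n_contexts, base_port=4040):
    """Return a docker -p value covering the UI ports of n_contexts
    concurrent Spark contexts, assuming ports are assigned consecutively
    starting at base_port (hypothetical helper, for illustration only)."""
    if n_contexts < 1:
        raise ValueError("n_contexts must be at least 1")
    last_port = base_port + n_contexts - 1
    if last_port == base_port:
        return f"{base_port}:{base_port}"
    return f"{base_port}-{last_port}:{base_port}-{last_port}"


def current_ui_url(app_name="ui-port-check"):
    """Return the Web UI URL of the active Spark context.

    pyspark is imported lazily so that the helper above can be used
    on a machine without Spark installed."""
    from pyspark.sql import SparkSession

    spark = SparkSession.builder.appName(app_name).getOrCreate()
    # uiWebUrl reports the address and port the UI was actually started on.
    return spark.sparkContext.uiWebUrl
```

For example, ui_port_range(11) gives the 4040-4050:4040-4050 mapping mentioned above. Calling current_ui_url() in a notebook cell shows which of the mapped ports to open in the browser; note that the host name in the returned URL is the container's, so on the host machine it would typically be replaced by localhost.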

Folders and files (28 commits):

Part 2
figures
.gitignore
Dockerfile
MST_Dense.ipynb
README.md
graphs.ipynb
mst-sparse-graphs.ipynb
requirements.txt
word-count.ipynb
words.txt
