
Document elastic cluster scaling down #41

Draft: guillaumeeb wants to merge 2 commits into main

Conversation

guillaumeeb (Member)

Pending some questions for @micafer.

sebastian-luna-valero (Collaborator) left a comment:


My suggestions below.

An additional question: after deploying daskhub, I usually see user-scheduler and image-cleaner pods sitting on worker nodes indefinitely. I guess that's by design, but I wonder whether that will prevent the cluster from shrinking elastically?
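For reference, a quick way to check what is actually sitting on a given worker node (the node name wn2 below is just a placeholder for whatever your nodes are called):

# list every pod currently scheduled on that node, across all namespaces
kubectl get pods --all-namespaces -o wide --field-selector spec.nodeName=wn2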

Thanks!

EGI.md: review suggestions (outdated, resolved)
Co-authored-by: Sebastian Luna-Valero <[email protected]>
guillaumeeb (Member, Author)

Thanks for the review. I need to dig a bit deeper after @micafer's answer:

There are some pods that are created using a K8s type called DaemonSet; in this case there will be one pod deployed on each available node.
CLUES ignores these pods when deciding whether to mark a node as "used", so on nodes 2 and 3 there must be some other pods that CLUES cannot ignore.
So you can try to "pack" the pods onto one node, using the commands "kubectl drain" and "kubectl cordon" to free the nodes (see the sketch below).
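A minimal sketch of what that packing could look like, assuming the nodes to free are called wn2 and wn3 (node names are placeholders, take the real ones from kubectl get nodes):

# stop new pods from being scheduled on the nodes we want to empty
kubectl cordon wn2
kubectl cordon wn3

# evict the remaining non-DaemonSet pods so they get rescheduled on the other node
kubectl drain wn2 --ignore-daemonsets
kubectl drain wn3 --ignore-daemonsets

If a node is needed again later, kubectl uncordon wn2 makes it schedulable once more.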

sebastian-luna-valero (Collaborator)

Do you have the code/notebook with the workload, so I can rerun it on my end and check whether I can help as well?
Thanks!

guillaumeeb (Member, Author)

Sure, I'm just using the notebook from this repo: the import packages part, then jumping straight to the "Setup Dask gateway cluster" section.

Just use a bigger number for the Dask worker memory, and scale a bit more:

# ask for bigger workers and more of them, so extra nodes have to be provisioned
cluster = gateway.new_cluster(worker_memory=8, worker_cores=2)
cluster.scale(18)
