You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I think the fair comparison would be future::multisession (aka parallel's SOCK clusters) and maybe future.batchtools::batchtools_local (write to file and launch). mclapply() (aka FORKed processes) is kind of different; with it's own pros and cons.
So I haven't done any profiling what so ever; if there is a big difference between plan(callr) and plan(multisession), I'd expect the overhead comes from the callr framework itself.
Also, the obvious comment is that, when the overhead is greater it would require bigger computational tasks before it's worth parallelizing.
Would it make sense to start callr::r_session's in the background so that at least we do not have to wait for process startup after an idleness period? Maybe workers would set the maximum of parallel run sessions, and minWorkersIdle or similar would be the minimum of process to setup in advance to avoid initialization latency?
I am really happy to see
future
extend to so many forms of parallelism!Is there anything we can do about overhead?
The text was updated successfully, but these errors were encountered: