You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Thank you for maintaining this package, I recently tried this package and found that future_map has no obvious speed up effects compared to map function in simple linear regression setting. Here I tried a toy example and plot how the running time change along the workers
I also noticed although the workers is set, the htop command in command line interface did not show that number of CPUs are utilized. I am currently not clear about the details of future_map implementation, but the cpu utilization makes me wonder if the slowness is due to I/O bottleneck. If so, this might indicate there are some improvement space (for example, avoid unnecessary file creating/copying/writing)?
Given that future_map's performance is not as satisfying in this attempt, may I ask if you could share some wisdom on the application scenario of future_map where there is a significant speed up?
The text was updated successfully, but these errors were encountered:
wbvguo
changed the title
future_map is consistently slower than map in simple linear regression settingfuture_map not obviously faster than map in simple linear regression setting
Jan 13, 2024
One thing is you're nesting data, which is a documented limitation as mentioned in #234. Aside from that though others have noted in #260 there seems to have been some change in the past year or so that's led to a degradation of furrr's performance. I've been using it for a while with excellent results but in the past year or so I've noticed the same code (on the same test data) running noticeably slower, and at one point even found several gigs of temp files that hadn't been cleaned up.
Dear
furrr
developer,Thank you for maintaining this package, I recently tried this package and found that
future_map
has no obvious speed up effects compared tomap
function in simple linear regression setting. Here I tried a toy example and plot how the running time change along theworkers
benchmark
plots
I also noticed although the
workers
is set, thehtop
command in command line interface did not show that number of CPUs are utilized. I am currently not clear about the details offuture_map
implementation, but the cpu utilization makes me wonder if the slowness is due to I/O bottleneck. If so, this might indicate there are some improvement space (for example, avoid unnecessary file creating/copying/writing)?Given that
future_map
's performance is not as satisfying in this attempt, may I ask if you could share some wisdom on the application scenario offuture_map
where there is a significant speed up?This might be a repeat of issue #41, #234, #252
Thanks!
session info
The text was updated successfully, but these errors were encountered: