-
Notifications
You must be signed in to change notification settings - Fork 101
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Better mimic DocumentDataset's read_*
functions to Dask's read_*
functions
#50
Comments
I believe the reason we have a custom read_json implementation is the ability to specify |
From #46: Is there a reason you didn't use Yeah, the I would be interested in that discussion as well. My intuition is that we should mimic the behavior of Dask as much as possible, but there might be good reasons to deviate. Yeah, I agree that the goal should be to mimic Dask's |
From #130:
" and "
For now, I will link #50 here and merge |
From #77: I'm generally in favor of overhauling the IO helpers in the current setup for something better. When we tackle #50. I'll share more thoughts there, but moving to encouraging users using the Regarding: |
Another TODO: Support for |
Right now,
DocumentDataset
has a couple ofread_*
functions:(1)
(2)
(3)
It would be good if these functions could support Dask's read_json and read_parquet parameters (there is no
read_pickle
function in Dask but we can perhaps look to Pandas for this).In addition to this, we can restructure our
to_*
functions as well.The text was updated successfully, but these errors were encountered: