Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Better DateTime parsing, options to promote NULL int columns to float #4

Open
wants to merge 2 commits into
base: master
Choose a base branch
from

Conversation

behrenhoff
Copy link

@behrenhoff behrenhoff commented Nov 9, 2018

When dealing with data types where no 1:1 mapping to np.type is possible, the code needs to have more possibilites to modify column properties.

In particular, this PR addresses

  • DateTime columns (exported as ISO 8601 string, column index put into parse_csv's parse_dates argument)
  • Dealing with "Integer" & "Long Integer" columns which may be NULL: promotion rules to np.float_ (either all int columns or the ones not marked "NOT NULL")
  • There were no tests at all. This commit introduces a few basic tests for column parsing.

When dealing with data types where no 1:1 mapping to np.type is
possible, the code needs to have more possibilites to modify column
properties.

In particular, this commit addresses
 - DateTime columns (exported as ISO 8601 string, column index put into
 parse_csv's parse_dates argument)
 - Integer & Long Integer columns which may be NULL and promotion rules
 into np.float_ (either all int columns or the ones not markes "NOT
 NULL")
 - There were no tests at all. This commit introduces a few basic tests
 for column parsing.
@DeflateAwning
Copy link

I have reviewed these changes, and approve of them being integrated to the upstream repo

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants