- Upgrade dependencies
- Upgrade dependencies
- Add datapatch
- Fix `read_json` for already opened str/bytes stream handlers
- Add SQL support via SQLAlchemy
This tool was archived but has now been reactivated for use as a plugin in investigraph.
Therefore, all data fetching and storing logic has been dropped from this library; only the core functionality remains: specifying and executing `pandas.DataFrame` transform operations via a YAML specification.
Refer to the `README.md` of the current version to see what this library can (still) do and what it cannot.
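
As a rough illustration of the idea of driving `pandas.DataFrame` transformations from a declarative specification, here is a minimal sketch. The spec is written as a plain dict of the shape a YAML loader would return. The key names (`ops`, `method`, `kwargs`) and the helper `apply_ops` are hypothetical and chosen only for this example; they are not this library's actual schema or API, so check the `README.md` for the real format.

```python
import pandas as pd

# Hypothetical spec, shaped like the result of loading a YAML document.
# The key names ("ops", "method", "kwargs") are illustrative assumptions,
# not this library's actual schema.
SPEC = {
    "ops": [
        {"method": "rename", "kwargs": {"columns": {"Name": "name", "Amount": "amount"}}},
        {"method": "dropna", "kwargs": {"subset": ["amount"]}},
        {"method": "sort_values", "kwargs": {"by": "amount", "ascending": False}},
        {"method": "reset_index", "kwargs": {"drop": True}},
    ]
}


def apply_ops(df: pd.DataFrame, spec: dict) -> pd.DataFrame:
    """Apply each operation in order by calling the named DataFrame method."""
    for op in spec.get("ops", []):
        method = getattr(df, op["method"])  # look up e.g. df.rename
        df = method(**op.get("kwargs", {}))  # each call returns a new DataFrame
    return df


raw = pd.DataFrame({"Name": ["a", "b", "c"], "Amount": [1.0, None, 3.0]})
result = apply_ops(raw, SPEC)
print(result)
```

Each step returns a new DataFrame, so the input frame `raw` is left unchanged and the operations compose in the order they are listed.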
These changes survived in the archived master branch
- Add `url_replace` feature for dynamic URL rewriting
- Add option to resample paginated sources by download date format
- Small bugfixes
- Add pagination by offset logic
- Allow lambda functions as strings in YAML for column transformations and DataFrame operations
- Add option in YAML to set the cache header for Google Cloud Storage blobs
- Small bugfixes
- First release with basic functionality