0.2.12
What's Changed
- Python API to manage the dataset registry by @dreadatour in #29
- cli: hide subcommands from the listing by @skshetry in #79
- datachain: rename include_sys kwarg to sys by @skshetry in #69
- Adding
DataChain.export_files(...)
by @ilongin in #30 - Update cv tutorial:
fashion_product_images
by @mnrozhkov in #62 - Add and clean up docstrings in datachain api by @dberenbaum in #63
- docs: fix invalid python code inside docstrings by @skshetry in #85
- Hide traceback for xfails in Studio test runs by @rlamy in #87
- Rename UDF to UDFStep for clarity, and remove from root namespace by @rlamy in #88
- Fix mutate() by @dmpetrov in #78
- update pytest-servers to 0.5.5 by @mattseddon in #94
- Remove vendored-code-specific folders by @dtulga in #95
- Rename repository references to datachain by @dtulga in #93
- do not overwrite version with None in DatasetQuery constructor by @mattseddon in #92
- always include sys signals by @skshetry in #81
- Add more UniqueId fields by @rlamy in #90
- Added more generalize
SignalsSchema.;get_signals()
method instead ofget_file_signals(...)
by @ilongin in #86 - Added input params to
distinct()
by @ilongin in #96 - Fix for
order_by
with sub signals by @ilongin in #82 - Remove legacy signals in from_storage() by @rlamy in #72
- Updates to examples by @dberenbaum in #77
- More docs updates by @dberenbaum in #100
- Add 'update' param to DataChain.from_storage method by @dreadatour in #99
- Fix repository reference in Notebook by @dtulga in #105
- fix(ux): remove reference to DatasetQuery by @shcheklein in #104
- datachain: implement to_parquet by @skshetry in #97
- File refactor by @dberenbaum in #102
- fixing regressions from switching to ModelStore.add() by @volkfox in #109
- add ModelStore to top level imports by @dmpetrov in #112
- add truncate option to show and update default width of output by @mattseddon in #116
- merge/join: exclude sys signals by @skshetry in #120
- Added
descending
parameter toDataChain.order_by(...)
by @ilongin in #122 - remove get_value() from DataModel by @dmpetrov in #119
- Add file modes for binary/text by @dberenbaum in #107
- remove docstring from DataModel.pydantic__init_subclass by @skshetry in #123
- Examples cleanup by @dberenbaum in #111
- rename ModelStore.add() to register() by @dmpetrov in #113
- datachain: generalize data access functions into collect(), and collect_flatten by @skshetry in #121
- Add nrows for partial parsing of csv/parquet by @dberenbaum in #124
- Update index.md by @volkfox in #128
- Picture for getting started by @volkfox in #127
- moving pic to the right place by @volkfox in #131
- cleanup signal refs in examples by @dberenbaum in #129
- cleanup api reference index by @dberenbaum in #130
- Fix for text and images files export by @ilongin in #135
- update computer vision quick start example by @mattseddon in #136
- update computer vision image example by @mattseddon in #139
- Huggingface test updates and bug fix by @dberenbaum in #140
- Readme update by @dmpetrov in #133
- readme: fix link to image by @dmpetrov in #143
- Update badge by @skshetry in #144
- don't depend on datachain from PATH to exec processes by @skshetry in #118
- dc: try to fix dataset_stats for DataChain.from_storage() generated dataset by @skshetry in #151
New Contributors
- @dreadatour made their first contribution in #29
- @mnrozhkov made their first contribution in #62
Full Changelog: 0.2.11...0.2.12