Releases: iterative/datachain
Releases · iterative/datachain
0.4.0
0.3.20
0.3.19
What's Changed
- Reintroduce and update test_udf_after_limit() by @rlamy in #458
IndexedFile
->ArrowRow
by @dberenbaum in #445- assert each example has some output instead of stdout and stderr by @mattseddon in #468
- query: remove compat for executing last query expression by @skshetry in #449
- Introduce DatasetVersionNotFoundError in errors by @amritghimire in #461
- use official github action for uv and
uv build
by @skshetry in #470
Full Changelog: 0.3.18...0.3.19
0.3.18
What's Changed
- Remove obsolete UDF code by @rlamy in #452
- added embeddings/gen example by @tibor-mach in #362
- update pytest-servers to 0.5.7 by @mattseddon in #454
- Introduce telemetry in datachain by @amritghimire in #411
- Replace
UniqueId
withFile
by @rlamy in #450 - Auto load json cols by @dberenbaum in #444
New Contributors
- @tibor-mach made their first contribution in #362
Full Changelog: 0.3.17...0.3.18
0.3.17
What's Changed
- [pre-commit.ci] pre-commit autoupdate by @pre-commit-ci in #451
- remove legacy udf decorator by @mattseddon in #438
- Remove storage from dataset query and refactor related codebase by @ilongin in #367
Full Changelog: 0.3.16...0.3.17
0.3.16
What's Changed
- Move 'join' SQL implementation to warehouse by @dreadatour in #409
Full Changelog: 0.3.15...0.3.16
0.3.15
What's Changed
- Add resolve files by @EdwardLi-coder in #313
- unskip test_udf_parallel by @mattseddon in #432
- fix last modified comparison in resolve file test by @mattseddon in #436
- Refactor
Client.parse_url()
by @ilongin in #435 - Set stream for nested file signals by @dberenbaum in #443
- Read arrow files from cache by @dberenbaum in #442
- Auto-detect huggingface datasets when reading tabular data by @dberenbaum in #398
- Add
datachain.lib.tar.process_tar()
generator by @rlamy in #440 - Fix storage dependencies by @ilongin in #421
Full Changelog: 0.3.14...0.3.15
0.3.14
What's Changed
- fix dependency install instructions for examples by @mattseddon in #426
- Show progress bar for pytorch conversion by @dberenbaum in #429
- Fix calculating datasets stats size by @dreadatour in #418
- use the correct fixtures in tests by @mattseddon in #428
- Adding Complex Type Support to Signal Schema by @dtulga in #422
- tests: fix mock for subprocess stdout/stderr to return BytesIO by @skshetry in #431
- prevent tests from hanging on CI (windows) by @mattseddon in #427
- Remove Entry class and use File instead by @rlamy in #419
Full Changelog: 0.3.13...0.3.14
0.3.13
0.3.12
What's Changed
- Fixes settings by @dberenbaum in #397
- fix open file method for tar files by @dberenbaum in #412
- disable execution of last query expression by default by @skshetry in #407
New Contributors
- @yathomasi made their first contribution in #408
Full Changelog: 0.3.11...0.3.12