0.0.6
Pre-release
What's Changed
- GPT Engine Builder by @fpetrini15 in #24
- Modularize TRT LLM Builders by @fpetrini15 in #26
- Add --backend support to bench command and default to custom image by @rmccorm4 in #27
- Fix model infer on TRT LLM with negative ints, and minor cleanup by @rmccorm4 in #28
- Fix profile subcommand to account for offline (non-streaming) metrics and V1 batching by @rmccorm4 in #29
- Minor Repo Optimizations by @fpetrini15 in #30
- Bring back IFB default to TRT LLM models and bump to 24.01 by @rmccorm4 in #31
- Bump cli version to 0.0.3, bump trtllm version to 0.7.1, and bump vllm version to 0.3.0 by @rmccorm4 in #32
- Give GPT2 quicker build/load settings for demos, fix Dockerfile version syntax, bump CLI version to 0.0.4 by @rmccorm4 in #33
- Add note on MPI dependencies by @rmccorm4 in #34
- Add CLI subcommand tests to CI by @krishung5 in #35
- Bump to v0.0.5 - CI testing working for 24.01 by @rmccorm4 in #38
- Add extra tests for CLI by @krishung5 in #36
- CLI TRT LLM v0.8.0 Refresh by @fpetrini15 in #37
- Bump to v0.0.6 - CI testing working for 24.02 by @fpetrini15 in #39
- Flatten CLI Args by @fpetrini15 in #40
- Update README commands by @rmccorm4 in #42
- Enable CLI Concurrent Testing by @fpetrini15 in #41
- README Restructuring by @fpetrini15 in #43
- Address some documentation issues by @rmccorm4 in #50
New Contributors
- @krishung5 made their first contribution in #35
Full Changelog: 0.0.2...0.0.6