0.0.8
What's Changed
- Disable Echo (exclude input text from output text) in TRT-LLM by default by @nnshah1 in #58
- Enable calls to GenAI-Perf for profile subcommand by @dyastremsky in #52
- Fix wrong huggingface login command in readme by @matthewkotila in #60
- Tweak test timeouts to account for testing Llama 2 and Llama 3 models by @rmccorm4 in #61
- Add GitLab CI trigger in GitHub checks by @nvda-mesharma in #64
- test: Unit Tests for triton {metrics, config, status} by @KrishnanPrash in #66
- chore: Upgrade dependencies for 24.05 by @KrishnanPrash in #67
- refactor: Simplify testing with ScopedTritonServer instead of pytest fixtures by @KrishnanPrash in #68
- ci: Restrict numpy to version 1.x by @KrishnanPrash in #70
- refactor: Add TritonCLIException to denote expected vs unexpected errors by @rmccorm4 in #69
- build: Update CLI version references to 0.0.8 and Triton references to 24.05 by @rmccorm4 in #72
New Contributors
- @nnshah1 made their first contribution in #58
- @nvda-mesharma made their first contribution in #64
- @KrishnanPrash made their first contribution in #66
Full Changelog: 0.0.7...0.0.8