Skip to content

0.0.8

Compare
Choose a tag to compare
@rmccorm4 rmccorm4 released this 11 Jun 22:49
· 17 commits to main since this release
8f577d3

What's Changed

  • Disable Echo (exclude input text from output text) in TRT-LLM by default by @nnshah1 in #58
  • Enable calls to GenAI-Perf for profile subcommand by @dyastremsky in #52
  • Fix wrong huggingface login command in readme by @matthewkotila in #60
  • Tweak test timeouts to account for testing Llama 2 and Llama 3 models by @rmccorm4 in #61
  • Add GitLab CI trigger in GitHub checks by @nvda-mesharma in #64
  • test: Unit Tests for triton {metrics, config, status} by @KrishnanPrash in #66
  • chore: Upgrade dependencies for 24.05 by @KrishnanPrash in #67
  • refactor: Simplify testing with ScopedTritonServer instead of pytest fixtures by @KrishnanPrash in #68
  • ci: Restrict numpy to version 1.x by @KrishnanPrash in #70
  • refactor: Add TritonCLIException to denote expected vs unexpected errors by @rmccorm4 in #69
  • build: Update CLI version references to 0.0.8 and Triton references to 24.05 by @rmccorm4 in #72

New Contributors

Full Changelog: 0.0.7...0.0.8