0.0.6

Pre-release
@rmccorm4 released this 24 Apr 00:53
· 32 commits to main since this release
039b165

What's Changed

  • GPT Engine Builder by @fpetrini15 in #24
  • Modularize TRT LLM Builders by @fpetrini15 in #26
  • Add --backend support to bench command and default to custom image by @rmccorm4 in #27
  • Fix model infer on TRT LLM with negative ints, and minor cleanup by @rmccorm4 in #28
  • Fix profile subcommand to account for offline (non-streaming) metrics and V1 batching by @rmccorm4 in #29
  • Minor Repo Optimizations by @fpetrini15 in #30
  • Bring back IFB default to TRT LLM models and bump to 24.01 by @rmccorm4 in #31
  • Bump cli version to 0.0.3, bump trtllm version to 0.7.1, and bump vllm version to 0.3.0 by @rmccorm4 in #32
  • Give GPT2 quicker build/load settings for demos, fix Dockerfile version syntax, bump CLI version to 0.0.4 by @rmccorm4 in #33
  • Add note on MPI dependencies by @rmccorm4 in #34
  • Add CLI subcommand tests to CI by @krishung5 in #35
  • Bump to v0.0.5 - CI testing working for 24.01 by @rmccorm4 in #38
  • Add extra tests for CLI by @krishung5 in #36
  • CLI TRT LLM v0.8.0 Refresh by @fpetrini15 in #37
  • Bump to v0.0.6 - CI testing working for 24.02 by @fpetrini15 in #39
  • Flatten CLI Args by @fpetrini15 in #40
  • Update README commands by @rmccorm4 in #42
  • Enable CLI Concurrent Testing by @fpetrini15 in #41
  • README Restructuring by @fpetrini15 in #43
  • Address some documentation issues by @rmccorm4 in #50

Full Changelog: 0.0.2...0.0.6