Will TensorRT-LLM be available within Triton or will it be a separate server? #6290
Answered by dyastremsky
MatthieuToulemont asked this question in Q&A
Like many, I am pretty stoked about the TensorRT-LLM announcement. I am wondering if this will be accessible from within Triton as a specific backend, or will we need to run a separate process to benefit from TensorRT-LLM?
dyastremsky answered on Sep 13, 2023:
Very happy to hear that, Matthieu! Thanks for sharing. Triton will continue to work as the one solution for all of your AI model inferencing. We are working on a TensorRT-LLM backend that you can easily plug your models into.
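To make the "plug your models into" part concrete, here is a minimal client-side sketch, assuming the eventual TensorRT-LLM backend serves models through Triton's existing HTTP inference API just like any other backend. The model name and tensor names below are hypothetical placeholders, not a confirmed interface:

```python
# A minimal sketch, assuming a TensorRT-LLM backend exposes models through
# Triton's standard inference protocol. "tensorrt_llm_model", "text_input",
# and "text_output" are hypothetical placeholders.
import numpy as np
import tritonclient.http as httpclient

# Connect to a locally running Triton server (default HTTP port).
client = httpclient.InferenceServerClient(url="localhost:8000")

# Triton encodes string inputs as BYTES tensors; wrap the prompt accordingly.
prompt = np.array([["What is TensorRT-LLM?"]], dtype=object)
text_input = httpclient.InferInput("text_input", [1, 1], "BYTES")
text_input.set_data_from_numpy(prompt)

# Request inference from the (hypothetical) TensorRT-LLM-backed model.
result = client.infer(model_name="tensorrt_llm_model", inputs=[text_input])
print(result.as_numpy("text_output"))
```

The point of the sketch is that the request path stays the same as for any other Triton backend; only the model repository entry would point at the new backend.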
Answer selected by dyastremsky