Replies: 1 comment
-
Yes, this is possible. You can mount your local model from the host machine into the container and then specify the path inside the container when launching the infinity container. Check out the example from this issue:

```bash
cd /tmp
git lfs install
mkdir models && cd models && git clone https://huggingface.co/BAAI/bge-m3 && cd ..
docker run -it -v /tmp/models:/models -p 8081:8081 michaelf34/infinity:0.0.70 v2 --model-id "/models/bge-m3" --served-model-name bge-m3 --port 8081
```
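For the fine-tuned model from the question below, a minimal sketch along the same lines might look like this. The `-cpu` image tag, the `/tmp/models/my-e5-classifier` path, and the served model name are assumptions for illustration; check Docker Hub for the exact tag of the CPU build.

```bash
# Copy your fine-tuned model (based on intfloat/multilingual-e5-large-instruct)
# into a directory on the host, then mount that directory into the container.
mkdir -p /tmp/models/my-e5-classifier            # hypothetical local path
cp -r /path/to/your/finetuned-model/* /tmp/models/my-e5-classifier/

# The "-cpu" tag is an assumption; substitute whichever CPU image tag is published.
docker run -it -v /tmp/models:/models -p 8081:8081 \
  michaelf34/infinity:0.0.70-cpu \
  v2 --model-id "/models/my-e5-classifier" \
  --served-model-name my-e5-classifier --port 8081
```

The key point is that `--model-id` takes the path as seen *inside* the container (`/models/...`), not the host path.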
-
I have a fine-tuned text classification model (based on intfloat/multilingual-e5-large-instruct).
I was thinking about hosting inference with infinity, as I understand this should be possible.
If I run a Docker-based image, e.g. the CPU-specialized one, can I specify a local file path or a custom URL to load the model from?
Thank you! :)