[Question] #3059

Open
alexdsh opened this issue Dec 6, 2024 · 2 comments
Labels
question Question about the usage

Comments

@alexdsh

alexdsh commented Dec 6, 2024

❓ General Questions

Please add the ability to load models other than the default ones, selected from local storage. Also, is it possible to somehow limit the GPU load to, say, 90%? When a model is running, the phone freezes completely and even the interface stops updating (I usually just get a blank white screen).

alexdsh added the question label on Dec 6, 2024
@ereish64

I don't think CPU offloading is available at the moment (someone please correct me if I'm wrong), but you can compile the model with quantization so that it takes less memory (and processing power), if you haven't already. Try q4f16, i.e. 4-bit weights with float16.
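
For reference, a minimal sketch of converting a model with 4-bit quantization using the MLC LLM CLI. This is an assumption-laden example: the quantization code (`q4f16_1`), the model paths, and the conversation template name are placeholders based on typical MLC LLM usage, not taken from this thread, so check the MLC LLM docs for the exact options your installed version supports.

```bash
# Convert the weights with 4-bit / float16 quantization (q4f16_1).
# Paths, the quantization code, and the conv template are assumptions;
# verify them against the MLC LLM documentation for your version.
mlc_llm convert_weight ./dist/models/gemma-2-2b-it \
    --quantization q4f16_1 \
    -o ./dist/gemma-2-2b-it-q4f16_1-MLC

# Generate the chat config for the quantized model.
mlc_llm gen_config ./dist/models/gemma-2-2b-it \
    --quantization q4f16_1 \
    --conv-template gemma_instruction \
    -o ./dist/gemma-2-2b-it-q4f16_1-MLC
```

In general, a smaller or more aggressively quantized model reduces both memory use and per-token GPU work, which may also ease the UI freezes described above.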

@alexdsh
Author

alexdsh commented Dec 16, 2024

So it's not the CPU that's overloaded, but the GPU. As for the model, I use gemma2-2B q4fp16.mlc, which is already quantized as far as it goes. I also tried gemma2-7B-int1.gguf, though in another app (Layla) where everything runs on the CPU, without the GPU. Layla has an interesting "memory mapping" feature that intelligently loads model segments from swap when physical memory is low, but unfortunately the model behaves strangely there and writes outright nonsense.

So MLC Chat suits me better, but unfortunately it only lasts for one, at most two, question-answer exchanges before the app closes when it runs out of memory; neither 4 GB of zram nor 4 GB of swap on a flash drive helps. If you could implement memory handling like Layla's, add the ability to choose your own model, and fix the GPU usage so the screen doesn't freeze, this would be the best app for running models locally. Thanks!
