max_memory? #10
Your issue probably really is a lack of memory, since the minimum required to run this model (7 billion parameters) is around 12 GB of VRAM (GPU memory). You can try the GPUs on Google Colab, or use a machine with at least 32 GB of RAM and a graphics card with at least 12 GB of VRAM.
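That 12 GB figure lines up with a back-of-the-envelope count of the weights alone. A minimal sketch (the byte widths per parameter are the usual fp16/int8 sizes; everything else here is illustrative):

```python
def model_vram_gib(n_params: float, bytes_per_param: float) -> float:
    """Rough VRAM needed just to hold the model weights, in GiB."""
    return n_params * bytes_per_param / 1024**3

# LLaMA-7B weights alone:
print(round(model_vram_gib(7e9, 2), 1))  # fp16: 13.0 GiB
print(round(model_vram_gib(7e9, 1), 1))  # int8: 6.5 GiB
```

Activations, the CUDA context, and framework overhead come on top of the weights, which is why the practical floor quoted above is higher than the raw int8 number.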
I was having the same problem trying to fine-tune alpaca-lora; to solve it I had to subscribe to Colab Pro to get more powerful GPUs and more RAM. I didn't try to run cabrita-lora.ipynb before that, but try running it on the free tier of Colab; if you get an error, the solution may be the same one I used.
I'm quite new to the LLM world. I'm Brazilian and decided to start with Cabrita. I managed to work through several errors, but I'm stuck on this one:
"Some modules are dispatched on the CPU or the disk. Make sure you have enough GPU RAM to fit the quantized model. If you have set a value for max_memory you should increase that. To have an idea of the modules that are set on the CPU or RAM you can print model.hf_device_map."
I've tried several solutions, but none work. Is this a memory problem?
I have a 2018 MacBook Pro (i7, 16 GB RAM).