
ValueError when Loading Qwen2-VL Model with Liger Kernel #249

Open
rahatarinasir opened this issue Sep 16, 2024 · 1 comment

Comments

rahatarinasir commented Sep 16, 2024

🐛 Describe the bug

I'm encountering a ValueError when trying to load the Qwen2-VL model using the AutoLigerKernelForCausalLM class from the Liger Kernel. The error message indicates an unrecognized configuration class for this model.

Reproduce

Clone the repository:
bash

git clone https://github.com/linkedin/Liger-Kernel.git
cd Liger-Kernel
pip install -e .

Install the necessary packages (note the `git+` prefix needed for installing transformers from a commit):
bash

pip install torch
pip install "triton>=2.3.0"
pip install liger-kernel
pip install liger-kernel-nightly
pip install git+https://github.com/huggingface/transformers@21fac7abba2a37fae86106f87fcf9974fd1e3830

Log in to Hugging Face:
python

from huggingface_hub import login

hf_token = input("Enter your Hugging Face token: ")
login(token=hf_token)

Attempt to load the model:
python

from transformers import Qwen2VLConfig, Qwen2VLForConditionalGeneration
from liger_kernel.transformers import AutoLigerKernelForCausalLM

config = Qwen2VLConfig.from_pretrained("Qwen/Qwen2-VL-2B-Instruct")
model = AutoLigerKernelForCausalLM.from_pretrained("Qwen/Qwen2-VL-2B-Instruct", config=config)

Error Message
text

ValueError: Unrecognized configuration class <class 'transformers.models.qwen2_vl.configuration_qwen2_vl.Qwen2VLConfig'> for this kind of AutoModel: AutoLigerKernelForCausalLM.
Model type should be one of [list of model types].

It seems that the AutoLigerKernelForCausalLM class does not recognize Qwen2VLConfig. Is there a workaround, or is an update planned to support this model? Any guidance would be greatly appreciated!

Versions

Environment Report:

Operating System: Linux-6.1.85-x86_64-with-glibc2.35
Python version: 3.10.12
PyTorch version: 2.4.0+cu121
CUDA version: 12.1
Triton version: 3.0.0
Transformers version: 4.45.0.dev0

@tyler-romero
Copy link
Collaborator

tyler-romero commented Sep 16, 2024

Qwen2-VL isn't based on AutoModelForCausalLM, so it can't be loaded with AutoLigerKernelForCausalLM. Instead it's a "ForConditionalGeneration" model, which doesn't have an AutoModel parent class.

As a workaround, I'd recommend calling the monkeypatch function apply_liger_kernel_to_qwen2_vl after from transformers import Qwen2VLForConditionalGeneration.
