Discovered during self-quantization: The model is mistakenly registered as Qwen3, but it is actually based on Qwen2.5

I encountered a critical compatibility issue while attempting to quantize this model locally.
I found that the model architecture was accidentally registered as Qwen3 inside the configuration file. However, this multimodal model is essentially built on Qwen2.5.
Furthermore, the Transformers version recommended by this project does not yet support the Qwen3 architecture. This mismatch directly prevents model loading and successful INT8 quantization, and always triggers runtime errors.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Discovered during self-quantization: The model is mistakenly registered as Qwen3, but it is actually based on Qwen2.5 #64

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Uh oh!

Discovered during self-quantization: The model is mistakenly registered as Qwen3, but it is actually based on Qwen2.5 #64

Description

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions