Skip to content

Discovered during self-quantization: The model is mistakenly registered as Qwen3, but it is actually based on Qwen2.5 #64

Description

@dayuan12138

I encountered a critical compatibility issue while attempting to quantize this model locally.
I found that the model architecture was accidentally registered as Qwen3 inside the configuration file. However, this multimodal model is essentially built on Qwen2.5.
Furthermore, the Transformers version recommended by this project does not yet support the Qwen3 architecture. This mismatch directly prevents model loading and successful INT8 quantization, and always triggers runtime errors.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions