Support Cosyvoice2-0.5B By allowing Qwen2 architecture to have a optional bias tensor #14711

tempstudio · 2025-07-16T05:00:45Z

Cosyvoice2-0.5B (https://github.com/FunAudioLLM/CosyVoice/blob/main/cosyvoice/vllm/cosyvoice2.py#L93) is a TTS model finetuned on top of the Qwen2-0.5B model, with an extra bias tensor on the decoder head.

This change allows this bias tensor to be loaded for better quality when running Cosyvoice2 in llama.cpp.

qwaqrm added 2 commits July 15, 2025 23:51

support cosyvoice2 over qwen2

54dd8e2

Merge branch 'master' of https://github.com/tempstudio/llama.cpp

1095d56

ggerganov approved these changes Jul 16, 2025

View reviewed changes

ggerganov merged commit b0f0ecc into ggml-org:master Jul 16, 2025
45 of 48 checks passed

Provide feedback