Supported Models#
Generative Models#
Model |
Support |
W8A8 |
LoRA |
Tensor Parallel |
Expert Parallel |
Data Parallel |
Piecewise Kunlun Graph |
|---|---|---|---|---|---|---|---|
Qwen3 |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
|
Qwen3-Moe |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
Qwen3-Next |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
Deepseek v3.2 |
✅ |
✅ |
✅ |
✅ |
✅ |
Multimodal Language Models#
Model |
Support |
W8A8 |
LoRA |
Tensor Parallel |
Expert Parallel |
Data Parallel |
Piecewise Kunlun Graph |
|---|---|---|---|---|---|---|---|
Qwen3-VL |
✅ |
✅ |
✅ |
✅ |
✅ |