Supported Models

Supported Models#

Generative Models#

Model

Support

W8A8

LoRA

Tensor Parallel

Expert Parallel

Data Parallel

Piecewise Kunlun Graph

Qwen3

Qwen3-Moe

Qwen3-Next

Deepseek v3.2

Multimodal Language Models#

Model

Support

W8A8

LoRA

Tensor Parallel

Expert Parallel

Data Parallel

Piecewise Kunlun Graph

Qwen3-VL