Supported Features
vLLM-KunLun aims to keep its feature support aligned with upstream vLLM. We are also actively collaborating with the community to accelerate support.
You can check the [support status of vLLM V1 Engine][v1_user_guide]. Below is the feature support status of vLLM-KunLun:
Features Supported
| Feature | Status | Note |
|---|---|---|
| Tensor Parallel | 🟢 Functional | |
| Expert Parallel | 🟢 Functional | |
| Graph Mode | 🟢 Functional | |
| Quantization | 🟢 Functional | |
| LoRA | ⚠️ Need Test | Only LLM models |
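
As a rough illustration of how the features listed above are usually switched on, the sketch below maps each one to the corresponding upstream vLLM engine argument (`tensor_parallel_size`, `enable_expert_parallel`, `enforce_eager`, `quantization`, `enable_lora`). It assumes vLLM-KunLun accepts the same arguments as upstream vLLM, which this page does not confirm. The helper `build_engine_kwargs`, the model name, and the `"awq"` value are hypothetical placeholders.

```python
# Sketch only: maps the features in the table to upstream vLLM engine arguments.
# Assumption: vLLM-KunLun accepts the same arguments as upstream vLLM.


def build_engine_kwargs(model, tp_size=1, expert_parallel=False,
                        graph_mode=True, quantization=None, lora=False):
    """Return keyword arguments for vllm.LLM(...) (hypothetical helper)."""
    kwargs = {
        "model": model,
        # Tensor Parallel: number of devices the model weights are sharded across.
        "tensor_parallel_size": tp_size,
        # Expert Parallel: only meaningful for mixture-of-experts models.
        "enable_expert_parallel": expert_parallel,
        # Graph Mode: upstream vLLM uses graph capture unless eager mode is enforced.
        "enforce_eager": not graph_mode,
        # LoRA: marked "Need Test" in the table, so treat it as unverified.
        "enable_lora": lora,
    }
    if quantization is not None:
        # The method name depends on the checkpoint; "awq" below is only an example.
        kwargs["quantization"] = quantization
    return kwargs


if __name__ == "__main__":
    # "your-org/your-model" is a placeholder, not a real checkpoint.
    kwargs = build_engine_kwargs("your-org/your-model", tp_size=2, quantization="awq")
    print(kwargs)
    # With vLLM installed on a supported device, the engine could then be created as:
    #   from vllm import LLM
    #   llm = LLM(**kwargs)
```

Keeping the arguments in a plain dictionary makes it easy to toggle one feature at a time when checking which combinations work on a given device.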