vllm-kunlun - Home


Accuracy

  • Accuracy
  • Accuracy Report


By the vllm-kunlun team

© Copyright 2025, vllm-kunlun team.