Qwen Blog

Qwen Blog

Blog

Read the latest Qwen model guides, product updates, and practical walkthroughs.

Qwen 3.5 API: Access Qwen Models via OpenRouter and Alibaba Cloud

Qwen 3.5 API: Access Qwen Models via OpenRouter and Alibaba Cloud

How to access Qwen 3.5 through APIs including OpenRouter and Alibaba Cloud DashScope. Covers API keys, Python and curl examples, pricing, and model IDs for qwen 3.5 api integration.

Apr 3, 2026
QQ-Chat Team
Qwen 3.5 Benchmark Results: How It Compares Across Tasks

Qwen 3.5 Benchmark Results: How It Compares Across Tasks

A breakdown of Qwen 3.5 benchmark results across reasoning, coding, math, and multilingual tasks — with comparisons to GPT-4o, Claude, and Llama.

Apr 3, 2026
QQ-Chat Team
Qwen 3.5 for Coding: Best Models, Tips, and Examples

Qwen 3.5 for Coding: Best Models, Tips, and Examples

Which Qwen 3.5 model is best for coding? A practical guide to code generation, debugging, and IDE workflows with the Qwen 3.5 family.

Apr 3, 2026
QQ-Chat Team
Qwen 3.5 GGUF: Download and Run Quantized Models Locally

Qwen 3.5 GGUF: Download and Run Quantized Models Locally

How to download and run Qwen 3.5 GGUF files for local inference with llama.cpp. Covers quantization levels, where to find GGUF files, setup instructions, and quality vs performance tradeoffs.

Apr 3, 2026
QQ-Chat Team
Qwen 3.5 on Hugging Face: Download, Deploy, and Chat

Qwen 3.5 on Hugging Face: Download, Deploy, and Chat

How to find, download, and run Qwen 3.5 models from Hugging Face. Covers model cards, transformers integration, inference examples, and variant comparison for qwen 3.5 huggingface users.

Apr 3, 2026
QQ-Chat Team
Run Qwen 3.5 Locally: Complete Setup Guide

Run Qwen 3.5 Locally: Complete Setup Guide

Everything you need to run Qwen 3.5 locally: hardware requirements by model size, setup with Ollama, vLLM, llama.cpp, and Transformers, plus performance optimization tips.

Apr 3, 2026
QQ-Chat Team
Qwen 3.5 Uncensored: What You Should Know

Qwen 3.5 Uncensored: What You Should Know

An honest guide to qwen3.5 uncensored models: what uncensored really means, where community fine-tunes come from, safety considerations, and how to approach them responsibly.

Apr 3, 2026
QQ-Chat Team
Fine-Tune Qwen 3.5 with Unsloth: Step-by-Step Guide

Fine-Tune Qwen 3.5 with Unsloth: Step-by-Step Guide

A practical guide to fine-tuning Qwen 3.5 with Unsloth, covering installation, LoRA and QLoRA setup, training configuration, and exporting your fine-tuned model.

Apr 3, 2026
QQ-Chat Team
How to Run Qwen 3.5 with vLLM: Setup Guide

How to Run Qwen 3.5 with vLLM: Setup Guide

A complete guide to running Qwen 3.5 models with vLLM for high-throughput inference. Covers installation, serving, model variants, and performance tuning for vllm qwen3.5 deployments.

Apr 3, 2026
QQ-Chat Team
Qwen 3.5 vs Qwen 3.6: What Changed and Which to Choose

Qwen 3.5 vs Qwen 3.6: What Changed and Which to Choose

A detailed comparison of Qwen 3.5 vs Qwen 3.6 covering key differences, feature upgrades, context window changes, and practical guidance on which version fits your workflow.

Apr 3, 2026
QQ-Chat Team
Qwen3.6-Plus API: How to Access and Integrate Qwen 3.6

Qwen3.6-Plus API: How to Access and Integrate Qwen 3.6

How to use the Qwen3.6-Plus API — endpoints, request format, tool calling, and integration tips for developers building with Qwen 3.6.

Apr 3, 2026
QQ-Chat Team
Qwen3.6-Plus for Coding: When It Beats Qwen3.5-Plus

Qwen3.6-Plus for Coding: When It Beats Qwen3.5-Plus

A practical look at where Qwen3.6-Plus feels better for coding than Qwen3.5-Plus, and where the older model is still enough.

Apr 3, 2026
QQ-Chat Team
Qwen3.6-Plus 1M Context Window: What It Changes in Practice

Qwen3.6-Plus 1M Context Window: What It Changes in Practice

A practical guide to Qwen3.6-Plus's 1M context window, what it helps with, and what long context still does not solve.

Apr 3, 2026
QQ-Chat Team
Qwen3.6-Plus: Features, Use Cases, and How It Compares to Qwen 3.5

Qwen3.6-Plus: Features, Use Cases, and How It Compares to Qwen 3.5

What Qwen3.6-Plus brings to the table — agentic coding, 1M context, multimodal reasoning — and when to pick it over Qwen 3.5 models.

Apr 3, 2026
QQ-Chat Team
Qwen3.5 Ollama: when local runs make sense and when the browser is easier

Qwen3.5 Ollama: when local runs make sense and when the browser is easier

A practical first pass on qwen3.5 ollama: what people usually mean, how to decide between local and hosted use, and which Qwen page to open next.

Apr 2, 2026
QQ-Chat Team