— Ch. 1 · The First Public Release —
Qwen.
~3 min read · Ch. 1 of 5
Alibaba Cloud launched a beta version of Qwen in April 2023 under the name Tongyi Qianwen. The system opened for public use in September 2023 after receiving regulatory clearance from Chinese authorities. This initial release marked the beginning of a rapid development cycle that would see multiple major updates within just two years. The model architecture was based on the Llama architecture developed by Meta AI, establishing a foundation for future iterations. In December 2023, Alibaba released its 72B and 1.8B models for download to the public. These early releases included weights but did not provide training code or documented datasets.
Architectural Shifts And Variants
Qwen2 arrived in June 2024 with both dense and sparse model configurations available to developers. By November 2024, the company introduced QwQ-32B-Preview, a specialized reasoning model similar to OpenAI's o1. This variant featured a 32K token context length and outperformed competing systems on certain benchmarks despite releasing only weights without dataset documentation. The Qwen-VL series combined vision transformers with large language models to process visual data alongside text. In January 2025, Qwen2.5-VL expanded this capability with variants containing 3, 7, 32, and 72 billion parameters. The flagship vision model Qwen-VL-Max sold through Alibaba Cloud at US$0.41 per million input tokens as of 2024.