Alibaba’s Qwen 3.6-35B-A3B is a sparse MoE powerhouse with 3B active parameters, a 1M token context, and a new ‘thinking preservation’ mode for complex agentic workflows.
Read MoreMy thoughts..
Alibaba’s Qwen 3.6-35B-A3B is a sparse MoE powerhouse with 3B active parameters, a 1M token context, and a new ‘thinking preservation’ mode for complex agentic workflows.
Read More