OpenAI drops GPT-5.5 with a focus on agentic reasoning and efficiency. While API access is delayed, a ‘backdoor’ via the open-source Codex CLI is officially supported.
Read MoreTag: benchmarks
Google Releases Gemini Deep Research Max with Arbitrary MCP Support
Google launches Deep Research Max via the Interactions API, featuring Model Context Protocol support, native visualizations, and SOTA scores on Humanity’s Last Exam.
Read MoreQwen 3.6-35B-A3B: The 3B-Active MoE for Agentic Coding
Alibaba’s Qwen 3.6-35B-A3B is a sparse MoE powerhouse with 3B active parameters, a 1M token context, and a new ‘thinking preservation’ mode for complex agentic workflows.
Read MoreClaude Opus 4.7: 1M Context and the 128K Output Frontier
Anthropic drops Opus 4.7 with a massive 1M token context window, 80.9% SWE-bench score, and a new tokenizer that boosts efficiency by 35%.
Read More