Gemini 3 is Google’s latest AI model, offering improvements in reasoning, coding, and multimodal analysis. New features include the Gemini Agent tool and generative interfaces, such as visual layout ...
Anthropic releases Claude Opus 4.1, advancing AI performance in coding and reasoning. Available for paid users via API, Amazon Bedrock, and Google Cloud's Vertex AI. Anthropic has launched Claude Opus ...
OpenAI and Google LLC today disclosed that their latest reasoning models achieved gold-level performance in a recent coding competition. The ICPC, as the event is called, is the world’s most ...
According to Greg Brockman (@gdb) on Twitter, the Codex CLI now features high reasoning capabilities designed to maximize developer productivity (source: Greg Brockman, Twitter, Aug 24, 2025). This ...
A monthly overview of things you need to know as an architect or aspiring architect. Unlock the full InfoQ experience by logging in! Stay updated with your favorite authors and topics, engage with ...
Grok 4 and its reasoning-focused counterpart, Grok 4 Heavy, arrived with an immediate sense of ambition, offering multimodal AI designed to handle coding, logic, and perception tasks. In the initial ...
RE-IMAGINE synthesizes new reasoning benchmarks by (1) symbolically mutating the solution processes from existing benchmarks, and (2) asking language models to imagine what would happen if the ...
Phi-4-reasoning is a 14-billion parameter model specialized in complex reasoning tasks. It is trained using supervised finetuning (SFT) on diverse prompts and reasoning demonstrations from o3-mini.