We evaluate DeepCode on the PaperBench benchmark (released by OpenAI), a rigorous testbed requiring AI agents to independently reproduce 20 ICML 2024 papers from scratch. The benchmark comprises 8,316 ...
On February 2nd, 2025, computer scientist and OpenAI co-founder Andrej Karpathy posted a flippant tweet that launched a new phrase into the internet’s collective consciousness. He posted that he’d ...
Microsoft has released a new report showing what people used its AI assistant Copilot for in 2025. The analysis is based on 37.5 million de-identified conversations and shows that in addition to ...
The exhilarating speed of AI-assisted development must be united with a human mind that bridges inspiration and engineering. Without it, vibe coding becomes a fast track to crushing technical debt. If ...
On Tuesday, French AI startup Mistral AI released Devstral 2, a 123-billion-parameter open-weights coding model designed to work as part of an autonomous software engineering agent. The model achieves ...
I’ve written about how coding is so over. AI is getting smarter every day, and it won’t be long before large language models (LLMs) write better code than any human. But why is coding the one thing ...
For the past several months, my social media feed has been flooded with people bragging about spinning up apps and websites over a weekend without any engineering help or coding — with just vibes.
French AI startup Mistral today launched Devstral 2, a new generation of its AI model designed for coding, as the company seeks to catch up to bigger AI labs like Anthropic and other coding-focused ...
Volvo Car AB is looking for partnerships for its new central software stack that’ll run on all of its future electric models, a sign the carmaker has overcome earlier coding glitches that delayed ...
Anthropic is launching Claude Code in Slack, allowing developers to delegate coding tasks directly from chat threads. The beta feature, available Monday as a research preview, builds on Anthropic’s ...
Two malicious extensions on Microsoft's Visual Studio Code Marketplace infect developers' machines with information-stealing malware that can take screenshots, steal credentials, crypto wallets, and ...