Meta AI's HyperAgents performs metacognitive self-correction that optimizes improvement strategies themselves. Self-improvement appears in four non-coding domains, and strategies learned in one domain transfer to another, along with spontaneously acquired persistent memory.
GPT-5.4 Pro became the first model to solve a researcher-level open problem in FrontierMath, a benchmark managed by Epoch AI. Claude Opus 4.6 and Gemini 3.1 Pro later solved it as well.
Sarvam AI released 30B and 105B models trained entirely in India—from pretraining through RL—featuring support for 22 constitutionally recognized Indian languages and inference optimizations.
Andrej Karpathy released Autoresearch, a system where an AI agent autonomously runs machine-learning experiments on a GPU and tries 100 variants overnight. The article breaks down the mechanism and design so even readers with zero ML background can follow.