An AI system that automatically reviews Linux kernel patches. It detected 53.6% of known bugs that had passed human review. The Rust implementation supports both Gemini and Claude.
CLI-Anything, released by HKUDS at the University of Hong Kong, automatically generates a CLI harness from GUI software source code so AI agents can drive it directly. It passed all 1,720 tests across 16 apps including GIMP, Blender, and LibreOffice.
H Company's Holotron-12B uses a new memory-efficient design to lift the throughput of computer-use AI to 8,900 tokens per second. Unsloth has released the beta of 'Studio,' a browser tool for no-code model fine-tuning.
NVIDIA announced the Vera CPU at GTC 2026. The 88-core custom design delivers twice the efficiency and 50% higher speed than the previous model, and is scheduled to be available in the second half of 2026.
Zhipu AI's GLM-OCR reaches 94.62% on OmniDocBench v1.5 despite using only 0.9B parameters. I dug into its layout parsing, vertical text handling, and math recognition.
An open-source TTS model distilled from the ZipVoice architecture into four inference steps, delivering voice cloning with 1 GB of VRAM at 150x real-time speed. The post also compares it with the other TTS models covered on this blog.
The Chrome DevTools MCP server has added an --autoConnect option that lets coding agents connect directly to an existing browser session. A deep dive into the background of the MCP vs CLI debate, browser operations with OpenClaw, and the risks of delegating authenticated sessions.
An agentic pipeline that combines a ReAct loop with a searcher took 1st place on ViDoRe v3 and 2nd place on BRIGHT. Using the same pipeline for both demonstrated its versatility across multiple domains.
Anthropic has made its 1M-token context window generally available. There is no surcharge for long context, and the per-request limit on images and PDFs has been raised from 100 to 600. It also achieved the best score among frontier models on MRCR v2.
Anthropic's new multi-agent code review feature for Claude Code, plus the design split between subagents and orchestration. Also covers the major frameworks and where Codex fits in.
OpenAI acquired AI security evaluation platform Promptfoo, and Microsoft announced that Anthropic's Claude Cowork would be integrated into Microsoft 365 Copilot. The structure of the enterprise AI market is starting to change.