#AI

253 articles

TechJan 31, 20263 min

Kimi K2.5: A 1-trillion-parameter MoE native multimodal agent model

An overview of Kimi K2.5’s technical highlights from Moonshot AI: a 1T-parameter MoE architecture, the MoonViT vision encoder, Agent Swarm (PARL), benchmark results, and more.

AI LLM Open Source

TechJan 31, 20266 min

Comparing hook systems in AI coding CLIs: Gemini CLI, Claude Code, and Codex CLI

A comparison of the hook features offered by Gemini CLI, Claude Code, and Codex CLI. The differences in design philosophy are more interesting than I expected.

Gemini CLI Claude Code Codex CLI AI CLI

TechJan 30, 20264 min

PaddleOCR-VL-1.5 - document parsing SOTA with only 0.9B parameters

Baidu's PaddleOCR-VL-1.5 reaches 94.5% accuracy on OmniDocBench v1.5 with just 0.9B parameters, surpassing large models such as GPT-4o and Qwen2.5-VL-72B.

AI OCR VLM PaddlePaddle

TechJan 22, 20265 min

Official guides for Claude Code best practices and the Agent SDK

Anthropic published official guides on how to use Claude Code effectively and how to build agents with the Agent SDK. This article summarizes the key points from both.

Claude Code Claude AI LLM

TechJan 20, 202611 min

AI E2E Testing Tools Comparison - Top 10 Picks by Reliability and Speed

A deep dive comparing 10 AI-powered E2E testing and browser automation tools including Shortest, Playwright MCP, Stagehand, Skyvern, and QA Wolf, categorized by use case with focus on reliability, speed, and cost.

AI Testing E2E Playwright CI CD

TechJan 20, 20264 min

The rise of VLM-based OCR - DeepSeek-OCR and the potential of hybrid use

An explanation of the difference between conventional OCR and VLM (vision-language model) based OCR. Introduces DeepSeek-OCR and explores the possibility of combining both approaches.

AI OCR DeepSeek VLM

TechJan 20, 20263 min

I looked into the source behind "ChatGPT lies 27% of the time"

I investigated the source behind the viral claim that a Johns Hopkins study found ChatGPT lies 27% of the time, and it turns out multiple different studies have been mixed together.

AI ChatGPT Research

TechJan 20, 20267 min

Shortest - an AI tool for writing E2E tests in natural language

An explanation of Shortest, a natural-language E2E testing framework built on Anthropic Claude API and Playwright, from the perspective of a Playwright user. Includes a comparison with Playwright MCP, caveats, and when to use each.

AI Testing Playwright E2E

TechJan 19, 20263 min

Released the Claude Code + Codex Auto-Dev Framework on GitHub

Generalized the scripts from the practice and optimization articles into a reusable framework and published it on GitHub. A walkthrough of how to use it and the design philosophy.

Claude Code OpenAI Codex tmux AI Automation Experiment

TechJan 19, 20265 min

Building a Talkable AI Environment (3): We're Finally Talking

The Web Speech API + Gemini + VOICEVOX setup is complete — an AI character you can actually have a voice conversation with. Key implementation notes and impressions.

AI Speech Recognition Speech Synthesis VOICEVOX Gemini Web Speech API SwitchBot Experiment

TechJan 17, 20266 min

Letting Claude Code and Codex Run Overnight in tmux (Optimization)

Design patterns for reducing context usage and API calls in the AI auto-dev loop: blocking waits, read-forbidden files, and session isolation.

Claude Code OpenAI Codex tmux AI Automation Experiment

TechJan 16, 20265 min

AI 3D generation tools in 2026: input specs and best practices

A comparison of major AI 3D generation tools such as TRELLIS, Hunyuan 3D, Tripo AI, and Hitem3D, with a focus on image requirements for better 3D output.

AI 3D Image Generation