An explanation of the difference between conventional OCR and VLM (vision-language model) based OCR. Introduces DeepSeek-OCR and explores the possibility of combining both approaches.
I investigated the source behind the viral claim that a Johns Hopkins study found ChatGPT lies 27% of the time, and it turns out multiple different studies have been mixed together.
jQuery 4.0 was released on January 17, 2026. This is a catch-up for people who drifted away after the painful 1.x to 2.x transition, covering what changed in 3.x and what changed again in 4.0.
An explanation of Shortest, a natural-language E2E testing framework built on Anthropic Claude API and Playwright, from the perspective of a Playwright user. Includes a comparison with Playwright MCP, caveats, and when to use each.
Generalized the scripts from the practice and optimization articles into a reusable framework and published it on GitHub. A walkthrough of how to use it and the design philosophy.
The Web Speech API + Gemini + VOICEVOX setup is complete — an AI character you can actually have a voice conversation with. Key implementation notes and impressions.
Implemented all 12 text processing tools planned in the previous article. Also reorganized the category system and switched the listing UI to a table layout.
I compiled research findings and implementation specifications for adding text-processing tools to lilting.ch/lab. Based on a comparison with DevToys, it highlights the gaps and documents detailed specs for 12 candidate tools.
A comparison of major AI 3D generation tools such as TRELLIS, Hunyuan 3D, Tripo AI, and Hitem3D, with a focus on image requirements for better 3D output.