Overview of Alibaba’s Z-Image and how it compares to FLUX and Stable Diffusion. A 6B-parameter model that runs on low VRAM and ranks first among open-source models.
A thinking block corruption error crashed my Claude Code Web session and wiped uncommitted files. An investigation into the cause, workarounds, and whether the files can be recovered.
An explanation of a new attack technique that abuses GitHub’s fork feature and commit display behavior to distribute malware via links that look like official repository URLs.
Configuration for running, on RunPod, a Qwen-Image-Layered LoRA that automatically separates facial parts. Includes a comparison of the RTX 6000 Ada (48GB) and RTX PRO 6000 (96GB).
Tried running the NSFW variant of Qwen-Image-Edit (Phr00t AIO) on RunPod to generate 3-view reference sheets for a 3D model base mesh. A log of failures on RTX 4090 and eventual success on RTX 5090.
When building a WebRTC voice call, you can't pass a remote MediaStream to the SpeechRecognition API. Here are three workable approaches — remote-side recognition, server-side processing, and AudioContext — plus iOS-specific implementation strategies.
A comprehensive walkthrough of data structures used in search tasks like dictionary lookup, full-text search, and autocomplete. Summarizes how 10 structures work and when to use them, including Trie, Double-Array Trie, Inverted Index, Suffix Array, B+ tree, and LSM tree.
Generalized the scripts from the practice and optimization articles into a reusable framework and published it on GitHub. A walkthrough of how to use it and of the design philosophy behind it.
The Web Speech API + Gemini + VOICEVOX setup is complete — an AI character you can actually have a voice conversation with. Key implementation notes and impressions.