The three-stage pipeline of BERT perplexity scan → LLM judgment → escalation packaged as a cross-platform Python tool. The installer automatically downloads llama-server and GGUF models.
Using tori29umai’s LoRA to automatically split facial parts, results from batching 28 images, and a log of running into the limits when attempting finer hair separation
Set up the CLI version of NDLOCR-Lite on Apple Silicon Mac, then tested OCR result correction with Qwen 3.5 and Swallow. Includes experiments with direct image reading and the anchoring effect.
An explanation of why Qwen-Image-Edit's VAE is so heavy, how HunyuanImage 2.1 chose a 32x high-compression VAE instead, and how Kohya's memory-optimization work fits in.
A comparison of the Nunchaku quantized build, VNCCS Pose Studio, and the official 2511 model improvements to find better ways to control pose and camera angle.
I investigated VNCCS, a character-sprite generation suite for visual novels, and its QWEN Detailer utility. Can it help generate side twin-tails more reliably?
Configuration for running a Qwen-Image-Layered LoRA that automatically separates facial parts on RunPod. Comparison of RTX 6000 Ada (48GB) and RTX PRO 6000 (96GB).
Hands-on RunPod log for Phr00t AIO Qwen-Image-Edit NSFW v18.1 (28GB). RTX 4090 24GB froze loading and --lowvram didn't help; the FP8 split version needed separate VAE/text encoder. RTX 5090 32GB worked end-to-end. Used it for 3-view reference sheets for a 3D model base mesh.
Setup notes for Qwen-Image-Edit-2511 on RunPod's RTX 4090 ($0.34/hr) using the ComfyUI template. Includes the fal Multiple-Angles LoRA (4 elevations × 8 azimuths × 3 distances) and a per-image cost breakdown that ends up cheaper than buying a 4090.