WAI0731 (creator of WAI-Illustrious) released WAI-Anima v1, a derivative model based on Anima. In the two months since the February Anima article, derivative models have surged, along with a LoRA toolkit and text encoder upgrades. A hands-on comparison of preview3-base and WAI-Anima v1.
Tested 5 approaches including Qwen Image Edit, JS color reduction, and Illustrious i2i + LoRA. Illustrious i2i alone turned out to be the fastest and lightest solution for pixel art conversion.
I tested local Vision LLMs (Gemma 3, Qwen2.5-VL, Llama 3.2 Vision, Gemma 4) to see if they could look at character illustrations and pixel art and generate RPG-style stats in JSON format.
Benchmarking NII's LLM-jp-4-32B-A3B-thinking on EVO-X2 (Ryzen AI Max+ 395) with ROCm. 62.9 t/s vs Qwen3.5-35B-A3B's 44.7 t/s. Covers thinking control issues, KV cache trade-offs, knowledge cutoff, Japanese quality comparisons, code generation tests, and training data composition.
Qwen3.5-35B-A3B is an SSM+Attention hybrid where only 10 of 40 layers use KV cache. Expanding ctx-size from 4096 to 65536 on llama-server added just 800MB VRAM with zero speed loss. Includes q8_0 KV quantization benchmarks and TurboQuant status.
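As a rough sketch of the configuration involved (the model filename is a placeholder; the flags are standard llama.cpp options), the context expansion and q8_0 KV cache quantization described above would look something like:

```shell
# Hypothetical llama-server invocation: expand the context window to 65536
# and quantize both K and V caches to q8_0. Because only 10 of 40 layers
# use KV cache in this hybrid model, the VRAM cost of the larger context
# stays small. Model path is a placeholder, not the exact file tested.
llama-server \
  -m ./Qwen3.5-35B-A3B-Q4_K_M.gguf \
  --ctx-size 65536 \
  --cache-type-k q8_0 \
  --cache-type-v q8_0
```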
Flash-MoE is a C/Metal inference engine that runs Qwen3.5-397B-A17B on a MacBook Pro M3 Max at 4.36 tokens/s. With expert streaming from SSD and hand-written Metal shaders, it fits the 209GB model into a 48GB memory budget.
A three-stage pipeline (BERT perplexity scan → LLM judgment → escalation) packaged as a cross-platform Python tool. The installer automatically downloads llama-server and GGUF models.
Using tori29umai's LoRA to automatically split facial parts: results from batch-processing 28 images, and a log of hitting the limits when attempting finer hair separation.
Set up the CLI version of NDLOCR-Lite on Apple Silicon Mac, then tested OCR result correction with Qwen3.5 and Swallow. Includes experiments with direct image reading and the anchoring effect.
An explanation of why Qwen-Image-Edit's VAE is so heavy, how HunyuanImage 2.1 chose a 32x high-compression VAE instead, and how Kohya's memory-optimization work fits in.
A comparison of the Nunchaku quantized build, VNCCS Pose Studio, and the official 2511 model improvements to find better ways to control pose and camera angle.