#画像生成

11 articles

TechJul 2, 202610 min

Qwen Max vs Claude vs Codex writing Anima prompts, tested with a 3-character LoRA: 60 images, no same-family advantage

Three LLMs converted the same 10 Japanese scene briefs into Anima (Qwen-DiT) prompts, generated as 60 fixed-seed images on an M1 Max with a merged 3-character LoRA. The Qwen-to-Qwen affinity hypothesis did not survive; a strict formatter brief with character-count locks is what actually moved the results, and two failure modes survive any prompt.

Qwen Anima Claude Codex LLM AI 画像生成 ComfyUI 実験マルチキャラ

TechMay 19, 2026updated9 min

Lance 3B unified multimodal: 40GB VRAM, RunPod costs, and why weights are split

40GB+ VRAM for a 3B model. VBench 85.11 beats dedicated 14B video generators. RunPod GPU costs from $2.2/session. The 'unified' model still ships as two checkpoint files.

AI マルチモーダル画像生成動画生成 VLM オープンソース HuggingFace

TechMay 8, 202611 min

FLUX.2 Klein 9B + NSFW LoRA on M1 Max 64GB via mflux: 1m51s/512, 5m37s/1024 q4

Tested Klein 9B + 9B NSFW LoRA on M1 Max 64GB via mflux 0.17.5: 1m51s/512, 5m37s/1024 q4, 224/224 LoRA keys match, NSFW prompts uncensored, Japanese subjects work with helper tokens.

AI 画像生成 FLUX Apple Silicon Mac MLX LoRA 実験

TechMay 4, 2026updated13 min

FLUX.2 Klein NSFW LoRA on M1 Max: why a 9B LoRA won't load on 4B mflux (variant compatibility map)

Klein 4B / 9B / Base LoRAs aren't cross-compatible — a 9B NSFW LoRA throws 'lora key not loaded' on mflux's 4B path. The variant map, what mflux runs today, and where the working hands-on test lives.

AI 画像生成 FLUX Apple Silicon Mac MLX LoRA 実験

TechMay 4, 202616 min

De-distilling Z-Image-Turbo for LoRA Training

How the de-distill training adapter works for Z-Image-Turbo LoRA learning, SDXL LoRA incompatibility with Z-Image, and caption considerations specific to the Z-Image ecosystem.

AI 画像生成 Z-Image LoRA ComfyUI

TechMay 3, 202610 min

A FastAPI wrapper that takes Japanese, runs it through Ollama, and routes to ComfyUI or mflux to drive Anima, WAI-IL, and FLUX.2 Klein from one WebUI

Three local image generation engines (WAI-Anima, WAI-IL/SDXL, FLUX.2 Klein 4B) tied together by a thin FastAPI wrapper that takes Japanese prompts. Ollama (gemma3:12b) handles JP→EN, ComfyUI workflows are built on the fly in Python, FLUX.2 runs as an mflux subprocess, and the whole thing is reachable from an iPhone over Tailscale.

AI 画像生成 ComfyUI FLUX Apple Silicon Mac Ollama FastAPI Tailscale 実験

TechApr 30, 2026updated12 min

FLUX.2 Klein 4B on M1 Max: 1024px in 30 to 40 seconds with mflux and iris.c, no H100 required

Pruna AI's FP8 speedup needs compute capability 8.9, so Apple Silicon is out. Measured what M1 Max 64GB actually does with MLX-based mflux and antirez's iris.c: install traps, real generation times, and a wrapper kit to skip the setup.

AI 画像生成 FLUX Apple Silicon Mac MLX 実験

TechApr 29, 2026updated15 min

Z-Anime turned out to be an anime-focused full fine-tune of Z-Image

Confirmed SeeSee21/Z-Anime is a full fine-tune of Z-Image Base, then ran the AIO version on local ComfyUI on an M1 Max 64GB to verify t2i, i2i, and how NSFW prompts pass through.

AI 画像生成 Z-Image ComfyUI Apple Silicon 実験

TechApr 29, 202617 min

Converting AI Illustrations to Manga BW with Screentone Instead of Grayscale

A verification log for converting color anime-style AI illustrations to manga-style monochrome. AI re-generation approaches lean to either color leakage or face drift, and pure deterministic local processing looks mechanical. Frames the next directions to try: putting a grayscale-only LoRA on Anima, and using See-through for part decomposition before mechanical composition.

AI 画像生成 ComfyUI LoRA 漫画スクリーントーン実験 Apple Silicon SDXL Qwen Z-Image Anima

TechApr 27, 202618 min

Pushing WAI-Anima Character LoRA Training to the Official 12,000-Step Recommendation Made Direction Control Worse — Half That at ep150 Hit 100%

With v3 captions kept as-is and only the training amount pushed up to Anima's official 12,000+ step recommendation, the direction hit rate went 100% at ep150-180, crashed to 0% at ep200, then partially recovered to 67% at ep227 — a non-monotonic curve. 600-720 exposures per training image is the sweet spot; over 800 triggers catastrophic forgetting. Learning rate 2e-5, ~11 hours / $10 of RunPod training plus a sweet-spot epoch scan.

LoRA AI 画像生成 Anima WAI-Anima RunPod Qwen 実験

TechApr 26, 2026updated20 min

Rewriting WAI-Anima Character LoRA Training Captions with Natural Language and Hairstyle Tags

Records of rewriting captions for the 53 training images for the WAI-Anima character LoRA retrain after side ponytail direction control failed last time. Wrote position information into natural language so Qwen3 TE could pick it up, and dropped the IL-era strategy of absorbing the entire hairstyle into the single 'kanachan' trigger by promoting hairstyle to independent Danbooru tags. Includes notes on year tag necessity, the bikini/nude swapped-caption discovery, and blazer color recognition drift.

LoRA AI 画像生成 Anima WAI-Anima Qwen 実験