Tech - Page 4 | lilting channel

TechJun 14, 2026updated7 min

WAI-illustrious-HSWQ vs WAI v17 on M1 Max: 28-34s fast mode

M1 Max ComfyUI test of WAI-illustrious-HSWQ v17.0 base vs WAI v17. Fast mode took 28-34s, quality 64s, with Kana LoRA and cafe background checks.

AI Image Generation ComfyUI Stable Diffusion SDXL Illustrious WAI LoRA Apple Silicon Experiment

TechJun 14, 2026updated12 min

ZONOS2 on an 8GB RTX 4060 Laptop (WSL2): it runs, but ~20x slower than realtime

Tested ZONOS2 on an 8GB RTX 4060 Laptop (WSL2): the 15.3GB bf16 weights run via Windows system-memory fallback, a KV-cache override, and the CUDA toolkit at ~1/20 realtime. Plus a Japanese name-accent gotcha with A/B audio.

AI TTS Speech Synthesis ZONOS2 Zyphra HuggingFace Japanese Experiment

TechJun 14, 2026updated7 min

AMIXA Async's MIX ANIMA vs JANIMA on M1 Max: Anima LoRA test

AMIXA Async's MIX ANIMA tested on M1 Max ComfyUI against anima-base-v1.0/JANIMA with Anima LoRA, two-character prompts, Turbo, and native cafe scenes.

AI Image Generation ComfyUI Anima Anima-Base LoRA Illustrious Apple Silicon Experiment

TechJun 13, 2026updated11 min

Claude Fable 5 and Mythos 5 suspended: US export controls and Opus 4.8 fallback

Anthropic disabled Fable 5 and Mythos 5 for all customers on June 12, 2026. US export controls, foreign-national employees, AI Safety gaps, Microsoft limits, and the shaky Opus 4.8 fallback.

Claude Anthropic AI Safety Export Controls AI Agents

TechJun 12, 20268 min

JANIMA vs Hexer Minimal Toon (M1 Max): LoRA fidelity flips per character

Tested on M1 Max ComfyUI: newly free JANIMA vs Hexer Minimal Toon Anima V1 vs anima-base, one character LoRA, same seed. Hexer keeps the outfit; JANIMA adds clothes but draws the quietest backgrounds.

AI Image Generation ComfyUI Anima Anima-Base LoRA Apple Silicon Experiment

TechJun 11, 2026updated6 min

Codex 'Selected model is at capacity': serving capacity, not context length, and the thread resumes on continue

Codex printed 'Selected model is at capacity. Please try a different model.' mid-task. It's model-side serving capacity, not context length, and the same thread resumed after a continue prompt — with the OpenAI maintainer's explanation (issue #17014) and which related issues stay open.

OpenAI Codex Troubleshooting AI Agents CLI

TechJun 11, 202610 min

Microsoft 73-repo Miasma: AI agent startup, not npm install

GitHub disabled 73 Microsoft repos after an Azure/durabletask commit. Miasma used Claude Code, Gemini CLI, Cursor, and VS Code config, not npm install.

Security npm Supply Chain Malware Microsoft AI Agents

TechJun 10, 20263 min

LiteLLM CVE-2026-42271: MCP stdio test RCE, CISA KEV

LiteLLM 1.74.2-1.83.6 command execution via MCP stdio test endpoints is in CISA KEV. Patch to 1.83.7+ and Starlette 1.0.1+; BadHost can remove auth.

Security CVE RCE CISA MCP LLM

TechJun 10, 20268 min

Claude Fable 5 vs Opus 4.8, Sonnet 4.6, and Codex: tiny blog benchmark

Claude Code CLI vs Codex CLI on a 7-test static-blog fixture: runtimes, estimated Claude Code cost, and the semantic diff Codex used to pass.

Claude Codex AI Agents Benchmark Experiment

TechJun 9, 202610 min

Laxhar's SenseNova U1 LoRA trainer: bf16 on 32GB GPU, ~20GB peak VRAM

Laxhar's U1 trainer needs 32GB+ GPU, bf16 only — 4bit broke gen tower. Prefix offload keeps ~20GB peak. 8-step LoRA stack, A3B MoE compat, official training code gap.

AI Image Generation LoRA HuggingFace MoE

TechJun 9, 202611 min

AFM 3: 20B sparse on-device, Cloud Pro on Google Cloud, five model tiers

AFM 3 splits into 20B on-device sparse (NAND-to-DRAM weight loading) and Cloud Pro on Google Cloud NVIDIA GPU. Three Google contexts, Foundation Models API opening, and what's still unreleased.

AI LLM Apple Silicon Google Edge AI

TechJun 8, 202617 min

LFM2.5 1.2B JP on M1 Max 64GB: 208 tok/s decode, JSON OK, name hallucinated

Tested LFM2.5-1.2B-JP-202606 on M1 Max 64GB. llama.cpp Q4_K_M: 208 tok/s decode, JSON intact, model name hallucinated (LFM→FDM). Q8_0: 157 tok/s, no hallucination. Tool calls broken via GGUF.

AI LLM Local LLM MLX Ollama Apple Silicon Edge AI Experiment Japanese LLM