#AI

253 articles

Tech · Feb 3, 2026 · 2 min

TJ_ComfyUI_HashRecorder and the reality of a blockchain-like copyright tool

A look at a ComfyUI copyright protection tool that sounds blockchain-like but is really local-only under the hood.

ComfyUI AI Copyright

Tech · Feb 3, 2026 · 3 min

ACE-Step: an open-source music generation model that runs locally

A look at ACE-Step, the 'Stable Diffusion of music,' covering its architecture, features, installation, and expected performance on Apple Silicon before trying it on an M1 Max.

AI Music Generation Apple Silicon Mac Python Local AI

Tech · Feb 3, 2026 · updated · 2 min

PersonaPlex: NVIDIA’s Real-Time Full‑Duplex Voice Conversational Model

Overview of PersonaPlex‑7B‑v1 released by NVIDIA in January 2026. A Moshi‑based voice dialog model that enables full‑duplex conversation and persona control.

AI Speech Synthesis Speech Recognition NVIDIA Open Source

Tech · Feb 3, 2026 · 3 min

Z-Image-Distilled - a Z-Image derivative that keeps diversity while speeding up inference

An overview of Z-Image-Distilled, the distilled fast-inference variant of Z-Image, including how it compares with FLUX.1 Schnell, how it runs on an M1 Max 64GB machine, and LoRA compatibility.

AI Image Generation Z-Image Apple Silicon LoRA

Tech · Feb 3, 2026 · updated · 4 min

Running FLUX.2 Klein 9B on Apple Silicon Macs

Overview of Black Forest Labs' FLUX.2 Klein 9B model and how it performs on M1/M2/M3/M4 Macs. Covers the key factors behind the CUDA vs MPS performance gap, including memory bandwidth and FP8 quantization.

AI Image Generation FLUX Apple Silicon Mac

Tech · Feb 3, 2026 · 2 min

Agent Lightning: Microsoft's reinforcement learning framework for AI agents

Microsoft released an open-source framework that can optimize almost any AI agent with reinforcement learning, with little to no code changes. It supports arbitrary frameworks such as LangChain, AutoGen, and Claude Agent SDK.

AI AI Agent Reinforcement Learning Python Microsoft

Tech · Feb 3, 2026 · updated · 3 min

KugelAudio — Open‑Source 7B‑Parameter TTS (ComfyUI‑Compatible)

Text‑to‑Speech covering 24 European languages with voice cloning. An open‑source model that outperformed ElevenLabs in the authors’ A/B tests.

ComfyUI TTS Speech Synthesis AI

Tech · Feb 3, 2026 · updated · 3 min

OpenRouter free models and Free Router tested: rate limits, tool-calling gotchas, and when they actually work

OpenRouter ships :free models and a Free Router endpoint. Tested both for rate limits (50/day → 1,000/day after a $10 top-up), the tool-calling failure on free models, and which workloads they actually fit.

AI LLM OpenRouter

Tech · Feb 2, 2026 · 4 min

Power Sampling: unlocking LLM reasoning without reinforcement learning

A look at how changing the inference-time sampling strategy can improve LLM reasoning performance without retraining the model with RL.

LLM Inference Reinforcement Learning Sampling AI

Tech · Feb 1, 2026 · 4 min

PageIndex - tree RAG with LLM reasoning only, no vector search

I looked into PageIndex, a RAG system that builds hierarchical document trees using only LLM reasoning, without chunking or vector databases. I also consider how it fits with layout detection and OCR pipelines.

AI RAG LLM OCR Python

Tech · Feb 1, 2026 · updated · 7 min

Video Generation AI: January 2026 Update Roundup and Where i2v Stands

This article rounds up the major video-generation AI updates announced in January 2026 and examines whether i2v (image→video) is practical to use, including with models that run locally.

AI Video Generation i2v

Tech · Feb 1, 2026 · updated · 3 min

Stop Gemini from auto-generating images you didn't ask for: a Saved info rule that works, and what Google changed

Gemini auto-generates images when you only ask about one or request a text prompt. The Saved info rule that stops it, a conversation-level fix, and where Google's fixes currently stand.

Gemini AI Image Generation