An agentic pipeline combining a ReAct loop with a searcher took 1st place on ViDoRe v3 and 2nd place on BRIGHT, demonstrating cross-domain versatility with a single pipeline.
Anthropic has made its 1M-token context window generally available: no surcharge for long context, a per-request image/PDF limit raised from 100 to 600, and a frontier-model-best score on MRCR v2.
Anthropic's new multi-agent code review feature for Claude Code, plus the design split between subagents and orchestration. Also covers the major frameworks and where Codex fits in.
OpenAI acquired the AI security evaluation platform Promptfoo, and Microsoft announced that Anthropic's Claude Cowork will be integrated into Microsoft 365 Copilot. The structure of the enterprise AI market is starting to shift.
Sarvam AI released 30B and 105B models trained entirely in India—from pretraining through RL—featuring support for 22 constitutionally recognized Indian languages and inference optimizations.
Andrej Karpathy released Autoresearch, a system where an AI agent autonomously runs machine-learning experiments on a GPU and tries 100 variants overnight. The article breaks down the mechanism and design so even readers with zero ML background can follow.
Using Claude, Anthropic found 22 CVEs in Firefox's JS engine, while GitHub Security Lab reported more than 80 vulnerabilities in apps built on the OSS framework Taskflow Agent.
Attempting WAN 2.2 I2V video generation on Windows with an RTX 4060 (8GB VRAM). The 5B fp8 model produced rough quality; the 14B Rapid distilled model with lowvram offloading proved the practical solution.
JPEG-XL revival in Chrome 145 and how to use cjxl, RSA → Elliptic Curve → PQC cryptography transition and Merkle Tree Certificates, WebMCP implementation examples, Chrome zero-day trends, and customizable select elements.
Running LTX-2 and Wan 2.2 on an M1 Max 64GB. FP8 doesn't work on Metal, bypassed with GGUF. Wan 2.2 takes 82 minutes for a 2-second video. LTX-2's official pipeline produces NaN on MPS, and the KSampler fallback doesn't reach usable quality.
All of huihui-ai's abliterated Qwen 3.5 variants produced garbage tokens, and the abliterated GLM-4.7-Flash had a broken chat template. The official model with thinking disabled turned out to be the right answer.