#AI Agent

22 articles

Tech · May 2, 2026 · 12 min

Kana Chat v3 and Leaning Into Blog-Specific Use

From v2 to v3 of Kana Chat, an AI agent built around official CLI wrappers. The story of stepping back from the DIY OpenClaw direction and pivoting toward a blog pipeline that quickly drafts the daily flood of AI news and papers.

AI Agent Claude Code Codex OpenClaw Gemini tmux FastAPI Tailscale Custom Tool Experiment

Tech · Apr 24, 2026 · updated · 11 min

DeepSeek V4 Preview specs: V4-Pro 1.6T and V4-Flash 284B open under MIT, 1M context, inference FLOPs at 27% of V3.2

DeepSeek V4 Preview ships V4-Pro (1.6T/49B active) and V4-Flash (284B/13B active) as open weights under MIT, both with 1M context. CSA+HCA hybrid attention, mHC, and the Muon optimizer cut per-token FLOPs at 1M tokens to 27% of V3.2. Day-one API and chat.deepseek.com mode switch covered.

LLM DeepSeek Chinese AI MoE Open Model AI Agent

Tech · Apr 24, 2026 · updated · 14 min

Tencent Hy3-preview (295B) vs Ant Ling-2.6-flash (104B): two open Chinese MoEs released the same week

Two open-weight Chinese MoEs landed within 24 hours: Ant Ling-2.6-flash (104B/7.4B active, 7x token-efficiency claim) and Tencent Hy3-preview (295B/21B active, frontier-tier open weights). Specs, licenses, and how they line up against DeepSeek-V3 and GLM-4.5.

LLM Chinese AI MoE Open Model AI Agent Local LLM OpenRouter

Tech · Apr 13, 2026 · 10 min

I Found an OSS Tool That 'Distills Colleagues into AI' and Looked Into Distilling Myself

colleague.skill, yourself-skill, nuwa-skill and other 'human distillation' OSS tools are exploding in popularity, primarily in China. Seeing a tool that distills colleagues, I wondered 'what if I distilled myself?' and researched how.

AI OSS GitHub Claude Code AI Agent

Tech · Apr 13, 2026 · 6 min

How 8 AI Agent Benchmarks Were Gamed to Near-Perfect Scores Without Solving a Single Task

UC Berkeley's RDI team demonstrated that major benchmarks including SWE-bench and WebArena can be manipulated to near-perfect scores without completing any tasks. They identified 7 vulnerability patterns and released BenchJack, an automated benchmark attack tool.

AI AI Agent Benchmark Security

Tech · Apr 8, 2026 · updated · 8 min

GLM-5.1 (Zhipu, 744B / 40B MoE, MIT): 58.4% SOTA on SWE-Bench Pro, 8h / 6,000+ tool calls without degradation

Zhipu AI's GLM-5.1 is a 744B MoE (40B active, 200K context, MIT) targeting long-horizon agent tasks. Hits 58.4% SOTA on SWE-Bench Pro (edging out GPT-5.4 and Claude Opus 4.6) and sustains performance across 8-hour sessions with 6,000+ tool calls without degradation.

AI LLM Chinese AI MoE Open Model AI Agent

Tech · Mar 31, 2026 · 5 min

Unauthenticated RCE and RCE via XSS in OpenCode, over 220,000 instances exposed

CVE-2026-22812 (CVSS 8.8) and CVE-2026-22813 (CVSS 9.4) were disclosed in the open source AI coding agent "OpenCode". They allow shell command execution through an unauthenticated HTTP server and through XSS in the Markdown renderer. A PoC has been published, and over 220,000 instances are exposed online.

Security CVE OpenCode RCE XSS AI Agent

Tech · Mar 30, 2026 · 6 min

The false report that Claude Code was running `git reset --hard` every 10 minutes

A GitHub issue claimed that Claude Code was destroying uncommitted changes with `git reset --hard origin/main` every ten minutes, but the culprit turned out to be a separate tool the reporter had written.

Claude Code Anthropic Git Bug AI Agent

Tech · Mar 24, 2026 · 7 min

AWS deployment feature added to Claude Code, AI detection added to Code Security on GitHub

AWS released "Agent Plugins for AWS" for Claude Code and Cursor, automating everything from infrastructure design to deployment. On the same day, GitHub added AI vulnerability detection to Code Security to cover Shell, Dockerfile, Terraform, and PHP, which CodeQL does not support.

Claude Code AWS MCP GitHub Copilot Vulnerability Detection CodeQL IaC AI Agent

Tech · Mar 23, 2026 · 11 min

Kana Chat v2 Architecture Changes

Changes from v1 to v2 of Kana Chat, an AI agent built around official CLI wrappers. Covers dual-model router, Heartbeat memory, planner mode, image input, speech transcription, PWA push notifications, and the lessons learned from a month of daily use.

AI Agent Claude Code Codex OpenClaw Gemini tmux FastAPI Tailscale Custom Tool Experiment

Tech · Mar 23, 2026 · 11 min

Severe vulnerabilities in 7% of OpenClaw skills, over 30,000 instances exposed

Composio published a security analysis of OpenClaw. Approximately 7.1% of skills distributed through SkillHub were found to contain critical vulnerabilities, and the more than 30,000 instances exposed to the internet in the early stages are at risk of prompt injection and credential theft.

Security AI Agent OpenClaw Prompt Injection LLM

Tech · Mar 19, 2026 · 16 min

OpenClaw agent billing security using NemoClaw and Stripe MPP

NVIDIA's NemoClaw protects OpenClaw agents with a four-layer sandbox, while Stripe's Machine Payments Protocol enables payments without handing private keys to agents. How can payments be made safely from within the sandbox?

NVIDIA AI Agent Security Sandbox OpenClaw Stripe Payments