#トークン管理

6 articles

Tech May 9, 2026 6 min

Fortress Token Optimizer trims 11% off LLM prompts but risks stripping system prompt constraints

Checked Fortress Token Optimizer's DEV article and npm/PyPI packages. Polite filler words shrink 11-22%, but running it blindly on system prompts or RAG context can strip constraints that control model output.

AI LLM API APIコストトークン管理

Tech May 8, 2026 13 min

Vektor Memory supersession chains: BM25 threshold trap and a minimum schema

Vektor Memory v1.5.4 supersession chains positioned against YourMemory decay, Cloudflare key-overwrite, and CTX, with a BM25 vs cosine threshold trap and a 5-field minimum schema for agent memory.

AI AIエージェント RAG MCP トークン管理 Node.js

Tech May 7, 2026 11 min

Agent memory is just lookup: reading arXiv:2604.27707 with CTX and OCR-Memory in mind

The paper argues that RAG, vector stores, and scratchpads are retrieval, not learning. Read alongside CTX and OCR-Memory, the gap between 'better search' and 'weight-level learning' becomes concrete.

AI AIエージェント RAG トークン管理 AIセーフティ論文

Tech May 6, 2026 9 min

Claude Code context rot starts before the first prompt, not at 45 minutes

Connecting a DEV article on context rot, Anthropic's 1M context guidance, and Chroma's context rot research with earlier CTX and Compresr posts. The places to watch are CLAUDE.md size, tool output accumulation, and information loss around compact—not the model name.

Claude Code AIエージェントトークン管理開発効率化

Tech May 3, 2026 13 min

Adding Working Memory to Claude Code with CTX

A read of CTX, which auto-injects context into Claude Code via the UserPromptSubmit hook. Compared with auto-memory, YourMemory, WUPHF, and Cloudflare Agent Memory on persistence and storage. Also looked at why 1M context still isn't enough and how each agent architecture uses its window differently.

Claude Code AIエージェントトークン管理 RAG OSS

Tech May 2, 2026 14 min

OCR-Memory Lets Agents Recall History as Images

A read of arXiv:2604.26622 OCR-Memory. It renders agent execution history into images, uses Set-of-Mark to let a VLM pick relevant segments, then retrieves verbatim text from the original logs.

AI AIエージェント OCR VLM RAG トークン管理論文