VecLite is a Rust/WASM+SIMD library that accelerates vector search inside the browser. How far can you get with Transformers.js for embeddings, IndexedDB for storage, and no server at all?
Cloudflare's Artifacts is a Git-compatible version control storage designed for AI agents to operate tens of millions of repositories. The Git server runs as a Zig-based WebAssembly binary on Durable Objects, accessible through Workers Bindings, REST API, and standard Git clients.
A three-link chain of mmap → MTLBuffer(bytesNoCopy) → Wasmtime MemoryCreator that makes a Wasm linear memory share the same physical bytes as a Metal GPU buffer. Llama 3.2 1B runs at 9ms/token on M1.
From Docker hell to Lite + LLM correction. A retrospective on my own experimentation, plus an introduction to someone else's browser-based NDLOCR-Lite implementation.