Tech | Apr 3, 2026 | 8 min
Running Lemonade on Strix Halo (EVO-X2): Vulkan Shared Memory Leaks and ROCm Stability
Real-world testing of AMD Lemonade v10.0.1 on Ryzen AI Max+ 395. LLM, image generation, speech recognition, and TTS running simultaneously, NPU Hybrid execution, Vulkan vs ROCm benchmarks, and discovering shared memory leaks.
AMD Local LLM Vulkan ROCm NPU llama.cpp GPU Inference Optimization Benchmark Experiment

Tech | Apr 3, 2026 | 8 min
AMD's Lemonade Local AI Server Bundles GPU, NPU, and Multi-Modal Inference Under One Roof
Lemonade is AMD's open-source local AI server that manages multiple backends like llama.cpp and FastFlowLM across GPU/NPU/CPU, serving text, image, and audio generation through an OpenAI-compatible API.
AMD Local LLM NPU GPU llama.cpp Inference Optimization ROCm Vulkan