Tech | Apr 3, 2026 | 8 min
Running Lemonade on Strix Halo (EVO-X2): Vulkan Shared Memory Leaks and ROCm Stability
Real-world testing of AMD Lemonade v10.0.1 on Ryzen AI Max+ 395. LLM, image generation, speech recognition, and TTS running simultaneously, NPU Hybrid execution, Vulkan vs ROCm benchmarks, and discovering shared memory leaks.
Tags: AMD, Local LLM, Vulkan, ROCm, NPU, llama.cpp, GPU, Inference Optimization, Benchmark, Experiment

Tech | Apr 3, 2026 | 8 min
AMD's Lemonade Local AI Server Bundles GPU, NPU, and Multi-Modal Inference Under One Roof
Lemonade is AMD's open-source local AI server that manages multiple backends like llama.cpp and FastFlowLM across GPU/NPU/CPU, serving text, image, and audio generation through an OpenAI-compatible API.
Tags: AMD, Local LLM, NPU, GPU, llama.cpp, Inference Optimization, ROCm, Vulkan