Running DeepSeek V4 Flash on AMD Strix Halo
Getting DeepSeek's newest 284B-parameter mixture-of-experts model running locally on an AMD Strix Halo APU with a custom llama.cpp build, and the surprising reasons why it fails entirely on four NVIDIA Tesla P40 GPUs despite having more teraflops on paper.