Posts about quantization
- Running DiffusionGemma on AMD Strix Halo and Decade-Old Tesla P40s
- Running DeepSeek V4 Flash on AMD Strix Halo
- Distilled Reasoning on Strix Halo: Running a Claude-Trained Thinking Model Locally
- Image Editing on 10-Year-Old GPUs: NVIDIA P40 vs AMD Strix Halo
- Discretizing Continuous ML Models: Offline Ballistic Coefficient Corrections via Lookup Table Approximation